Agents

27 posts tagged “agents”

Agent systems become production systems the moment they can call tools. This tag groups posts on building agents that keep working under load and under adversarial inputs.

Start here:

LLM Development Guide

A practical, trust-first workflow for planning, prompting, executing, and reviewing LLM-assisted development.

Feb 16, 2026 1 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 16: Worked Example: Converting an Ansible Playbook to a Go Temporal Workflow

An end-to-end example of migrating a procedural runbook to a durable Temporal workflow using reference implementations, phased prompts, and verification.

Feb 13, 2026 3 min read

llm software-engineering workflow+1

Cruvero - AI Agent Ecosystem Platform

A production-grade, Temporal-native AI agent orchestration platform in Go: durable multi-agent workflows, governance, MCP integration, and an operational UI.

Feb 12, 2026 8 min read

go llm agents+5

LLM Development Guide

Chapter 15: Worked Example: Creating a Helm Chart From a Reference Chart

An end-to-end example of the workflow: plan, prompt docs, work notes, execution loop, and verification to create a new Helm chart from a known-good reference.

Feb 11, 2026 4 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 14: Building a Prompt Library: Governance + Quality Bar

How to build and maintain a team prompt library that stays useful: structure, templates, contribution rules, and governance.

Feb 9, 2026 2 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 13: Templates + Checklists: The Copy/Paste Kit

Minimal templates for plans, prompts, work notes, and checklists. Copy, adapt, and keep the workflow consistent.

Feb 7, 2026 4 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 12: Team Collaboration: Handoffs, Shared Prompts, and Review

How to make LLM-assisted development work on a team: handoff artifacts, shared prompt libraries, and review discipline.

Feb 5, 2026 3 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 11: Measuring Success: Solo + Team Metrics Without Fake Precision

How to measure whether LLM-assisted development is actually helping: practical metrics, baselines, and lightweight reporting.

Feb 3, 2026 2 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 10: Stop Rules + Pitfalls: When to Upgrade, Bail, or Go Manual

Concrete stop rules for LLM-assisted development, plus common pitfalls and a recovery checklist when things go sideways.

Jan 31, 2026 3 min read

llm software-engineering workflow+1

PinnedGo + AI in Production

MCP Servers in Production: Hardening, Backpressure, and Observability (Go)

A production-first guide to running Model Context Protocol servers safely: threat modeling, input validation, budgets, rate limits, and telemetry, implemented with Go patterns.

Jan 31, 2026 11 min read

go mcp llm+4

LLM Development Guide

Chapter 9: Security & Sensitive Data: Sanitize, Don't Paste Secrets

Practical data-handling rules for LLM-assisted development: what never to paste, how to sanitize, and how to verify you didn't leak secrets.

Jan 29, 2026 2 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 8: Choosing the Right Model: Capability Tiers, Not Hype

Model choice is an engineering decision: match capability to task complexity, upgrade when stuck, and avoid stale vendor claims.

Jan 27, 2026 3 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 7: Large Projects with Phase Documents + Implementation Prompts

An example-heavy pattern for multi-week LLM-assisted work: phase specifications, implementation prompt documents, and strict execution gates.

Jan 26, 2026 5 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 6: Scaling the Workflow: Phases, Parallelism, Hygiene

How to scale LLM-assisted development from a 1-day task to multi-week work: sub-phasing, parallelization, and repo hygiene.

Jan 25, 2026 3 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 5: The Execution Loop: Review Discipline + Commit Discipline

A repeatable execution loop for LLM-assisted work: implement small units, update notes, verify, and commit (without batching).

Jan 23, 2026 3 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 4: Work Notes: External Memory + Running Log

How to preserve state across LLM sessions with work notes: decisions, assumptions, open questions, session logs, and commit links.

Jan 21, 2026 3 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 3: Prompt Documents: Prompts That Survive Sessions

Turn your plan into reusable prompt docs: phase-aligned prompts with constraints, deliverables, session management, and verification.

Jan 19, 2026 3 min read

llm software-engineering workflow+1

LLM Development Guide

Chapter 2: Planning: Plan Artifacts, Constraints, Definition of Done

How to turn vague work into a phased plan that an LLM can execute safely: goals, constraints, references, verification, and stop rules.

Jan 17, 2026 3 min read

llm software-engineering workflow+1

PinnedLLM Development Guide

Chapter 1: A Practical Workflow for LLM-Assisted Development That Doesn't Collapse After Day 2

A trust-first, executable loop for LLM-assisted development: plan artifacts, prompt docs, work notes, verification, and commit discipline (with a worked example).

Jan 15, 2026 5 min read

llm software-engineering workflow+1

Production Systems

Agent Observability That Doesn't Lie

Tracing decisions, tool calls, cost, and side effects across an agent system - without turning logs into a data leak.

Dec 20, 2025 5 min read

observability agents mcp+3

Production Systems

Cost Is a Reliability Problem

Budgets, quotas, and circuit breakers for LLM systems - because runaway cost is just an outage where the failure mode is billing.

Dec 13, 2025 5 min read

cost agents llm+3

Production Systems

Durable Agents with Temporal: Retries, Idempotency, and Long-Running State

Most agent demos assume perfect networks and short runs. Production agents need durable execution: retries, idempotency, replay, and human-in-the-loop - without losing state.

Dec 6, 2025 5 min read

temporal agents durable-execution+3

Production Systems

Evals for Tool-Using Agents: Regression Tests Beyond Prompts

If your eval can't fail when a tool breaks, it's not an eval. Here's how to test tool selection, arguments, side effects, and safety - without flaky 'vibe checks.'

Nov 29, 2025 5 min read

evals agents mcp+3

Production Systems

From Stdio to Enterprise: The MCP Gateway Pattern

When MCP outgrows local stdio servers, you need a gateway: auth, tenancy, quotas, routing, audit, and safe tool contracts - without killing developer velocity.

Nov 22, 2025 8 min read

mcp agents go+3

Production Systems

Tool Discovery at Scale: Solving the Million Tool Problem

Once you have dozens (or hundreds) of tools, 'just include all schemas' stops working. Here's a production-first playbook for tool discovery, ranking, and safe invocation.

Nov 15, 2025 6 min read

agents mcp retrieval+3

Production Systems

The Real Security Model for Agents

Prompt injection is real - but it's not just an LLM problem. Secure agents by securing tools, secrets, egress, and outputs with a zero-trust contract.

Oct 18, 2025 5 min read

security agents mcp+3

Go MCP Server Ecosystem

Production-grade MCP servers in Go that expose iCloud, Todoist, and Notion as safe, typed tools for LLM agents.

Sep 1, 2024 5 min read

go mcp llm+3