LLM Development Guide
A practical, trust-first workflow for planning, prompting, executing, and reviewing LLM-assisted development.
27 posts tagged “agents”
Agent systems become production systems the moment they can call tools. This tag groups posts on building agents that keep working under load and under adversarial inputs.
Start here:
A practical, trust-first workflow for planning, prompting, executing, and reviewing LLM-assisted development.
An end-to-end example of migrating a procedural runbook to a durable Temporal workflow using reference implementations, phased prompts, and verification.
A production-grade, Temporal-native AI agent orchestration platform in Go: durable multi-agent workflows, governance, MCP integration, and an operational UI.
An end-to-end example of the workflow: plan, prompt docs, work notes, execution loop, and verification to create a new Helm chart from a known-good reference.
How to build and maintain a team prompt library that stays useful: structure, templates, contribution rules, and governance.
Minimal templates for plans, prompts, work notes, and checklists. Copy, adapt, and keep the workflow consistent.
How to make LLM-assisted development work on a team: handoff artifacts, shared prompt libraries, and review discipline.
How to measure whether LLM-assisted development is actually helping: practical metrics, baselines, and lightweight reporting.
Concrete stop rules for LLM-assisted development, plus common pitfalls and a recovery checklist when things go sideways.
A production-first guide to running Model Context Protocol servers safely: threat modeling, input validation, budgets, rate limits, and telemetry, implemented with Go patterns.
Practical data-handling rules for LLM-assisted development: what never to paste, how to sanitize, and how to verify you didn't leak secrets.
Model choice is an engineering decision: match capability to task complexity, upgrade when stuck, and avoid stale vendor claims.
An example-heavy pattern for multi-week LLM-assisted work: phase specifications, implementation prompt documents, and strict execution gates.
How to scale LLM-assisted development from a 1-day task to multi-week work: sub-phasing, parallelization, and repo hygiene.
A repeatable execution loop for LLM-assisted work: implement small units, update notes, verify, and commit (without batching).
How to preserve state across LLM sessions with work notes: decisions, assumptions, open questions, session logs, and commit links.
Turn your plan into reusable prompt docs: phase-aligned prompts with constraints, deliverables, session management, and verification.
How to turn vague work into a phased plan that an LLM can execute safely: goals, constraints, references, verification, and stop rules.
A trust-first, executable loop for LLM-assisted development: plan artifacts, prompt docs, work notes, verification, and commit discipline (with a worked example).
Tracing decisions, tool calls, cost, and side effects across an agent system - without turning logs into a data leak.
Budgets, quotas, and circuit breakers for LLM systems - because runaway cost is just an outage where the failure mode is billing.
Most agent demos assume perfect networks and short runs. Production agents need durable execution: retries, idempotency, replay, and human-in-the-loop - without losing state.
If your eval can't fail when a tool breaks, it's not an eval. Here's how to test tool selection, arguments, side effects, and safety - without flaky 'vibe checks.'
When MCP outgrows local stdio servers, you need a gateway: auth, tenancy, quotas, routing, audit, and safe tool contracts - without killing developer velocity.
Once you have dozens (or hundreds) of tools, 'just include all schemas' stops working. Here's a production-first playbook for tool discovery, ranking, and safe invocation.
Prompt injection is real - but it's not just an LLM problem. Secure agents by securing tools, secrets, egress, and outputs with a zero-trust contract.
Production-grade MCP servers in Go that expose iCloud, Todoist, and Notion as safe, typed tools for LLM agents.
No posts match your search.