Definition

A state machine, in the context of an agentic system, is the orchestration contract that governs what the system is allowed to do at every step of execution.

This note covers state machines as the runtime control plane for agents: durability, idempotency, retries, approval gates, tool authority, and recovery.

It does not define the state machine formalism itself — states, events, transitions, guards, terminal states, and the comparison to alternative behavior formalisms (Hierarchical State Machines, Behavior Trees, Rule Engines, PDDL, HTN) are covered in 1.2.1 Finite State Machines (FSM).

Note | Lens | Question answered
1.1.1 | Distributed systems thinking | How is the state machine run safely?
1.2.1 | Planning + decision systems | What is the formal model? Why FSM vs others?

The Orchestration Question

The structural question for any agentic system is:

Who controls the next transition?

Three valid answers:

  1. The workflow controls it. (Deterministic orchestration.)
  2. The LLM proposes it and the workflow validates it. (Bounded LLM control.)
  3. The LLM controls it inside a sandbox. (Autonomous loop.)

Most production agentic systems use answer 2.

Each answer corresponds to one of the patterns below.


Architectural Patterns

Deterministic workflow with LLM as worker

The workflow controls every transition. The LLM performs bounded tasks (classify, extract, summarize, draft) inside predefined states.

Use for:

  1. regulated workflows
  2. support automation
  3. document assistants
  4. approval systems
  5. production operations

This is the safest default.


LLM-controlled transitions

The workflow exposes a finite list of allowed transitions. The LLM selects one.

Prompt contract:

{
  "current_state": "planning",
  "allowed_transitions": [
    "retrieve_context",
    "ask_clarifying_question",
    "request_approval",
    "fail"
  ]
}

Any output outside allowed_transitions is rejected at the event boundary.
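A minimal sketch of this rejection check in TypeScript. The snapshot and proposal shapes follow the prompt contract above; the function name and return shape are illustrative, not a real API:

```typescript
// Reject any LLM-proposed transition not in the snapshot's allow-list.
interface StateSnapshot {
  current_state: string;
  allowed_transitions: string[];
}

interface TransitionProposal {
  transition: string;
}

function validateTransition(
  snapshot: StateSnapshot,
  proposal: TransitionProposal
): { ok: true; event: string } | { ok: false; reason: string } {
  if (!snapshot.allowed_transitions.includes(proposal.transition)) {
    // The proposal never reaches the state machine; it is rejected here.
    return {
      ok: false,
      reason: `"${proposal.transition}" not allowed in ${snapshot.current_state}`,
    };
  }
  return { ok: true, event: proposal.transition };
}
```

A proposal of `issue_refund` against a `planning` snapshot that only allows `retrieve_context` and `fail` comes back as `{ ok: false, ... }` and is never applied.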

Use for:

  1. dynamic routing
  2. flexible research workflows
  3. triage
  4. multi-step tool selection

Do not use for:

  1. irreversible actions
  2. financial movement
  3. production changes without approval
  4. security-sensitive operations

LLM-planned, machine-executed

The LLM produces a plan. The workflow validates and executes each step under deterministic rules.

{
  "plan": [
    { "step": "search_docs", "query": "refund policy" },
    { "step": "read_customer_record", "customer_id": "123" },
    { "step": "draft_refund_decision" }
  ]
}

The workflow validates each step:

  1. Is the operation allowed?
  2. Does the user have permission?
  3. Is approval required?
  4. Are the arguments valid?
  5. Is the risk level acceptable?

The LLM proposes. The orchestration disposes.
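The per-step checks can be sketched as follows. The tool registry, permission names, and risk levels are illustrative assumptions; argument/schema validation (check 4) is elided for brevity:

```typescript
// Validate one plan step against a deterministic tool registry.
type Risk = 'low' | 'high';

interface PlanStep {
  step: string;
  [arg: string]: unknown;
}

const toolRegistry: Record<string, { risk: Risk; requiredPermission: string }> = {
  search_docs:           { risk: 'low',  requiredPermission: 'read' },
  read_customer_record:  { risk: 'low',  requiredPermission: 'read' },
  draft_refund_decision: { risk: 'high', requiredPermission: 'write' },
};

function validateStep(
  step: PlanStep,
  userPermissions: string[]
): { allowed: boolean; needsApproval: boolean } {
  const tool = toolRegistry[step.step];
  if (!tool) return { allowed: false, needsApproval: false };   // 1. operation allowed?
  if (!userPermissions.includes(tool.requiredPermission)) {
    return { allowed: false, needsApproval: false };            // 2. user permission?
  }
  const needsApproval = tool.risk === 'high';                   // 3 + 5. risk gates approval
  return { allowed: true, needsApproval };
}
```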


Supervisor and actors

A supervisor state machine coordinates one or more LLM actor agents.

supervisor
├── researcher_agent
├── drafter_agent
├── checker_agent
└── tool_agent

The supervisor owns transitions, retries, approval gates, and termination.

Each actor has a bounded role.

Microsoft's Azure agent orchestration guidance: use the lowest level of complexity that meets requirements. Multi-agent systems add coordination overhead, latency, and failure modes; justify them only when a single agent is insufficient.


LLM ↔ Orchestration Communication

State snapshot

A state snapshot is the structured subset of orchestration state exposed to the LLM for a single decision.

The LLM receives only decision-relevant state.

{
  "current_state": "planning",
  "user_goal": "answer employee question about refund policy",
  "available_context": ["employee_handbook", "refund_policy"],
  "allowed_transitions": ["retrieve_context", "ask_clarifying_question", "draft_answer"],
  "forbidden_actions": ["issue_refund", "change_policy"]
}

The state snapshot is input. It is not authority.


Structured LLM output

The LLM emits a structured event proposal.

{
  "event": "NEED_RETRIEVAL",
  "payload": { "query": "refund policy for damaged products", "sources": ["policy_docs"] },
  "confidence": 0.82
}

The orchestration converts the proposal into an event the state machine accepts.

The LLM does not mutate state directly.


Event boundary

The event boundary is the deterministic interface where LLM output becomes a state-machine input.

Required checks:

  1. schema validation
  2. allowed-event validation
  3. permission validation
  4. tool-argument validation
  5. risk classification
  6. audit logging

const allowedEvents = ['NEED_RETRIEVAL', 'ASK_USER', 'DRAFT_READY', 'FAIL'];
if (!allowedEvents.includes(llmOutput.event)) {
  send({ type: 'LLM_OUTPUT_REJECTED' });
}

Memory vs state

State is the operational execution record.

Memory is information used by the agent across steps or sessions.

They must be separated.

State examples:

  1. current step
  2. retry count
  3. approval status
  4. tool result
  5. active plan

Memory examples:

  1. user preference
  2. prior conversation summary
  3. company policy fact
  4. retrieved document excerpt

LangGraph treats these as separate concerns to support durable execution, human-in-the-loop, and replay.
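The separation can be made concrete as two independent structures, persisted and updated on different schedules. Field names follow the examples above and are illustrative:

```typescript
// State: the operational execution record. Drives transitions and replay.
interface ExecutionState {
  currentStep: string;
  retryCount: number;
  approvalStatus: 'pending' | 'approved' | 'rejected' | null;
  toolResults: unknown[];
  activePlan: string[] | null;
}

// Memory: information the agent uses across steps or sessions.
// Injected into prompts as context; never consulted for transition decisions.
interface AgentMemory {
  userPreferences: Record<string, string>;
  conversationSummary: string;
  retrievedExcerpts: string[];
}
```

Keeping the two apart means a replay can reconstruct execution from `ExecutionState` alone, while `AgentMemory` can be summarized or pruned without affecting durability.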


Distributed Systems Concerns

Durability

State and context must persist after every transition. A crash in awaiting_tool_result should resume in awaiting_tool_result, not restart planning.

Persisted records:

  1. current state
  2. pending event
  3. context (counters, tool results, plan)
  4. approval decisions
  5. transition log

Storage:

  • Local: SQLite, Postgres, Redis
  • Google: Firestore, Cloud SQL, BigQuery
  • Workflow-native: Temporal, AWS Step Functions, Google Cloud Workflows, Inngest, Trigger.dev

Idempotency

Network retries cause tools to execute more than once.

Either the tool is idempotent, or the orchestration tracks an attempt ID and the tool deduplicates.

Non-idempotent operations require stronger approval controls.
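The attempt-ID pattern can be sketched as follows. The store is in-memory here for illustration; a real system would deduplicate against durable storage on the tool side:

```typescript
// Deduplicate tool executions by a stable attempt ID assigned by the
// orchestration. A retried call returns the recorded result instead of
// re-running the side effect.
const completedAttempts = new Map<string, unknown>();

async function executeOnce<T>(
  attemptId: string,
  run: () => Promise<T>
): Promise<T> {
  if (completedAttempts.has(attemptId)) {
    return completedAttempts.get(attemptId) as T; // retry path: no side effect
  }
  const result = await run();
  completedAttempts.set(attemptId, result);
  return result;
}
```

A network retry that replays `executeOnce('refund-123', ...)` executes the refund once and returns the same result both times.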


Out-of-order events

Tool results may arrive after the orchestration has moved on. The state machine must accept or reject events based on its current state rather than applying them blindly.

Time itself is an event source. timeout, tick, and deadline_passed are first-class inputs, not afterthoughts.

A large share of production agent failures trace back to missed timeouts.
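Treating time as an event source can be sketched by racing the tool call against a deadline, so that a timeout arrives as an ordinary event rather than a hang. The event names are illustrative:

```typescript
// Race a tool call against a deadline; both outcomes become events.
type AgentEvent =
  | { type: 'TOOL_RESULT'; payload: unknown }
  | { type: 'TIMEOUT' };

function withDeadline(toolCall: Promise<unknown>, ms: number): Promise<AgentEvent> {
  const timeout = new Promise<AgentEvent>((resolve) =>
    setTimeout(() => resolve({ type: 'TIMEOUT' }), ms)
  );
  const result = toolCall.then(
    (payload): AgentEvent => ({ type: 'TOOL_RESULT', payload })
  );
  return Promise.race([result, timeout]);
}
```

The state machine then handles `TIMEOUT` like any other input: transition to a retry or repair state instead of waiting indefinitely.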


Tool authority

Tool authority is the deterministic permission boundary that determines whether a model-proposed action may execute.

The LLM may propose:

{ "tool": "refund_customer", "amount": 500 }

The orchestration checks:

  1. Is refund_customer allowed in the current state?
  2. Is the amount below the auto-approval threshold?
  3. Does the current user have permission?
  4. Is the customer record loaded?
  5. Is the action idempotent?
  6. Is human approval required?

Only then can the tool run.
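A sketch of these checks for the refund proposal. The threshold, state name, and permission names are illustrative assumptions:

```typescript
// Deterministic authority check for a model-proposed refund.
interface AuthorityContext {
  currentState: string;
  userPermissions: string[];
  customerRecordLoaded: boolean;
}

const AUTO_APPROVE_LIMIT = 100; // refunds above this require a human

function authorizeRefund(
  ctx: AuthorityContext,
  proposal: { tool: string; amount: number }
): 'execute' | 'needs_approval' | 'reject' {
  if (proposal.tool !== 'refund_customer') return 'reject';
  if (ctx.currentState !== 'ready_to_execute_sensitive_tool') return 'reject'; // allowed in this state?
  if (!ctx.userPermissions.includes('refund')) return 'reject';                // user permission?
  if (!ctx.customerRecordLoaded) return 'reject';                              // precondition met?
  return proposal.amount <= AUTO_APPROVE_LIMIT ? 'execute' : 'needs_approval'; // threshold check
}
```

With this shape, the `{ "tool": "refund_customer", "amount": 500 }` proposal above lands in `needs_approval` rather than executing directly.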


Human approval gates

Human approval is a durable transition checkpoint, not a prompt instruction.

ready_to_execute_sensitive_tool
→ awaiting_human_approval
→ approved
→ execute_tool

The orchestration must persist at the approval state and resume without replaying prior agent work.

Mandatory human-in-the-loop gates make orchestration synchronous at that step.


Tool-call loops and termination

A tool-call loop is repeated tool selection without progress toward completion.

Required controls:

  1. max iterations
  2. max tool calls
  3. max cost
  4. repeated-tool detection
  5. progress scoring
  6. forced termination state

Microsoft guidance: guard against infinite tool-call loops with iteration limits.
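The controls above can be sketched as a budget checked before every iteration. The limit values are illustrative:

```typescript
// Deterministic loop budget checked before each LLM/tool iteration.
interface LoopBudget {
  iterations: number;
  toolCalls: number;
  costUsd: number;
  lastTools: string[]; // recent tool names, for repeat detection
}

const LIMITS = { maxIterations: 20, maxToolCalls: 30, maxCostUsd: 2.0, maxRepeats: 3 };

// Returns a termination reason, or null to continue.
function shouldTerminate(b: LoopBudget): string | null {
  if (b.iterations >= LIMITS.maxIterations) return 'max_iterations';
  if (b.toolCalls >= LIMITS.maxToolCalls) return 'max_tool_calls';
  if (b.costUsd >= LIMITS.maxCostUsd) return 'max_cost';
  const tail = b.lastTools.slice(-LIMITS.maxRepeats);
  if (tail.length === LIMITS.maxRepeats && tail.every((t) => t === tail[0])) {
    return 'repeated_tool'; // same tool N times in a row: no progress
  }
  return null;
}
```

A non-null reason routes the machine into a forced termination state rather than another LLM call.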


State compression

State compression transforms accumulated execution history into a compact representation for model context.

Use when:

  1. workflows are long
  2. context windows fill
  3. tool outputs are large
  4. multi-agent transcripts accumulate noise

Store full state externally. Pass compressed state to the LLM.

{
  "summary": "User asked about refund eligibility. Policy says damaged products qualify within 30 days. Customer order is 18 days old.",
  "open_decision": "Determine whether refund needs manager approval.",
  "allowed_transitions": ["request_approval", "draft_response", "fail"]
}
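A minimal compression sketch: full history stays in external storage; only a truncated view reaches the model. A production system would typically summarize with an LLM; plain truncation stands in for that here, and all names are illustrative:

```typescript
// Build a compact model-facing view from the full execution history.
interface HistoryEntry {
  state: string;
  note: string; // e.g. a tool output or transition annotation
}

function compressForModel(
  history: HistoryEntry[],
  allowedTransitions: string[],
  maxEntries = 3,
  maxNoteChars = 120
) {
  return {
    recent_steps: history.slice(-maxEntries).map((h) => ({
      state: h.state,
      note: h.note.slice(0, maxNoteChars), // trim large tool outputs
    })),
    allowed_transitions: allowedTransitions,
  };
}
```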

Production Architectures

Node / TypeScript

  1. XState (orchestration)
  2. Fastify or NestJS
  3. Postgres (state persistence)
  4. Redis
  5. BullMQ (durable queues)
  6. OpenTelemetry
  7. Docker

Google stack

  1. Cloud Run
  2. Pub/Sub
  3. Firestore
  4. BigQuery
  5. Vertex AI
  6. Cloud Logging
  7. IAM

Workflow-native

  1. Temporal
  2. AWS Step Functions
  3. Google Cloud Workflows
  4. Inngest
  5. Trigger.dev

The orchestration model remains the same.

Only execution infrastructure changes.


Architecture Selection

Pattern | LLM role | Orchestration role | Use when
Deterministic workflow | Worker | Full controller | Compliance, support, operations
LLM-controlled transitions | Chooser | Validator and executor | Flexible routing, triage
LLM-planned workflow | Planner | Plan validator and executor | Variable-step tasks
Supervisor + actors | Specialist workers | Coordinator | Multi-agent systems
Fully autonomous loop | Controller | Minimal guard | Research demos, low-risk sandboxes

Minimal Orchestration Shape

A durable orchestration shape using XState. The model itself (states, events, transitions, guards) is covered in 1.2.1; this snippet shows the runtime concerns: async invocation, validation, retry, terminal states.

import { createMachine, assign } from 'xstate';

const agentMachine = createMachine({
  id: 'agent',
  initial: 'planning',
  context: {
    userGoal: '',
    plan: null,
    proposedEvent: null,
    toolResults: [],
    retryCount: 0
  },
  states: {
    planning: {
      invoke: {
        src: 'callPlannerLLM',
        onDone: {
          target: 'validatingLLMOutput',
          actions: assign({ proposedEvent: ({ event }) => event.output })
        },
        onError: 'failed'
      }
    },

    validatingLLMOutput: {
      always: [
        { guard: 'isAllowedRetrievalRequest', target: 'retrieving' },
        { guard: 'isAllowedToolRequest',      target: 'awaitingApproval' },
        { guard: 'isValidFinalAnswer',        target: 'completed' },
        { target: 'repairing' }
      ]
    },

    retrieving:       { invoke: { src: 'retrieveContext', onDone: 'planning', onError: 'repairing' } },
    awaitingApproval: { on: { APPROVED: 'executingTool', REJECTED: 'completed' } },
    executingTool:    { invoke: { src: 'executeTool', onDone: 'planning', onError: 'repairing' } },
    repairing:        { always: [{ guard: 'canRetry', target: 'planning' }, { target: 'failed' }] },

    completed: { type: 'final' },
    failed:    { type: 'final' }
  }
});

The shape is:

LLM output → proposed event → validation → transition

Not:

LLM output → direct state mutation

Final Rule

The orchestration question is not "do we use a state machine."

It is "who controls the next transition."

For enterprise agentic systems, the workflow validates LLM-proposed transitions. The LLM does not control authority.

For the formal model itself — states, events, transitions, guards, terminal states, FSM versus alternatives — see 1.2.1 Finite State Machines (FSM).


References

  1. Anthropic — Building Effective Agents — anthropic.com/research/building-effective-agents
  2. LangGraph overview — docs.langchain.com/oss/python/langgraph/overview
  3. Microsoft Azure Architecture Center — AI Agent Design Patterns — learn.microsoft.com/en-us/azure/architecture/ai-ml/guide/ai-agent-design-patterns
  4. Temporal — temporal.io
  5. XState — stately.ai/docs/xstate