Agent Call Loop 🔴

The Agent is where AI "thinking" happens in OpenClaw. From building context, calling LLMs, executing tools, to context compaction — this chapter digs deep into the Agent's reasoning engine.

Learning Objectives

After reading this chapter, you'll be able to:

Understand the Bootstrap phase and how it builds the complete Agent context
Trace the tool call loop (ReAct pattern) end-to-end
Understand compaction — when it triggers and how it works
Understand multi-provider failover

I. The ReAct Loop

OpenClaw's Agent reasoning is based on the ReAct (Reasoning + Acting) pattern:

flowchart TB
    START["User message received"]
    BOOTSTRAP["Bootstrap phase\nbuild full context"]
    LLM_CALL["Call LLM\n(streaming)"]
    CHECK{"AI chose what?"}
    TEXT_REPLY["Text reply\n→ stream to user"]
    TOOL_CALL["Call tool\n(bash/file/MCP)"]
    TOOL_RESULT["Get tool result"]
    APPEND["Append tool result\nto message history"]
    COMPACT{"Context exceeds\ntoken limit?"}
    COMPACTION["Compact context\ngenerate summary"]
    DONE["Done\nsave session history"]

    START --> BOOTSTRAP
    BOOTSTRAP --> LLM_CALL
    LLM_CALL --> CHECK
    CHECK -->|"text reply"| TEXT_REPLY
    CHECK -->|"tool call"| TOOL_CALL
    TOOL_CALL --> TOOL_RESULT
    TOOL_RESULT --> APPEND
    APPEND --> COMPACT
    COMPACT -->|"yes"| COMPACTION
    COMPACT -->|"no"| LLM_CALL
    COMPACTION --> LLM_CALL
    TEXT_REPLY --> DONE

II. Bootstrap: Building Context

Bootstrap is the "context preparation" phase before each agent reasoning cycle. It assembles the complete System Prompt:

Core instructions (Agent definition text) — always included
CLAUDE.md / AGENTS.md (project spec docs) — loaded hierarchically
Active Skill files (SKILL.md) — filtered by agent Skill config
Memory injections (from Memory plugin) — recent relevant memories
Tool list (JSON Schemas of available tools)

Bootstrap Budget

Context can't be unlimited — it's bounded by the LLM's context window. bootstrap-budget.ts implements a budget allocation system that tracks character counts and truncates when needed.

type BootstrapBudgetAnalysis = {
  hasTruncation: boolean;
  totals: {
    rawChars: number;
    injectedChars: number;
    bootstrapMaxChars: number;
    bootstrapTotalMaxChars: number;
  };
};

III. Tool Call Loop (ReAct)

When the LLM decides to call a tool:

sequenceDiagram
    participant LLM as "LLM Provider"
    participant AGENT as "Agent Runtime"
    participant TOOL as "Tool Executor"

    AGENT->>LLM: Send messages (with tool schemas)
    LLM-->>AGENT: Streaming response (with tool_use block)
    AGENT->>TOOL: Execute tool (executeToolCall)
    alt requires approval
        TOOL-->>AGENT: Request approval
        AGENT->>AGENT: Wait for user approval
    end
    TOOL-->>AGENT: Tool result
    Note over AGENT: Append tool_result to history
    AGENT->>LLM: Continue conversation (with tool_result)
    LLM-->>AGENT: Final text reply

Available Tools

Category	Examples	Source
File system	`read_file`, `write_file`	Core built-in
Shell	`bash`	Core built-in (security-gated)
Code	`str_replace_editor`	Core built-in
Memory	`memory_create`, `memory_search`	memory-core plugin
MCP tools	Any tool from MCP servers	mcporter plugin

IV. Compaction

When session history grows too large, exceeding the LLM's context window, compaction triggers.

Trigger Condition

How It Works

Compaction calls the LLM to generate a summary of the conversation history. The summary replaces the detailed history — dramatically reducing token count while preserving key information.

Key instructions to the LLM during compaction:

MUST PRESERVE:
- Active tasks and their current status
- Batch operation progress (e.g., '5/17 items completed')
- The last thing the user requested
- Decisions made and their rationale
- TODOs, open questions, and constraints

Identifier Preservation

const IDENTIFIER_PRESERVATION_INSTRUCTIONS =
  'Preserve all opaque identifiers exactly as written (no shortening), ' +
  'including UUIDs, hashes, IDs, tokens, API keys, hostnames, IPs, ports, URLs, and file names.';

This prevents the LLM from "helpfully" shortening UUIDs or IDs in summaries, which would break subsequent operations.

V. Multi-Provider Failover

When multiple LLM Providers are configured, model-fallback.ts implements automatic failover:

# config.yaml
agents:
  default:
    model: anthropic/claude-opus-4-5
    modelFallbacks:
      - model: openai/gpt-4o
        triggerOnErrors: ['rate_limit_error', 'overloaded_error']
      - model: ollama/llama3.1
        triggerOnErrors: ['all']  # ultimate fallback

Key Source Files

File	Size	Role
`src/agents/agent-command.ts`	29KB	Agent reasoning main scheduler
`src/agents/bootstrap-budget.ts`	12KB	Bootstrap budget allocation
`src/agents/compaction.ts`	16KB	Context compaction (summary generation)
`src/agents/bash-tools.exec.ts`	51KB	Bash tool execution logic
`src/agents/acp-spawn.ts`	33KB	Multi-agent spawning

Summary

ReAct loop: LLM → tool calls → tool results → LLM, until LLM gives text reply.
Bootstrap builds full context: core instructions + CLAUDE.md + Skills + Memory + tool schemas.
Compaction solves long session problem: when history hits ~60% of context window, LLM summarizes old history.
Bash tool has security gates: requires approval by default; configurable to allowlist or auto-approve.
Multi-provider failover: configure backup providers, automatic switching on error types.

← Routing Engine | → Plugin SDK Design