agent_memory_and_state | jun ren's digital garden

🗓️ 21032026 2100

AGENT MEMORY AND STATE

How agents retain, retrieve, and manage information across interactions to maintain coherent behavior over time

Why Agents Need Memory

Without memory, every LLM call is stateless — the agent has no context beyond what's in the current prompt. Memory enables agents to:

Type	What It Stores	Lifespan	Storage Approach
Short-term	Current conversation / task context	Single session	Context window (message history)
Long-term	Facts, preferences, learned knowledge	Persistent	Vector DB, key-value store, file system
Episodic	Specific past interactions and outcomes	Persistent	Structured logs, indexed by time/task

The simplest form — just the conversation history passed in the LLM's context window. Effective but limited by token count. Strategies to manage it:

Persistent knowledge that survives across sessions. Typically involves:

Embedding + vector search: store information as vectors, retrieve by semantic similarity
Structured storage: key-value pairs, relational DB for structured facts
File-based: markdown files, JSON — simple but effective for smaller knowledge bases

Records of what happened in specific past interactions — useful for learning from experience:

Approach	How It Works	Best For
Centralized	Single state store shared by all agents	coordinator_router_pattern, simpler multi-agent setups
Distributed	Each agent maintains its own state	Loosely coupled agents, parallel execution
External	State stored in external system (DB, cache)	Production systems, persistence across restarts

Pattern	Memory Considerations
Single agent	Short-term context usually sufficient
coordinator_router_pattern	Coordinator needs memory of subtask results; sub-agents may be stateless
agent_as_tool_pattern	Primary agent holds all state; sub-agent tools are deliberately stateless
loop_review_critique_pattern	Needs memory of previous iterations to avoid repeating the same fixes

Concern	Detail
Token limits	Context window is finite — aggressive summarization or retrieval needed for long tasks
Retrieval quality	Vector search isn't perfect — irrelevant memories can mislead the agent
Staleness	Stored facts can become outdated — need expiration or validation strategies
Cost	Every memory retrieved and injected into context costs tokens
Privacy	Persistent memory may store sensitive information — consider retention policies