OpenClaw Memory

Part 2: Why Your Agent Forgets

The 3 failure modes explained

Quick Recap: The 4 Layers

From Part 1, we learned about:

  1. Bootstrap Files - Permanent identity (reloaded from disk)
  2. Session Transcript - Conversation history (gets compacted)
  3. Context Window - Active memory (fixed size)
  4. Retrieval Index - Searchable archive (files only)

Now let's see how these layers fail.

A Real-World Example

Summer Yue (Meta's Director of Alignment) had to physically run to her computer to stop her OpenClaw agent from bulk deleting hundreds of emails.

What Happened

  • She told the agent: "Don't do anything until I say so"
  • Agent worked fine on test inbox for weeks
  • Pointed it at real inbox with thousands of messages
  • Context window filled up → compaction fired
  • That instruction (given in chat, not saved to file) disappeared from the summary
  • Agent went autonomous and started deleting emails

The Lesson

If it's not written to a file, it doesn't exist.

If Meta's Director of Alignment can lose control, so can you.

Important: Compaction is normal. The failure is relying on chat-only rules to survive long sessions.

The 3 Failure Modes

When your agent forgets, the cause is one of these three:

  1. Failure A: Never Stored
  2. Failure B: Compaction Changed Context
  3. Failure C: Session Pruning

Let's break down each one.

Failure A: Never Stored

Most common cause (90% of cases)

The instruction/preference/decision only existed in conversation.

It was never written to a file.

When compaction fires or a new session starts → gone forever

Failure A: What Gets Lost

Examples of things that disappear:

  • Instructions given in chat: "Always check X before doing Y"
  • Preferences mentioned casually: "I prefer JSON over YAML"
  • Decisions made mid-conversation: "Let's use approach B"
  • Constraints stated once: "Don't touch the production database"
  • Context from earlier: "Remember, this is for the client demo"

Failure A: Why It Happens

The mechanism:

  1. You give instruction in chat
  2. Agent acknowledges and follows it
  3. Session continues, context fills up
  4. Compaction summarizes old messages
  5. Your instruction gets compressed or dropped
  6. Agent no longer "remembers" it

This is exactly what happened to Summer Yue

Failure A: Bootstrap File Limits

Critical limitation:

  • 20,000 characters per file
  • 150,000 characters total across all bootstrap files
  • If exceeded → file gets truncated
  • Truncated portion is invisible to the agent

Check with: /context list
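The limits above are easy to check mechanically. This is a minimal sketch, assuming the 20,000-per-file and 150,000-total character caps described in this section; the file paths and report shape are illustrative, not an OpenClaw API.

```python
from pathlib import Path

# Character caps as described above; adjust if your OpenClaw version differs.
PER_FILE_LIMIT = 20_000
TOTAL_LIMIT = 150_000

def check_bootstrap_files(paths):
    """Report which bootstrap files risk truncation and whether the
    combined size exceeds the total budget."""
    total = 0
    report = {}
    for p in paths:
        size = len(Path(p).read_text(encoding="utf-8"))
        total += size
        report[p] = {"chars": size, "truncated": size > PER_FILE_LIMIT}
    report["__total__"] = {"chars": total, "over_budget": total > TOTAL_LIMIT}
    return report
```

Anything flagged `truncated` here is exactly the content that would be invisible to the agent.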

Failure A: The Fix Preview

Solution (covered in Part 3):

Write important instructions to bootstrap files:

  • AGENTS.md for workflow rules
  • MEMORY.md for current context
  • USER.md for preferences

These files survive compaction because they're reloaded from disk.
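The fix boils down to one habit: when a rule matters, append it to a bootstrap file instead of leaving it in chat. A minimal sketch, assuming an AGENTS.md in the working directory; the bullet format and `persist_rule` helper are illustrative conventions, not part of OpenClaw.

```python
from pathlib import Path

def persist_rule(rule: str, path: str = "AGENTS.md") -> None:
    """Append a chat-given rule to a bootstrap file so it survives
    compaction. Skips the write if the rule is already present."""
    p = Path(path)
    existing = p.read_text(encoding="utf-8") if p.exists() else ""
    if rule in existing:
        return  # already persisted
    with p.open("a", encoding="utf-8") as f:
        f.write(f"- {rule}\n")
```

The dedup check keeps repeated sessions from filling the file with duplicates, which matters given the 20K-character cap.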

Failure B: Compaction Changed Context

The lossy compression problem

Long session hits token limit → compaction summarizes old messages

The summary can be lossy:

  • Drops important details
  • Loses nuance
  • Misses specific constraints

Failure B: How It Happens

The compaction process:

  1. Context window approaches limit (e.g., 156K / 200K tokens)
  2. OpenClaw triggers compaction
  3. Model summarizes old conversation into ~5-10K tokens
  4. Summary replaces detailed history
  5. Agent now operates from summary, not original words

Note: Recent messages (~20K tokens) stay intact
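The steps above can be sketched as follows. This is a toy model of compaction, not OpenClaw's implementation: tokens are approximated as characters divided by four, and `summarize` is a stub standing in for the model-written (lossy) summary.

```python
def compact(messages, tail_tokens=20_000, count=lambda m: len(m) // 4,
            summarize=lambda msgs: "[summary of %d messages]" % len(msgs)):
    """Keep a recent tail of messages intact and replace everything
    older with a single summary entry."""
    kept, budget = [], tail_tokens
    # Walk backwards, keeping recent messages until the tail budget is spent.
    for m in reversed(messages):
        cost = count(m)
        if cost > budget:
            break
        kept.append(m)
        budget -= cost
    kept.reverse()
    old = messages[: len(messages) - len(kept)]
    return ([summarize(old)] if old else []) + kept
```

Note what the toy model makes obvious: everything outside the tail survives only as whatever `summarize` chose to keep.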

Failure B: Example

Original conversation (detailed):

"When processing user data, always validate email format first, then check if the domain is on our allowlist. If validation fails, log the error with timestamp and user ID, but don't send an email notification - just update the dashboard. Only send notifications for successful validations."

After compaction (summary):

"Discussed user data validation workflow with email checks and error logging."

All the specific constraints are gone

Failure B: What Survives

After compaction, the model retains:

  • Recent messages (~20K tokens stay intact)
  • Bootstrap files (reloaded from disk)
  • Summary of old messages (lossy)

Lost:

  • Exact wording of old instructions
  • Specific constraints and edge cases
  • Context and reasoning behind decisions
  • All images shared before compaction

Failure C: Session Pruning

Tool outputs trimmed to optimize caching

Session pruning trims old tool results from memory:

  • File reads
  • Browser snapshots
  • API responses

Important: This is actually your friend!

Failure C: Why It's Good

Pruning helps, not hurts:

  • ✓ Reduces bloat without destroying conversation
  • ✓ Only affects tool results, not messages
  • ✓ User and assistant messages stay intact
  • ✓ Optimizes prompt caching (saves money)
  • ✓ Delays compaction (the dangerous one)

Failure C: Cache-TTL Mode

The recommended pruning mode

{
  "contextPruning": {
    "mode": "cache-ttl",
    "ttl": "5m"
  }
}

  • Auto-enabled for Claude setups
  • Trims tool results after 5 minutes
  • Preserves conversation flow
  • Improves cache hit rates
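The cache-TTL behavior amounts to a simple filter. A sketch under assumptions: entries are dicts with a `role` and an epoch-seconds `ts`, which is an illustrative shape, not OpenClaw's internal format.

```python
import time

def prune_tool_results(entries, ttl_seconds=300, now=None):
    """Drop tool results older than the TTL; leave user and assistant
    messages untouched, exactly as described above."""
    now = time.time() if now is None else now
    return [
        e for e in entries
        if e["role"] != "tool" or now - e["ts"] <= ttl_seconds
    ]
```

Because only `role == "tool"` entries are candidates, the conversation itself is never touched, which is why pruning is lossless from the model's point of view.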

Failure C: The Confusion

Most people confuse pruning with compaction.

They are completely different:

  • Pruning = automatic, safe, let it happen
  • Compaction = dangerous, loses context

Pruning is not a failure mode - it's a feature that prevents failure.

Compaction vs Pruning

Let's make this crystal clear:

Compaction

  • Summarizes entire conversation
  • Changes what model sees
  • Affects everything
  • Fires when context fills
  • Lossy (drops details)
  • Permanent
  • ⚠️ Dangerous

Pruning

  • Trims old tool results
  • In-memory only
  • Only affects tool outputs
  • Happens continuously
  • Lossless (on-disk transcript stays intact)
  • Temporary
  • ✓ Your friend

Compaction: The Two Paths

There are two ways compaction can happen:

  1. Good Path: Controlled compaction with memory flush
  2. Bad Path: Emergency compaction from overflow

The Good Path

Controlled compaction with memory flush

  1. Context nearing limit (e.g., 156K / 200K tokens)
  2. Memory flush fires first (automatic safety net)
  3. Agent saves important context to disk
  4. Compaction summarizes old history
  5. Agent continues with summary + saved context

Minimal context loss

The Bad Path

Emergency compaction from overflow

  1. Context overflows limit (e.g., 205K / 200K tokens)
  2. API rejects the request
  3. No memory flush (already past the trigger point)
  4. Emergency compaction fires
  5. Aggressive summarization to recover

Maximum context loss

Good Path vs Bad Path

Good Path

  • Triggered early (headroom)
  • Memory flush saves context
  • Controlled summarization
  • Minimal loss

Bad Path

  • Triggered by overflow
  • No memory flush
  • Emergency compression
  • Maximum loss

Configuration determines which path you take
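The difference between the two paths is just where the trigger sits relative to the limit. A minimal sketch: the 0.75 flush margin is an assumed default chosen to match the 156K/200K example above, not a documented OpenClaw value.

```python
def choose_path(used_tokens, limit=200_000, flush_margin=0.75):
    """Decide which compaction path a session is on, per the two
    scenarios above."""
    if used_tokens > limit:
        return "emergency-compaction"  # bad path: overflow, no flush possible
    if used_tokens >= limit * flush_margin:
        return "flush-then-compact"    # good path: save context to disk first
    return "continue"                  # plenty of headroom
```

The point of the sketch: once `used_tokens > limit`, the flush branch is unreachable, which is why the bad path loses the most context.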

Diagnostic Tool: /context list

First thing to check when debugging memory issues

Shows you:

  • Which bootstrap files are loaded
  • File sizes (raw vs injected)
  • Truncation status
  • Token estimates

/context list: What to Check

  • Is MEMORY.md loading? If missing, it's not in context
  • Is anything TRUNCATED? Content over 20K chars gets cut
  • Do injected chars = raw chars? If not, content is being cut

If a file is truncated, the truncated portion is invisible to the agent

The 3 Failure Modes: Summary

  1. Never Stored (90%): Instructions given in chat, never written to files
  2. Compaction Changed Context: Lossy summarization drops details
  3. Session Pruning: Not actually a failure - it's helpful (cache-TTL mode)

Key insight: Compaction is the enemy, pruning is the friend

Diagnostic Checklist

When your agent forgets something, ask:

  • Was it written to a file? → If no, that's Failure A
  • Did it happen after a long session? → Probably Failure B (compaction)
  • Was it a tool result? → That's Failure C (pruning, expected)

First step: Run /context list to verify what's loaded

Next: How to Fix It

Now you know why agents forget.

Part 3 covers the solutions:
Configuration, file architecture, and best practices