Self-Growing Tag-Dictionary RAG for Any Device
Give your AI long-term memory that grows smarter over time.
No GPU. No vector database. No external dependencies.
Just pip install and go.
```python
from sandclaw_memory import BrainMemory

brain = BrainMemory(tag_extractor=my_ai_func)
brain.save("User loves Python and React")

context = brain.recall("what does the user like?")
# -> Returns relevant memories as Markdown, ready for LLM injection
```

| Feature | sandclaw-memory | Vector DB (Pinecone, Weaviate) | mem0 |
|---|---|---|---|
| Setup | pip install (done) | Docker + API keys + config | pip install + API key |
| GPU required | No | Often yes | No |
| Cost over time | Decreases (self-growing) | Constant | Constant |
| Search | FTS5 + BM25 (proven) | Cosine similarity | Embedding-based |
| Privacy | 100% local | Cloud required | Cloud optional |
| Dependencies | 0 (stdlib only) | Many | Several |
| Size | ~50KB | 100MB+ Docker | ~5MB |
Why tags instead of vectors? Vector embeddings are powerful but opaque. When a wrong result appears, you can't explain why. sandclaw-memory uses AI-powered tag extraction, so the AI understands synonyms and context (e.g., "vehicle" and "car" get the same tag), while every result is traceable to human-readable tags. Same semantic understanding, zero infrastructure, and you can debug it.
Traditional RAG calls AI for every search. sandclaw-memory learns as it goes:

- Day 1: "Built a page with React" → AI extracts `[react, frontend, page]`; `keyword_map` registers "React" → `react`, "page" → `page`
- Day 30: "React Native app" → instant `keyword_map` match, no AI call. Cost: $0.00
- Day 90: 80% of saves match `keyword_map` → AI calls reduced by 80%

The more you use it, the cheaper it gets.
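Conceptually, the two-stage pipeline looks something like this. This is a minimal illustrative sketch, not the library's actual internals; `ai_extract` stands in for whatever AI callback you plug in:

```python
def extract_tags(content: str, keyword_map: dict[str, str], ai_extract) -> list[str]:
    # Stage 1: instant, free dictionary lookup against known keywords
    words = content.lower().split()
    hits = {keyword_map[w] for w in words if w in keyword_map}
    if hits:
        return sorted(hits)

    # Stage 2: fall back to the AI callback, then grow the dictionary
    tags = ai_extract(content)
    for tag in tags:
        keyword_map.setdefault(tag.lower(), tag)  # future saves match in Stage 1
    return tags
```

Every Stage 2 call enlarges the dictionary, so over time Stage 1 absorbs more and more of the traffic.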
```bash
pip install sandclaw-memory
```

```python
from sandclaw_memory import BrainMemory

with BrainMemory(tag_extractor=my_tag_func) as brain:
    brain.save("Important: migrate to FastAPI by Q2")
    print(brain.recall("what's the migration plan?"))
```

```python
import json, openai
from sandclaw_memory import BrainMemory

def tag_extractor(content: str) -> list[str]:
    resp = openai.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{
            "role": "system",
            "content": "Extract 3-7 keyword tags. Return JSON array only."
        }, {"role": "user", "content": content}],
        temperature=0,
    )
    return json.loads(resp.choices[0].message.content)

with BrainMemory(
    db_path="./my_memory",
    tag_extractor=tag_extractor,
) as brain:
    brain.start_polling()  # Background tag extraction every 15s

    brain.save("User prefers Python and TypeScript")
    brain.save("Decided to use PostgreSQL", source="archive")

    context = brain.recall("what tech stack?")
    # Inject 'context' into your LLM's system prompt
```

```python
import json, anthropic
from sandclaw_memory import BrainMemory

client = anthropic.Anthropic()

def tag_extractor(content: str) -> list[str]:
    resp = client.messages.create(
        model="claude-haiku-4-5-20251001",
        max_tokens=200,
        messages=[{
            "role": "user",
            "content": f"Extract 3-7 tags as JSON array:\n{content}",
        }],
    )
    return json.loads(resp.content[0].text)

brain = BrainMemory(tag_extractor=tag_extractor)
```

See `examples/` for LangChain, custom callbacks, and scheduling patterns.
```
┌─────────────────────────────────────────────────┐
│ BrainMemory │
│ save() → recall() → start_polling() │
├─────────────────────────────────────────────────┤
│ │
│ ┌─────────┐ ┌──────────┐ ┌────────────────┐ │
│ │ L1 │ │ L2 │ │ L3 │ │
│ │ Session │ │ Summary │ │ Archive │ │
│ │ │ │ │ │ │ │
│ │ 3-day │ │ 30-day │ │ Permanent │ │
│ │ rolling │ │ AI/text │ │ SQLite + FTS5 │ │
│ │ Markdown│ │ summary │ │ + self-growing │ │
│ │ logs │ │ │ │ tag dict │ │
│ └────┬────┘ └────┬─────┘ └───────┬────────┘ │
│ │ │ │ │
│ ┌────┴────────────┴────────────────┴────────┐ │
│ │ Intent Dispatcher │ │
│ │ CASUAL → L1 │ STANDARD → L1+L2 │ │
│ │ │ DEEP → L1+L2+L3 │ │
│ └───────────────┴───────────────────────────┘ │
│ │
│ ┌──────────────────────────────────────────┐ │
│ │ Self-Growing Tag Pipeline │ │
│ │ Stage 1: keyword_map (instant, free) │ │
│ │ Stage 2: AI callback (async, queue) │ │
│ │ → keyword_map grows → Stage 2 shrinks │ │
│ └──────────────────────────────────────────┘ │
└─────────────────────────────────────────────────┘
```
| Layer | What | Retention | Storage | Speed |
|---|---|---|---|---|
| L1 Session | Conversation logs | 3 days (rolling) | Markdown files | Instant |
| L2 Summary | Period summaries | 30 days | In-memory | Instant |
| L3 Archive | Important memories | Forever | SQLite + FTS5 | ~1ms |
The dispatcher auto-detects how deep to search:
brain.recall("what time is it?") # → CASUAL (L1 only)
brain.recall("summarize this month") # → STANDARD (L1 + L2)
brain.recall("why did we pick React?") # → DEEP (L1 + L2 + L3)
# Or override manually:
brain.recall("anything", depth="deep")BrainMemory(
db_path="./memory", # Where to store files
tag_extractor=my_func, # REQUIRED: (str) -> list[str]
promote_checker=None, # Optional: (str) -> bool
depth_detector=None, # Optional: (str) -> str
duplicate_checker=None, # Optional: (str, str) -> bool
conflict_resolver=None, # Optional: (str, str) -> str
polling_interval=15, # Seconds between maintenance cycles
rolling_days=3, # L1 retention period
summary_days=30, # L2 summary period
max_context_chars=15_000, # Context budget for LLM injection
encryption_key=None, # Optional SQLCipher encryption
)| Method | Description |
|---|---|
| `save(content, source="chat", tags=None)` | Save to memory. `source="archive"` forces L3. |
| `recall(query, depth=None)` | Recall relevant memories as Markdown. |
| `search(query, limit=20)` | Search the L3 archive; returns `list[MemoryEntry]`. |
| `promote(content, tags=None)` | Manually save to L3. Returns the memory ID. |
| `summarize(llm_callback=None)` | Generate a 30-day summary. |
| `start_polling()` | Start the background maintenance loop. |
| `stop_polling()` | Stop the background maintenance loop. |
| `run_maintenance()` | Run one maintenance cycle manually. |
| `get_stats()` | Get stats from all layers. |
| `get_tag_stats()` | Get tag usage counts. |
| `export_json(path=None)` | Export all data as JSON. |
| `on(event, callback)` | Register an event hook. |
| `close()` | Stop polling and close the DB. |
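A few of these together, as a quick sketch. Only the signatures in the table are from the API; the shapes of the returned values are assumptions, so inspect them in your own environment:

```python
# Inspect what the memory has learned so far
print(brain.get_stats())      # per-layer stats
print(brain.get_tag_stats())  # tag usage counts

# Raw archive search, bypassing the Markdown formatting of recall()
for entry in brain.search("PostgreSQL", limit=5):
    print(entry)  # MemoryEntry; exact fields depend on the library version

# Back up everything to JSON
brain.export_json("./memory_backup.json")
```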
| Callback | Signature | Default |
|---|---|---|
| `tag_extractor` | `(str) -> list[str]` | REQUIRED |
| `promote_checker` | `(str) -> bool` | `len(content) > 200` |
| `depth_detector` | `(str) -> str` | Keyword matching |
| `duplicate_checker` | `(str, str) -> bool` | `SequenceMatcher > 0.85` |
| `conflict_resolver` | `(str, str) -> str` | Keep newer text |
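Custom callbacks are plain functions matching the signatures above. A minimal sketch; the heuristics here are examples, not the library's defaults, and the lambda `tag_extractor` is a stub for the demo:

```python
from difflib import SequenceMatcher

from sandclaw_memory import BrainMemory

def promote_checker(content: str) -> bool:
    # Promote explicit "Important:" notes regardless of length
    return content.lower().startswith("important:") or len(content) > 200

def duplicate_checker(a: str, b: str) -> bool:
    # Slightly stricter than the default 0.85 similarity threshold
    return SequenceMatcher(None, a, b).ratio() > 0.9

def conflict_resolver(old: str, new: str) -> str:
    # Keep the newer text but retain the old version as a note
    return f"{new}\n(previously: {old})"

brain = BrainMemory(
    tag_extractor=lambda content: content.lower().split()[:5],  # demo stub
    promote_checker=promote_checker,
    duplicate_checker=duplicate_checker,
    conflict_resolver=conflict_resolver,
)
```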
brain.on("before_save", lambda content, source: ...)
brain.on("after_save", lambda content, source: ...)
brain.on("before_promote", lambda content: ...)
brain.on("after_promote", lambda content, mem_id: ...)
brain.on("after_recall", lambda query, depth, result: ...)
brain.on("after_cycle", lambda stats: ...)| File | Description |
| File | Description |
|---|---|
| `basic_usage.py` | Simplest setup with OpenAI |
| `with_openai.py` | All 5 callbacks with OpenAI |
| `with_anthropic.py` | Claude API integration |
| `with_langchain.py` | LangChain agent + memory |
| `custom_callbacks.py` | Custom callbacks + hooks |
| `scheduling.py` | Polling, FastAPI, Celery |
- Python: 3.9, 3.10, 3.11, 3.12, 3.13
- OS: Windows, macOS, Linux
- Dependencies: None (Python stdlib only)
- SQLite: Uses the built-in `sqlite3` module (FTS5 optional, with graceful fallback)
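If you want to check whether your Python build ships FTS5 before relying on full-text search, a quick probe using only the standard `sqlite3` module (the fallback behavior itself is the library's, not this snippet's):

```python
import sqlite3

con = sqlite3.connect(":memory:")
try:
    con.execute("CREATE VIRTUAL TABLE fts_probe USING fts5(body)")
    print("FTS5 available: BM25 full-text search can be used")
except sqlite3.OperationalError:
    print("FTS5 missing: the library falls back to simpler search")
finally:
    con.close()
```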
See CONTRIBUTING.md for guidelines.
MIT License. See LICENSE for details.
This library was born from the multi-layer RAG engine built for SandClaw, an AI-powered trading IDE. We extracted and open-sourced the memory system so any developer can use it.