Tell it once. Your local AI remembers the rest.
Stop re-explaining your setup every session. Drop-in memory for Ollama, Claude Desktop, or Cursor (via MCP) — fully local, set up in 20 minutes.
It carries what you told it into the next session.
Representative session. The kit recalls only what you've saved — nothing leaves your machine.
Four moving parts. Nothing to learn.
Save a note
You (or your agent) save a fact — your stack, a preference, a decision.
Embed it locally
A local model turns it into a vector. No cloud, no API key.
Retrieve by meaning
Later, the most relevant memories are found by similarity — not keywords.
Feed your AI
Those memories go to your agent, so it answers like it knows you.
A kit, not a framework.
- memory module
save()/search()The core store. Read it in one sitting. - MCP server
memory_save/memory_searchPlugs into Claude Desktop, Cursor, or your own agent. - example agentA runnable CLITell it something, restart, watch it remember.
- README~20-minute setupCopy-paste, start to finish.
Free OSS memory libraries are frameworks you study. Recall is a small, legible kit you run today — that's the £19.
Exactly what you're running.
The hard parts are left out, on purpose.
Two problems are deliberately excluded — so the core stays small enough to actually read and trust.
Relevance filtering
It doesn't decide "what's worth remembering" for you. You save what matters.
Contradiction handling
It won't reconcile memories that disagree over time. It stores and recalls — faithfully.
These are genuinely hard problems, and most tools that claim them do them badly. Recall would rather be honest about its edges than pretend.
Straight answers.
Does my data leave my machine?
No. Embeddings and storage are fully local — no network calls, no API keys.
Which models does it use?
llama3.2 for chat and nomic-embed-text for embeddings, via Ollama. Both are swappable.
Can I use it in production?
Yes — swap the storage for Postgres + pgvector. The rest of the kit doesn't change.
What's not included?
Relevance filtering and contradiction handling — deliberately, to keep the core legible.
What do I need first?
Python 3.11+ and Ollama installed locally. That's it.
Give your local AI a memory tonight.
one-time purchase · instant download · runs fully local