Persistent memory for AI coding agents — small surface, lower token bill
v3 Release — May 14, 2026“Small surface. Tested rules. Honest memory.”
I spend my days writing enterprise code — multi-system integrations, strict conventions, dozens of services and stakeholders that all need to be explained to the AI before it can be useful. I spend my nights on side projects, including a tablet game called Ultima Mobile Classic. Different stacks, different scales, same problem: every morning I sat down and re-explained everything. The architecture. The conventions. The bugs we’d already fixed. The names of the systems. The personality of each project. The AI started from zero every single session.
So I started writing it down. A CLAUDE.md file. Then it grew. Then it grew more. Eventually it was 2,000 lines, and the AI was ignoring half of it. I was spending more time managing context than shipping code — at work and at home.
I needed something better. Not a bigger instructions file — a smaller one. One that stayed small no matter how much I knew, and only handed the AI what it actually needed for the task at hand.
That’s what RunawayContext is. It was built in production, for production — large codebases, strict governance, multiple contributors — and the same tool now runs my weekend side projects. Same patterns, both scales.
RunawayContext is a persistent-memory system for AI coding agents. Instead of stuffing every fact about your project into one giant instructions file, it keeps your knowledge in a structured SQLite database on your machine and serves the AI a small, focused brief — usually under 3,000 tokens — at the start of every session.
The AI knows where to look up deeper details when it needs them. Your token bill drops. Your project’s institutional memory survives between sessions. And nothing leaves your machine.
Every version was driven by something I learned by actually using the previous one. Each iteration kept the same goal — persistent context, small surface, lower tokens — and tightened the screws on how that goal is enforced.
The pieces that make persistent context actually work in practice.
knowledge.db and sessions.db. FTS5 full-text search built in. Optional semantic retrieval via sqlite-vec. Zero network calls by default.
Client class inside custom agents, or call the CLI from shell scripts. Or skip the binaries entirely and just use the markdown template.
You don’t need every feature on day one. RunawayContext has a six-tier maturation ladder so a solo project can start with just a markdown template and graduate up to enterprise federation only when the team actually needs it. Each tier has a real promotion gate — not a feature toggle, but an earned milestone.
| Tier | Name | Users | What you get |
|---|---|---|---|
| T0 | Hello World | Solo | Just the markdown template. No install. Good for trying the discipline before committing to the tooling. |
| T1 | Solo | 1 person | Full SQLite DB, FTS5 search, drift detection. No MCP server yet — the markdown brief is plenty for one project. |
| T2 | Solo Power | 1 person | MCP server with 13 tools, semantic search, telemetry, drafts inbox. The sweet spot for serious solo work. |
| T3 | Pair / Squad | 2–5 people | Author attribution, git-based export/import, conflict resolution. Knowledge survives team members coming and going. |
| T4 | Team | 5–20 people | Visibility ACLs, hash-chained audit log, garbage detection, promotion gates. The level where governance starts to matter. |
| T5 | Org / Enterprise | 20+ people | Federation across projects, SSO bindings, OpenTelemetry export, fine-grained grants. Terminal tier. |
RunawayContext is AI-native — the install procedure is a prompt you paste to your coding agent. It clones the repo, installs dependencies, runs the contract tests, and reports diagnostics. Takes about five minutes.
Already on v1 or v2? The same prompt handles the upgrade non-destructively — reference the appropriate section of INSTALL_PROMPT.md. Nothing in your existing knowledge base is touched until the migration verifies clean.
This started as a fix for my own problem — long days on the day-job codebase, late nights on side projects, and one AI that kept forgetting things across both. But the more I used it, the more it became obvious that the problem isn’t mine alone. Anyone working with AI coding agents hits this wall: the instructions file grows, the AI stops reading half of it, and you start spending more time on context management than on the actual work.
If RunawayContext is useful to you — in your team’s production codebase, on a personal project, or anywhere in between — I’d love to hear how. Issues, pull requests, and stories from the field all welcome. If it’s saved you a few thousand tokens or a few hours of re-explaining, and you want to chip in, the donate button is at the top of the page. Either way — thanks for taking a look. ♥
RunawayContext is free and open source under a permissive license. Star the repo if it’s useful, open an issue if it’s broken, and tell a friend if it saves you a session.
View on GitHub → Support This Project ♥