Back to browse
GitHub Repository

Your AI's memory grows forever. Your token bill doesn't. A cross-provider memory layer for LLM apps.

9 starsPython

Memory for LLM apps that cuts input tokens up to 80% (avg 68%)

by degutemesgen·May 23, 2026·3 points·0 comments

AI Analysis

●●SolidSolve My ProblemBig Brain

Cuts token bills 68% by swapping full history for vector-retrieved signals.

Strengths
  • Wire-protocol style interception means zero changes to existing LLM provider code.
  • Automatic decay logic prevents context bloat without manual pruning rules.
  • SQLite persistence allows memory to survive restarts and share across processes.
Weaknesses
  • Alpha status with shifting APIs makes it risky for production deployment today.
  • Competes directly with established patterns in LangChain, LlamaIndex, and Mem0.
Category
Target Audience

Developers building stateful LLM applications or chatbots

Similar To

LangChain Memory · LlamaIndex · Mem0

Similar Projects

AI/ML●●Solid

Token Saving Tinyscreenshot Skill

4x token savings on screenshots with readable text at 800px grey.

Solve My ProblemBig Brain
franze
211mo ago