Sephera – AST-compressed context packs and native MCP server in Rust
Tree-sitter AST compression cuts LLM context tokens 50-70% while preserving API structure.
Cut your Claude / OpenAI / Gemini bill 70–95% on AI coding. Local proxy that compresses context, keeps provider caches hot, and verifies LLM output ($0 hallucination guard). Drop-in for Cursor, Claude Code, Codex, Aider + 34 more and custom providers — 30s, no code changes
Entropy-based context compression beats naive token stuffing, but the category is crowded.
Developers using AI coding assistants (Cursor, Copilot, Claude Code)
Cursor · Continue · Sourcegraph Cody
Tree-sitter AST compression cuts LLM context tokens 50-70% while preserving API structure.
SLM classifiers compress context based on tool call intent before LLM sees it.
Cuts cargo test output from 61 lines to 1 — saves 60-90% of wasted LLM tokens.
Cuts token costs 70% with receipts proving no accuracy drop on hard evals.
Compressed JSON bundles fit tight context windows better than pasting files.
Cross-corpus Zstd dictionary + per-record AES-256-GCM, but cryptography audit is missing.