Sephera – AST-compressed context packs and native MCP server in Rust
Tree-sitter AST compression cuts LLM context tokens 50-70% while preserving API structure.

AST + embeddings for codebase search—but Sourcegraph Cody, Cursor, and Continue already solve this.
AI agents, code search tools, and developers building agentic coding workflows.
Sourcegraph Cody · Cursor · Continue.dev
when we tested this tool (Context+) for solving an issue on the OpenCode repository, the agent using this tool used ~6.5k fewer tokens, found the code and fixed it in half the time!
the results were surprising: 6 to 10k tokens saved per prompt, completed task in ~2 minutes while the agent running without the tool took ~4 mins for the same and got stuck in loops
built an entire beast by using all the modern tools that we could think of: undo trees, semantic search by meaning (by haskellforall), advanced refactoring, blast radius, advanced file context trees, restore points... i can keep going on
semantic code search and context trees are the future of agentic coding and this tool proves it
the feature i loved the most is semantic search and how it gets things done 2x faster with least possible tokens
it makes an agent that actually knows what it’s doing and not just guessing, it makes meaning from your code similar to RAG. if you aren't optimizing your context, you are just burning money
this tool is still under development, it can have unexpected behavior and the docs need updates but the video shows the reality of how fast it can be
github: https://github.com/ForLoopCodes/contextplus get here: https://contextplus.vercel.app youtube: https://www.youtube.com/watch?v=2YKQPzQFvvg x: https://x.com/forloopcodes/status/2027745057483460757
Tree-sitter AST compression cuts LLM context tokens 50-70% while preserving API structure.
Tree-sitter + agentic flow ~50x faster than raw LLM calls, but codebase visualization is crowded category.
AST-based code indexing saves tokens but Cursor and Continue already do this.
Tree-sitter + FTS5 + MCP = tokens saved for AI agents to actually code, not search.
Forces LLMs to debug with AST evidence instead of pattern-matching symptoms.
Tree-sitter interface extraction cuts token usage by 6x, but chat context window optimization is becoming table stakes.