Reduce Claude Code token usage ~50% with Headroom
60 DAUs saved 10.5B tokens — real savings for Claude Code power users.
An OpenCode plugin that reduces token usage by up to 45% with zero configuration. It compresses tool descriptions, compacts read output, and adds line-range edit support.
The plugin targets a concrete, expensive problem: repeated token bloat from tool schemas and file contents. Line-range edit expansion lets the model reference lines by number instead of pasting the content it wants to change, and the README ships per-model benchmarks (up to ~45% savings) plus one-line installation, so you can try it without changing your workflow. Expect the largest gains in edit-heavy sessions; results will vary with project size and tooling.
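To make the line-range idea concrete, here is a minimal sketch of how an edit expressed as "replace lines M to N" could be applied on the tool side. It is not the plugin's actual code: the function name `apply_line_range_edit` and its parameters are hypothetical, chosen only for illustration, and the plugin's real edit format may differ.

```python
def apply_line_range_edit(source: str, start: int, end: int, replacement: str) -> str:
    """Replace lines start..end (1-indexed, inclusive) of source with replacement.

    Hypothetical helper: illustrates the general technique, not the plugin's API.
    """
    lines = source.splitlines(keepends=True)
    if not (1 <= start <= end <= len(lines)):
        raise ValueError(f"line range {start}-{end} is outside 1-{len(lines)}")

    new_lines = replacement.splitlines(keepends=True)
    # If more lines follow the edited range, make sure the replacement
    # ends with a newline so it does not merge with the next line.
    if new_lines and not new_lines[-1].endswith("\n") and end < len(lines):
        new_lines[-1] += "\n"

    return "".join(lines[: start - 1] + new_lines + lines[end:])


original = "def add(a, b):\n    return a - b\n\nprint(add(2, 3))\n"

# The model only sends the range and the new text, not the old content.
edited = apply_line_range_edit(original, 2, 2, "    return a + b")
print(edited)
```

The saving comes from what the model no longer has to emit: a string-match edit must repeat the old text so the tool can find it, while a line-range edit carries only two integers and the new text. The trade-off is that line numbers go stale after each edit, so a real implementation would need the model to work from a fresh, numbered read of the file.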
Developers using OpenCode and LLM-assisted coding workflows (AI/ML engineers, plus full-stack and backend developers who use code-editing LLMs)
93% token reduction via deterministic sanitization instead of trusting raw web content.
Compiles browser sessions into deterministic skills, slashing agent token costs by 90%.
Mac app wrapper around Headroom compression for Claude Code.
Cuts agent token costs by 98% compared to grep without needing GPU inference.
Static Model2Vec embeddings beat transformer retrieval quality while running entirely on CPU.