SatGate – Budget enforcement proxy for MCP tool calls (L402/macaroons)
Macaroon-based budget enforcement for AI agents—fills a real economic governance gap.

Hard caps that block requests mid-flight beat provider dashboards that alert 6 hours too late.
Developers running AI agents in production with budget constraints
Helicone · Portkey · Langfuse
So I built a proxy. You swap one environment variable, it routes through Costile instead of calling Anthropic directly, and when you hit your daily or monthly limit it blocks further requests immediately. No SDK changes, no code refactor. Took me about a weekend. Currently supports Anthropic, with OpenAI next.
It's MIT licensed and self-hostable in about 5 minutes. Try the demo at costile.com if you want to poke at it.
I've got anomaly detection on the roadmap, but I'm second-guessing the scope — is surfacing cost spikes enough, or do people actually need to know why the agent went off the rails? The former is straightforward to build, the latter is a much harder problem. Curious where others would draw that line.
Macaroon-based budget enforcement for AI agents—fills a real economic governance gap.
Transparent proxy blocks agent leaks to LLM APIs without touching agent code.
MCP budget gating as a zero-dep npx proxy—solves the real friction of runaway tool costs.
Tracks tokens not dollars—clever design that avoids pricing drift headaches.
SDK blocks over-budget calls without proxying traffic through their servers.
LiteLLM and OpenRouter already solve multi-provider routing better and have production users.