Wyolet Relay – high-throughput, open-source LLM router
Self-hosted LLM gateway pooling keys across providers for failover and cost tracking.

Yet another LLM gateway in a space that LiteLLM and Portkey already dominate.
Small teams building multi-LLM applications who want self-hosted control
LiteLLM · Portkey · Helicone
I got together with a few guys and we built an LLM gateway.
It's designed for small teams working on early-stage products, and can be deployed to AWS with a single command (`mantis deploy`).
It's self-hosted and designed to belong to you.
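For readers wondering what "pooling keys across providers for failover and cost tracking" might look like in practice, here is a minimal Python sketch. It is not Wyolet Relay's actual code: `PooledKey`, `KeyPool`, the flat `usd_per_1k_tokens` price, and the `send` callback are all hypothetical names made up for illustration, and a real gateway would call each provider's HTTP API inside `send`.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class PooledKey:
    """One API key in the pool (all fields are illustrative assumptions)."""
    provider: str
    api_key: str
    usd_per_1k_tokens: float  # hypothetical flat price, not a real rate
    spent_usd: float = 0.0


class AllKeysFailed(Exception):
    """Raised when every key in the pool has been tried and failed."""


class KeyPool:
    def __init__(self, keys: List[PooledKey]) -> None:
        self.keys = list(keys)

    def complete(
        self,
        prompt: str,
        send: Callable[[PooledKey, str], Tuple[str, int]],
    ) -> str:
        """Try keys in order; `send` returns (text, tokens_used) or raises."""
        errors = []
        for key in self.keys:
            try:
                text, tokens = send(key, prompt)
            except Exception as exc:  # failover: record the error, try the next key
                errors.append(f"{key.provider}: {exc}")
                continue
            # Cost tracking: charge the key that actually served the request.
            key.spent_usd += tokens / 1000 * key.usd_per_1k_tokens
            return text
        raise AllKeysFailed("; ".join(errors))

    def cost_by_provider(self) -> Dict[str, float]:
        """Sum recorded spend per provider across all pooled keys."""
        totals: Dict[str, float] = {}
        for key in self.keys:
            totals[key.provider] = totals.get(key.provider, 0.0) + key.spent_usd
        return totals
```

Trying keys in a fixed order keeps the sketch short. A production router would more likely weight keys by latency, quota, or price.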
Drop-in OpenAI API gateway with failover. LiteLLM does this too, but this one has a dashboard.
Go gateway with circuit breakers, but auth isn't production-ready yet.
First gateway with a native MCP server: connect Claude Code or Cursor in one command.
Semantic caching for LLM APIs already exists (Anthropic prompt caching, LangChain, Miniplex, vLLM); gateway routing is table stakes.
Product Algebra routing plus an explicit 'dharma' pipeline (no-self regularization, entropy/mindfulness metrics, compassion and ethos scores) is a strikingly specific approach: it moves beyond cost/capability heuristics into cross-modal interaction scoring and reputation-driven incentives. There's real engineering here (1s perception loop, SQLite memory, Telegram UX, multi-provider SDK support), but the repo reads young and claim-heavy. Before trusting it for critical workloads, I want reproducible benchmark artifacts, links from the code to the cited 439-model experiments, and clearer deployment/security guidance.
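Since one of the gateways listed above advertises circuit breakers, here is a rough Python sketch of the pattern for anyone unfamiliar with it. It is a generic illustration, not code from any of the projects mentioned; `CircuitBreaker`, `max_failures`, `reset_after`, and the injectable `clock` are assumed names chosen for the example.

```python
import time


class CircuitOpen(Exception):
    """Raised when calls are being short-circuited during the cooldown."""


class CircuitBreaker:
    def __init__(self, max_failures=3, reset_after=30.0, clock=time.monotonic):
        self.max_failures = max_failures  # consecutive failures before opening
        self.reset_after = reset_after    # cooldown in seconds
        self.clock = clock                # injectable so tests can fake time
        self.failures = 0
        self.opened_at = None             # None means the circuit is closed

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after:
                raise CircuitOpen("provider temporarily skipped")
            # Cooldown elapsed: half-open, let one trial call through.
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            # A failed trial call, or too many failures in a row, (re)opens it.
            if self.opened_at is not None or self.failures >= self.max_failures:
                self.opened_at = self.clock()
            raise
        # Success closes the circuit and resets the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

In a gateway, each upstream provider would typically get its own breaker, so that a failing provider is skipped quickly while the others keep serving traffic.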