Back to browse
GitHub Repository

Zero trust LLM gateway. OpenAI-compatible proxy with semantic routing and load balancing across OpenAI, Anthropic, Ollama, vLLM, and any compatible backend. Identity-based access, virtual API keys, and end-to-end encryption via OpenZiti

65 starsGo

LLM-Gateway – Zero-Trust LLM Gateway

by michaelquigley·Mar 27, 2026·7 points·1 comment

AI Analysis

●●SolidBig BrainSolve My Problem

Zero-trust networking via zrok beats LiteLLM when your GPUs sit behind NAT.

Strengths
  • zrok overlay networking eliminates port forwarding and VPN setup for distributed backends
  • Three-layer semantic routing cascade picks models without manual routing tables
  • Single Go binary with no database or message queue dependencies
Weaknesses
  • LLM gateway space is crowded with LiteLLM, OpenRouter, and cloud providers
  • zrok dependency adds complexity for teams already using Kubernetes service mesh
Target Audience

Teams deploying self-hosted LLMs across distributed infrastructure

Similar To

LiteLLM · OpenRouter · vLLM

Post Description

I built an OpenAI-compatible LLM gateway that routes requests to OpenAI, Anthropic, Ollama, vLLM, llama-server, SGLang... anything that speaks /v1/chat/completions. Single Go binary, one YAML config file, no infrastructure.

It does the things you'd expect from this kind of gateway... semantic routing via a three-layer cascade (keyword heuristics, embedding similarity, LLM classifier) that picks the best model when clients omit the model field, weighted round-robin load balancing across local inference servers with health checks and failover.

The part I think is most interesting is the network layer. The gateway and backends communicate over zrok/OpenZiti overlay networks... reach a GPU box behind NAT, expose the gateway to clients, put components anywhere with internet connectivity behind firewalls... no port forwarding, no VPN. Zero-trust in both directions. Most LLM proxies solve the API translation problem. This one also solves the network problem.

Apache 2.0. https://github.com/openziti/llm-gateway

I work for NetFoundry, which sponsors the OpenZiti project this is built on.

Similar Projects

Developer Tools●●Solid

LLM Gateway for OpenAI/Anthropic Written in Golang

Runs as a single binary with embedded SQLite and zero-config start, acting as a transparent, provider-agnostic proxy that logs model, tokens, latency, cost and API key hashes while leaving full body capture opt-in. It also proxies streaming responses in real time and exposes stable JSON analytics endpoints — a practical, instrumentable way to get reproducible, audit-ready traces for real LLM traffic, though long-term value depends on how it handles provider edge-cases and SDK compatibility.

Solve My ProblemNiche GemSlick
oatmale
423mo ago