GitHub Repository

Zero trust LLM gateway. OpenAI-compatible proxy with semantic routing and load balancing across OpenAI, Anthropic, Ollama, vLLM, and any compatible backend. Identity-based access, virtual API keys, and end-to-end encryption via OpenZiti

65 starsGo

LLM-Gateway – Zero-Trust LLM Gateway

Name: LLM-Gateway – Zero-Trust LLM Gateway
Availability: InStock
Author: michaelquigley

by michaelquigley·Mar 27, 2026·7 points·1 comment

Visit Project View on HN

AI Analysis

●●SolidBig BrainSolve My Problem

Zero-trust networking via zrok beats LiteLLM when your GPUs sit behind NAT.

Strengths

•zrok overlay networking eliminates port forwarding and VPN setup for distributed backends
•Three-layer semantic routing cascade picks models without manual routing tables
•Single Go binary with no database or message queue dependencies

Weaknesses

•LLM gateway space is crowded with LiteLLM, OpenRouter, and cloud providers
•zrok dependency adds complexity for teams already using Kubernetes service mesh

Post Description

I built an OpenAI-compatible LLM gateway that routes requests to OpenAI, Anthropic, Ollama, vLLM, llama-server, SGLang... anything that speaks /v1/chat/completions. Single Go binary, one YAML config file, no infrastructure.

It does the things you'd expect from this kind of gateway... semantic routing via a three-layer cascade (keyword heuristics, embedding similarity, LLM classifier) that picks the best model when clients omit the model field, weighted round-robin load balancing across local inference servers with health checks and failover.

The part I think is most interesting is the network layer. The gateway and backends communicate over zrok/OpenZiti overlay networks... reach a GPU box behind NAT, expose the gateway to clients, put components anywhere with internet connectivity behind firewalls... no port forwarding, no VPN. Zero-trust in both directions. Most LLM proxies solve the API translation problem. This one also solves the network problem.

Apache 2.0. https://github.com/openziti/llm-gateway

I work for NetFoundry, which sponsors the OpenZiti project this is built on.

Similar Projects

Infrastructure●●Solid

AI Cost Firewall – OpenAI-compatible gateway with semantic caching

LLM gateway with Redis + Qdrant caching, but LiteLLM does this.

SlickShip It

vcaluser

112mo ago

Infrastructure●●Solid

UnifyRoute – Self-hosted OpenAI-compatible LLM gateway with failover

Drop-in OpenAI API gateway with failover—LiteLLM does this but this has a dashboard.

Solve My ProblemSlick

unifyroute

113mo ago

Infrastructure●●Solid

LunarGate – a self-hosted OpenAI-compatible LLM gateway

Go gateway with circuit breakers, but auth isn't production-ready yet.

Ship ItNiche Gem

jmartenka

302mo ago

Developer Tools●●Solid

Lightport – AI gateway that makes LLM providers OpenAI-compatible

Stripped-down Portkey fork handling protocol translation for 77 providers without enterprise bloat.

Ship ItSolve My Problem

smokybay

101mo ago

Developer Tools●●Solid

LLM Gateway for OpenAI/Anthropic Written in Golang

Runs as a single binary with embedded SQLite and zero-config start, acting as a transparent, provider-agnostic proxy that logs model, tokens, latency, cost and API key hashes while leaving full body capture opt-in. It also proxies streaming responses in real time and exposes stable JSON analytics endpoints — a practical, instrumentable way to get reproducible, audit-ready traces for real LLM traffic, though long-term value depends on how it handles provider edge-cases and SDK compatibility.

Solve My ProblemNiche GemSlick

oatmale

423mo ago

Infrastructure●Mid

A peer-2-peer network for sharing and accessing LLMs through OpenAI API

Distributed LLM inference over P2P instead of centralized APIs, but early-stage and unproven.

Bold BetBig Brain

modinfo

113mo ago