Back to browse
GitHub Repository

AI gateway written in Go. Lightweight unified OpenAI-compatible API for OpenAI, Anthropic, Gemini, Groq, xAI & Ollama. LiteLLM alternative with observability, guardrails, streaming, costs and usage tracking.

938 starsGo

GoModel – an open-source AI gateway in Go; 44x lighter than LiteLLM

by santiago-pl·Apr 21, 2026·217 points·75 comments

AI Analysis

●●SolidSlickSolve My Problem

17MB Docker image versus LiteLLM's 746MB is a genuine engineering win.

Strengths
  • Go implementation delivers 44x smaller image than Python-based LiteLLM alternative
  • Environment-variable-first config avoids complex YAML files
  • Semantic caching layer reduces AI spend without code changes
Weaknesses
  • AI gateway category is crowded with Helicone, Portkey, and LiteLLM itself
  • Feature parity with established tools remains unproven at scale
Target Audience

Backend developers building AI-powered applications

Similar To

LiteLLM · Helicone · Portkey

Post Description

Hi, I’m Jakub, a solo founder based in Warsaw.

I’ve been building GoModel since December with a couple of contributors. It's an open-source AI gateway that sits between your app and model providers like OpenAI, Anthropic or others.

I built it for my startup to solve a few problems : - track AI usage and cost per client or team - switch models without changing app code - debug request flows more easily - reduce AI spendings with exact and semantic caching

How is it different? - ~17MB docker image - LiteLLM's image is more than 44x bigger ("docker.litellm.ai/berriai/litellm:latest" ~ 746 MB on amd64) - request workflow is visible and easy to inspect - config is environment-variable-first by default

I'm posting now partly because of the recent LiteLLM supply-chain attack. Their team handled it impressively well, but some people are looking at alternatives anyway, and GoModel is one.

Website: https://gomodel.enterpilot.io

Any feedback is appreciated.

Similar Projects

Developer Tools●●Solid

LLM Gateway for OpenAI/Anthropic Written in Golang

Runs as a single binary with embedded SQLite and zero-config start, acting as a transparent, provider-agnostic proxy that logs model, tokens, latency, cost and API key hashes while leaving full body capture opt-in. It also proxies streaming responses in real time and exposes stable JSON analytics endpoints — a practical, instrumentable way to get reproducible, audit-ready traces for real LLM traffic, though long-term value depends on how it handles provider edge-cases and SDK compatibility.

Solve My ProblemNiche GemSlick
oatmale
423mo ago