TUI Settlers of Catan built with Llamafile and Bonsai PrismML Models
Runs completely offline with llamafile but LLMs make unreliable Catan negotiation partners.

Single executable bundles models and runs everywhere — still no other tool does portable LLMs this well.
ML engineers, developers deploying local LLMs, infrastructure teams
llama.cpp · Ollama · LM Studio
Runs completely offline with llamafile but LLMs make unreliable Catan negotiation partners.
Hub-and-spoke IR translates LLM APIs without N^2 adapter hell.
LiteLLM already does this with more providers, more features, and way more maturity.
Runs as a single binary with embedded SQLite and zero-config start, acting as a transparent, provider-agnostic proxy that logs model, tokens, latency, cost and API key hashes while leaving full body capture opt-in. It also proxies streaming responses in real time and exposes stable JSON analytics endpoints — a practical, instrumentable way to get reproducible, audit-ready traces for real LLM traffic, though long-term value depends on how it handles provider edge-cases and SDK compatibility.
Selling a $49 system prompt with 3 stars and no visible technical differentiation.
SSH proxy trick keeps LLM execution local while commands run on air-gapped servers.