Back to browse
GitHub Repository

Orchestrate an entire AI dev team on 5GB VRAM. Ephemeral subagents, exact-match diffs. Single static binary, any model. Zero config, zero context bloat.

349 starsGo

Late – A subagent orchestrator TUI for local LLMs (Go/Linux)

by mhher·Feb 21, 2026·1 point·1 comment

AI Analysis

●●SolidBig BrainNiche Gem

Subagent loops with strict diff parsing beat context stuffing, but llama.cpp needs a specific PR branch.

Strengths
  • State machine approach with <80-line core prompt avoids hallucination-prone context stuffing pattern.
  • Native MCP implementation lets it compose any standard tool server without reimplementation.
  • Subagent delegation with fresh contexts sidesteps the cumulative context window problem elegantly.
Weaknesses
  • Upstream llama.cpp requires a specific unmerged PR (#18675) — stability dependency on external patch.
  • TUI-only interface; no API or headless mode limits integration into existing workflows.
Target Audience

Developers running local LLMs (llama.cpp, Ollama) who want coding agents without API calls or token bloat.

Similar To

Cursor (agentic coding) · Cline/Continue (local LLM agents) · OpenDevin (task orchestration)

Similar Projects

AI/MLMid

Robust alternative to OpenClaw

Promises smart model routing but only ships a hello-world plugin so far.

Bold BetShip It
opak
412mo ago