Sub-Agent MCP: LLM delegation and sub-agent orchestration via MCP
YAML-defined sub-agents with tool allowlists beat monolithic agent context bloat.
A tiny async task-tree orchestrator library for Python, behavior-tree inspired and LLM-ready.
Behavior-tree orchestration for agents when LangGraph and AutoGen already exist.
Developers building LLM agent workflows
LangGraph · AutoGen · CrewAI
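For readers unfamiliar with the behavior-tree style mentioned above, here is a minimal async sketch of the idea. It is not this library's API: `Status`, `Action`, `Sequence`, and `Selector` are hypothetical names chosen for illustration, and a real behavior tree usually also has a RUNNING status, which is omitted here.

```python
import asyncio
from enum import Enum


class Status(Enum):
    SUCCESS = "success"
    FAILURE = "failure"


class Action:
    """Leaf node: wraps an async callable that returns True or False."""

    def __init__(self, name, fn):
        self.name = name
        self.fn = fn

    async def tick(self, ctx):
        return Status.SUCCESS if await self.fn(ctx) else Status.FAILURE


class Sequence:
    """Runs children in order; fails as soon as one child fails."""

    def __init__(self, *children):
        self.children = children

    async def tick(self, ctx):
        for child in self.children:
            if await child.tick(ctx) is Status.FAILURE:
                return Status.FAILURE
        return Status.SUCCESS


class Selector:
    """Runs children in order; succeeds as soon as one child succeeds."""

    def __init__(self, *children):
        self.children = children

    async def tick(self, ctx):
        for child in self.children:
            if await child.tick(ctx) is Status.SUCCESS:
                return Status.SUCCESS
        return Status.FAILURE
```

A tree such as `Sequence(Selector(cache_lookup, fetch), summarize)` would try the cache first, fall back to fetching, and summarize only if one of the two succeeded; an LLM call would sit inside an `Action`.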
AgentForge compresses common production patterns (token-aware rate limiting with a token bucket, retry with exponential backoff, prompt templates, and cost tracking) into a tiny async core and lets you switch providers with one parameter. The multi-agent mesh and ReAct loop are the most interesting engineering bets here, and the repo includes benchmarks and a Streamlit demo. It lives in a crowded space next to LangChain and similar toolkits, though, so real differentiation will come from adoption and edge-case robustness.
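To make the two patterns named above concrete, here is a rough sketch of a token-aware token bucket plus retry with exponential backoff. It is not AgentForge's code: `TokenBucket`, `call_with_backoff`, and every parameter name are assumptions made for illustration.

```python
import asyncio
import random
import time


class TokenBucket:
    """Token-aware limiter: each request spends its estimated LLM token count."""

    def __init__(self, rate_per_sec, capacity, clock=time.monotonic):
        self.rate = rate_per_sec      # tokens refilled per second
        self.capacity = capacity      # maximum burst size
        self.available = capacity
        self._clock = clock           # injectable so the limiter can be tested
        self._last = clock()

    def _refill(self):
        now = self._clock()
        self.available = min(self.capacity,
                             self.available + (now - self._last) * self.rate)
        self._last = now

    def try_acquire(self, tokens):
        """Spend `tokens` if the bucket holds enough; otherwise leave it untouched."""
        self._refill()
        if tokens <= self.available:
            self.available -= tokens
            return True
        return False

    def wait_time(self, tokens):
        """Seconds until `tokens` would be available at the current refill rate."""
        self._refill()
        return max(0.0, (tokens - self.available) / self.rate)


async def call_with_backoff(fn, *, retries=3, base_delay=0.5, max_delay=8.0,
                            sleep=asyncio.sleep):
    """Retry an async callable, doubling the delay cap after each failure."""
    for attempt in range(retries + 1):
        try:
            return await fn()
        except Exception:
            if attempt == retries:
                raise
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Jitter spreads out retries from concurrent callers.
            await sleep(delay * random.uniform(0.5, 1.0))
```

In practice a caller would check `try_acquire(estimated_tokens)` before each provider request, sleep for `wait_time(...)` when it returns False, and wrap the request itself in `call_with_backoff`.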
Smart local-first routing that escalates to expensive cloud planners only when necessary is the standout idea. Combined with per-run cost accounting and full Ollama offline support, it solves a real operational itch. The repo is a pragmatic, CLI/TUI-focused toolkit (scraping + cache, MCP server mode) that feels useful for teams wanting a no-friction orchestrator, but it's playing in a crowded space of agent frameworks, so the novelty is incremental rather than revolutionary.
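The routing idea can be sketched in a few lines. This is an illustration under stated assumptions, not the repo's implementation: `Model`, `RunLedger`, `route`, the `is_good_enough` check, and the prices are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Model:
    name: str
    cost_per_1k_tokens: float                 # hypothetical price; 0.0 for a local model
    run: Callable[[str], Tuple[str, int]]     # returns (answer, tokens_used)


@dataclass
class RunLedger:
    """Per-run cost accounting: one entry per model call."""
    entries: List[Tuple[str, int, float]] = field(default_factory=list)

    def record(self, model: Model, tokens: int) -> None:
        cost = tokens / 1000 * model.cost_per_1k_tokens
        self.entries.append((model.name, tokens, cost))

    @property
    def total_cost(self) -> float:
        return sum(cost for _, _, cost in self.entries)


def route(task: str, local: Model, cloud: Model,
          is_good_enough: Callable[[str], bool], ledger: RunLedger) -> str:
    """Try the local model first; escalate to the cloud model only if needed."""
    answer, tokens = local.run(task)
    ledger.record(local, tokens)
    if is_good_enough(answer):
        return answer
    answer, tokens = cloud.run(task)
    ledger.record(cloud, tokens)
    return answer
```

The quality check is the hard part in a real system; here it is left as a caller-supplied predicate so the escalation logic stays visible.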
AgentForge packs provider adapters (Claude, GPT-4, Gemini, Perplexity), token-aware rate limiting, retry/backoff, and a MockLLMClient for tests into a tiny dependency surface; the 15KB footprint and 2 dependencies are attention-grabbers. The 3-tier Redis cache and benchmark claims (huge latency/memory wins vs LangChain, 88% cache hit rate) make it a tempting low-overhead alternative, though you should validate provider feature parity and the benchmarks against your own workload.
Mycelium-style bus lets parallel Claude sessions share context without a central orchestrator.
Persistent Python runtime keeps state alive across tool calls, unlike Claude Code's stateless tools.