Phone a Friend for Claude Code – GPT, Gemini, DeepSeek via MCP
Claude debates GPT and Gemini in parallel rounds; costs $0.02–0.05 per brainstorm.
All-in-one AI chat studio — 7 providers (Ollama, Claude, OpenAI, vLLM, Claude Code, Codex, Gemini CLI), RAG knowledge base, MCP tool integration, Mem0 shared memory, and 3-step pipeline. 100% local-capable. MIT licensed.
Hybrid pipeline splits reasoning (cloud) and execution (local), but multi-model orchestration is becoming crowded.
Developers and AI practitioners who want hybrid cloud+local LLM workflows without high API costs
LM Studio · Jan.ai · LocalAI
Phase 1 – A cloud LLM (Claude/GPT/Gemini) decomposes the prompt into structured sub-tasks Phase 2 – Local Ollama models process each sub-task (free, private, runs on your GPU) Phase 3 – The cloud LLM integrates the results into a coherent final answer
The motivation: cloud APIs are great at reasoning and structure but cost money. Local Ollama models are free but sometimes inconsistent. The pipeline lets you use each where it's strongest.
Also includes: - FastAPI + React web UI (accessible from LAN/mobile) - SQLite chat history - ChromaDB-based RAG - Discord webhook notifications
Stack: Python, PyQt6, FastAPI, React, Ollama, Anthropic/OpenAI/Google APIs. MIT license.
Claude debates GPT and Gemini in parallel rounds; costs $0.02–0.05 per brainstorm.
Smart key management via 1Password keeps secrets out of Claude's context window.
Document parsing A/B test arena with ELO ranking—niche but real alternative to OCR Arena.
Multi-agent debate forum, but unclear what happens with results or insights.
Orchestrates multi-AI governance, but demo is theater—no production backend, unclear scaling story.
Orchestrates real-time skepticism between models to catch hallucinations before you see them.