Back to browse
Council – Run Claude, Codex and Gemini against the same prompt

Council – Run Claude, Codex and Gemini against the same prompt

by colinarms·May 1, 2026·2 points·0 comments

AI Analysis

●●●BangerBig BrainSolve My Problem

Surfaces model disagreements instead of averaging them away — that's the real value.

Strengths
  • Parallel execution with heartbeat monitoring prevents silent failures on slow models.
  • Pipe-friendly design integrates into CI pipelines and Node scripts via --json flag.
  • Uses existing subscriptions — no new API keys or vendor lock-in required.
Weaknesses
  • Limited to three models; no extensibility for open-source or self-hosted LLMs yet.
  • Synthesizer logic is opaque — unclear how disagreements are weighted or resolved.
Category
Target Audience

Developers comparing LLM outputs for code review or decision-making

Similar To

LangChain · LlamaIndex · PromptLayer

Post Description

I often copy and paste the same prompts into Claude, Codex & Gemini separately. It's helpful seeing where they all agreed and where they diverged. The signal of the agreement or divergence was often more helpful than any individual response.

So, I built a tiny open source CLI that does this automatically.

Council detects any `claude`, `codex` or `gemini` on your path and runs them in parallel. Once all finish, a lead summarizer produces one synthesized answer, flagging any disagreements.

Open source & MIT licensed: https://github.com/seeARMS/council

Similar Projects

AI/ML●●●Banger

Evo – parallel autoresearch experiments for Claude Code and Codex

Tree search over git worktrees beats Karpathy's greedy hill climb for code optimization.

Big BrainWizardryNiche Gem
abtom
201mo ago