GitHub Repository

All-in-one AI chat studio — 7 providers (Ollama, Claude, OpenAI, vLLM, Claude Code, Codex, Gemini CLI), RAG knowledge base, MCP tool integration, Mem0 shared memory, and 3-step pipeline. 100% local-capable. MIT licensed.

8 starsPython

I built a desktop app combining Claude, GPT, Gemini with local Ollama

Name: I built a desktop app combining Claude, GPT, Gemini with local Ollama
Availability: InStock
Author: tsunamayo

by tsunamayo·Mar 1, 2026·1 point·1 comment

Visit Project View on HN

AI Analysis

●●SolidSolve My ProblemSlick

Hybrid pipeline splits reasoning (cloud) and execution (local), but multi-model orchestration is becoming crowded.

Strengths

•3-phase decompose-execute-integrate pipeline uses cloud reasoning where it's strong (structure) and local inference for cost
•Dual interface (PyQt6 desktop + React web UI) means access from PC, phone, and LAN without extra deployment
•Built-in ChromaDB RAG and Discord webhooks show complete feature set, not just a proof-of-concept

Weaknesses

•Windows-only desktop app limits adoption; macOS and Linux users stuck with web UI
•Competes directly with LM Studio, Jan, and LocalAI—no clear differentiation beyond the 3-phase approach

Post Description

I built a desktop app (PyQt6, Windows) that orchestrates multiple AI models in a 3-phase pipeline:

Phase 1 – A cloud LLM (Claude/GPT/Gemini) decomposes the prompt into structured sub-tasks Phase 2 – Local Ollama models process each sub-task (free, private, runs on your GPU) Phase 3 – The cloud LLM integrates the results into a coherent final answer

The motivation: cloud APIs are great at reasoning and structure but cost money. Local Ollama models are free but sometimes inconsistent. The pipeline lets you use each where it's strongest.

Also includes: - FastAPI + React web UI (accessible from LAN/mobile) - SQLite chat history - ChromaDB-based RAG - Discord webhook notifications

Stack: Python, PyQt6, FastAPI, React, Ollama, Anthropic/OpenAI/Google APIs. MIT license.

Similar Projects

Developer Tools●●Solid

Phone a Friend for Claude Code – GPT, Gemini, DeepSeek via MCP

Claude debates GPT and Gemini in parallel rounds; costs $0.02–0.05 per brainstorm.

Crowd PleaserShip It

spranab

103mo ago

Developer Tools●●Solid

Dragoman – Multi-model routing for Claude Code via sub-agents

Smart key management via 1Password keeps secrets out of Claude's context window.

Solve My ProblemCozy

asakin

1023d ago

Developer Tools●●Solid

A/B test your own VLMs for document parsing (Self-hosted Arena)

Document parsing A/B test arena with ELO ranking—niche but real alternative to OCR Arena.

Solve My ProblemSlickNiche Gem

matthew624

103mo ago

AI/ML●Mid

Agent Alcove – Claude, GPT, and Gemini debate across forums

Multi-agent debate forum, but unclear what happens with results or insights.

Crowd PleaserRabbit Hole

nickvec

64263mo ago

Other●Mid

Integrate governance before your AI stack executes – COMMAND console

Orchestrates multi-AI governance, but demo is theater—no production backend, unclear scaling story.

Bold Bet

Burnmydays

103mo ago

AI/ML●●Solid

A multi-model interface where LLMs debate with each other

Orchestrates real-time skepticism between models to catch hallucinations before you see them.

Solve My ProblemShip It

capibara13

4921d ago