Thisorthis.ai – Compare responses from 50 AI models side-by-side

Name: Thisorthis.ai – Compare responses from 50 AI models side-by-side
Availability: InStock
Author: parthsamin

by parthsamin·Feb 24, 2026·2 points·0 comments

Visit Project View on HN

AI Analysis

●●SolidSolve My ProblemCrowd Pleaser

Kills the copy-paste workflow, but model comparison UIs already exist elsewhere.

Strengths

•Smart multi-model aggregation in one interface eliminates manual tab-switching pain.
•SmartPick LLM evaluator (Clarity, Accuracy, Completeness, Helpfulness) adds real value vs dumb side-by-side.
•Workspaces with persistent system prompts and history reduce prompt-setup friction for power users.

Weaknesses

•Crowded space: Poe, Hugging Face Spaces, and Claude Projects already compare models at scale.
•Freemium model depends on API costs; unclear if pricing aligns with user switching from free alternatives.

Post Description

Hey HN — I'm Parth, I built thisorthis.ai because I was tired of copy-pasting the same prompt across ChatGPT, Claude, and Gemini tabs to figure out which model actually gave the best answer.

What it does: You type one prompt, pick 2–6 models (we support 47 text models and several image models across OpenAI, Anthropic, Google, xAI, Meta, Amazon, Mistral, Cohere, AI21), and see every response side-by-side. There's also a feature called SmartPick that uses an LLM evaluator to score each response on Clarity, Accuracy, Completeness, and Helpfulness — useful when you're comparing 6 models and don't want to read everything carefully.

Beyond comparison, there are two other things I've built:

Workspaces — You can create multi-panel layouts where each panel has its own model, system prompt, and conversation history. So instead of "Hey ChatGPT, you're a code reviewer" every time, you set it once and the panel remembers. I use a "Customer Support" workspace with 6 panels daily — Ticket Drafter on Claude Haiku, Escalation Handler on Sonnet, Knowledge Base Builder on GPT-4o, etc.

Prompts Library — Hundreds of prompts across 10 categories. Less interesting technically, but saves a surprising amount of time.

Some things HN might care about: * No API keys needed — we handle all provider connections * Private Mode does zero-trace testing (nothing stored, nothing logged) * Everything is encrypted at rest * Image generation comparison works too (ChatGPT Image vs Grok Imagine vs Gemini) * Free tier exists with limited models and capacity. Paid tiers are $29/$59/$99.

Tech stack if anyone's curious: AWS (DynamoDB, Lambda, SQS, S3), with separate provider integrations for each AI model. The tricky part was building context management for multi-turn conversations across different providers — each has its own message format, token limits, and quirks.

We hit #11 on Product Hunt last year when we launched and have ~15K users. But honestly the feedback I most want is from this community — what's missing, what's broken, what would make you actually use this daily?

Happy to answer any questions about the architecture, pricing model, or anything else.

Similar Projects

AI/ML●●●Banger

Council – Run Claude, Codex and Gemini against the same prompt

Surfaces model disagreements instead of averaging them away — that's the real value.

Big BrainSolve My Problem

colinarms

202mo ago

AI/ML●●Solid

Why use one AI model when you can use all of them at once!

One prompt, many models — that simple idea is executed with practical extras: independent conversation threads per model, full-text history/search, and bring‑your‑own API keys so you don't copy/paste. The landing page sells the daily‑driver vibe (lifetime one-time pricing is an attention grabber), but the concept itself is not novel; I'd want clearer UI for cost controls, API key security and model/version management before trusting it for heavy use.

SlickSolve My Problem

lurker325

105mo ago

AI/ML●Mid