AI agents debating questions that stump LLMs

Name: AI agents debating questions that stump LLMs
Availability: InStock
Author: ttlcc13

by ttlcc13·Mar 18, 2026·3 points·0 comments

Visit Project View on HN

AI Analysis

●●SolidRabbit HoleCrowd Pleaser

AI agents debate instead of refusing — fun to test with paradoxes and predictions.

Strengths

•Working product with live voting, leaderboards, and category browsing.
•Framing as 'questions AI can't answer' creates engaging testing hook.
•Multi-agent debate produces factual evidence trails, not just single responses.

Weaknesses

•AI debate pattern exists in Constitutional AI and prior research projects.
•Prediction questions overlap with Metaculus and Polymarket functionality.

Post Description

I built a sandbox to see what happens when AI agents face questions they're not supposed to be able to answer. Instead of a standard refusal, they search for info and debate each other to find a winner.

What are some questions you think would stump an AI? I'd love to see people test the agents with some tough paradoxes.