AI agents debating questions that stump LLMs

by ttlcc13 · Mar 18, 2026 · 3 points · 0 comments

AI Analysis

●● Solid · Rabbit Hole · Crowd Pleaser

AI agents debate instead of refusing. Fun to test with paradoxes and predictions.

Strengths
  • Working product with live voting, leaderboards, and category browsing.
  • Framing as 'questions AI can't answer' creates an engaging testing hook.
  • Multi-agent debate produces factual evidence trails, not just single responses.
Weaknesses
  • The AI debate pattern already exists in Constitutional AI and prior research projects.
  • Prediction questions overlap with Metaculus and Polymarket functionality.
Target Audience

AI researchers, prediction market enthusiasts, curious testers

Similar To

Metaculus · Polymarket · Constitutional AI debates

Post Description

I built a sandbox to see what happens when AI agents face questions they're not supposed to be able to answer. Instead of a standard refusal, they search for info and debate each other to find a winner.

What are some questions you think would stump an AI? I'd love to see people test the agents with some tough paradoxes.
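
The search-then-debate flow described in the post might look roughly like the sketch below. This is not the project's actual code: the post gives no implementation details, so `Argument`, `run_debate`, `make_stub_agent`, and `judge_by_evidence` are all hypothetical names, the agents are stubs that return canned claims rather than calling a model or a search API, and the evidence-counting judge is only a placeholder (the listing mentions live voting, so the real winner may be chosen differently).

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Argument:
    """One turn in the debate (hypothetical structure, not from the project)."""
    agent: str
    claim: str
    evidence: list[str] = field(default_factory=list)


# An agent is modeled as a function: (question, transcript so far) -> Argument.
Agent = Callable[[str, list[Argument]], Argument]


def run_debate(question: str, agents: dict[str, Agent], judge, rounds: int = 2):
    """Let every agent speak once per round, then ask the judge for a winner."""
    transcript: list[Argument] = []
    for _ in range(rounds):
        for agent in agents.values():
            # Each agent sees the full transcript, so it could rebut earlier turns.
            transcript.append(agent(question, transcript))
    return judge(question, transcript), transcript


def make_stub_agent(name: str, claim: str, sources: list[str]) -> Agent:
    """Stand-in for a real agent that would search the web and call a model."""
    def agent(question: str, transcript: list[Argument]) -> Argument:
        return Argument(agent=name, claim=claim, evidence=list(sources))
    return agent


def judge_by_evidence(question: str, transcript: list[Argument]) -> str:
    """Placeholder judge: the agent that cited the most evidence overall wins."""
    totals: dict[str, int] = {}
    for arg in transcript:
        totals[arg.agent] = totals.get(arg.agent, 0) + len(arg.evidence)
    return max(totals, key=totals.get)


agents = {
    "optimist": make_stub_agent("optimist", "Yes", ["source-a", "source-b"]),
    "skeptic": make_stub_agent("skeptic", "No", ["source-c"]),
}
winner, transcript = run_debate("Will it rain tomorrow?", agents, judge_by_evidence)
print(winner, len(transcript))
```

Keeping the judge as a separate function means the winner could just as easily come from user votes or another model without touching the debate loop.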

Similar Projects