Back to browse
I built a game where domain experts try to break frontier AI

I built a game where domain experts try to break frontier AI

by camillemolas·Mar 18, 2026·3 points·8 comments

AI Analysis

●●●BangerBold BetZero to One

Paid bounties for experts who catch AI failures in their field.

Strengths
  • Financial incentives drive high-quality expert validation of AI failures
  • Permanent failure record creates lasting value beyond individual challenges
  • Live scoreboard gamifies human vs AI performance across domains
Weaknesses
  • Requires critical mass of credentialed experts to validate failures meaningfully
  • AI evaluation platforms like HumanEval already exist for benchmarking
Category
Target Audience

Professional experts in medicine, law, finance, trades, and coding

Similar To

HumanEval · Scale AI · Mechanical Turk

Similar Projects

AI/ML●●Solid

Can Europe train a frontier AI model on the compute it owns?

Sourced model with 52 tests showing federated compute beats waiting for grid power.

Big BrainBold Bet
smashini
1372773d ago