I built a game where domain experts try to break frontier AI


by camillemolas · Mar 18, 2026 · 3 points · 8 comments

AI Analysis

●●● Banger · Bold Bet · Zero to One

Paid bounties for experts who catch AI failures in their field.

Strengths

Weaknesses

• Requires a critical mass of credentialed experts to validate failures meaningfully
• AI evaluation benchmarks such as HumanEval already exist

LLM showdown in Snake, but the novelty wears off after five minutes of watching.

Crowd Pleaser · Rabbit Hole

giza182

323mo ago

Compute capex simulator inspired by Dario Amodei's appearance on the Dwarkesh podcast, but purely a toy model.

Cozy · Rabbit Hole

jimmyechan

804mo ago

AI/ML · ●● Solid

Sourced model with 52 tests showing federated compute beats waiting for grid power.

Big Brain · Bold Bet

smashini

1372773d ago

AI/ML · ●● Solid

Beats humans at pronunciation scoring but doesn't ship product integration yet.

Big Brain · Wizardry

fabiosuizu

1313mo ago

Security · ●●●● Gem

First multi-GPU TEE stack for training trillion-parameter models with under 10% overhead.

Wizardry · Bold Bet · Zero to One

oscarmoxon

713mo ago

YAML schema for DDD artifacts that lets LLMs read and write your domain model.

Niche Gem · Solve My Problem

goloroden

201mo ago