I built a thinking framework for Claude
The site pairs a concrete 5-step rubric (ground facts, stress-test failures, reframe, trace implications, audit reasoning) with a blind A/B UI so humans can judge if structure actually helps an LLM — neat experimental rigor that most prompt experiments skip. It’s clever and pragmatic, but the idea isn’t brand-new (it’s essentially formalized chain-of-thought prompting) and the scope is narrow (Claude-only, small sample of topics).