Clarity, See what concepts your LLM uses and trace it to training data
Trace LLM outputs to training data when most interpretability tools are post-hoc.

Idle game mechanics actually teach overfitting and compute tradeoffs correctly.
Students and non-technical people learning ML concepts
I'm a professor in AI and wanted to make model training feel intuitive for people without a technical background. The loss curve graph, the overfitting trap, the compute/quality tradeoff - all mechanics grounded in how training actually works.
Phase 1 is live (~20 min). AI safety mechanics will come in Phase 2 with rival labs and regulatory pressure.
Built with React + Vite.
Curious what HN thinks, especially anyone in AI who can tell me what I got wrong.
Trace LLM outputs to training data when most interpretability tools are post-hoc.
First LLM with per-token interpretability tracing input, concepts, and training provenance.
Fixed-latency language-rule decisions beat traditional token-by-token LLM agents.
Karpathy's microgpt in C99, proves tiny coordinated models beat single large models on logic.
Build a LLaMA-style model from scratch with zero ML prerequisites or math.
Sourced model with 52 tests showing federated compute beats waiting for grid power.