Steerling-8B, a language model that can explain any token it generates
First LLM with per-token interpretability tracing input, concepts, and training provenance.

Trace LLM outputs to training data when most interpretability tools are post-hoc.
AI researchers, ML engineers, teams needing model interpretability
Anthropic interpretability research · Mechanistic interpretability tools
First LLM with per-token interpretability tracing input, concepts, and training provenance.
GDB trace debugging for FORTH, but it's a workflow demo without reusable tooling.
Remote tool invocation from UI is clever for debugging agentic workflows.
Automated rollback on regression is a killer feature LangSmith doesn't have.
Deterministic rule extraction from traces — same input always produces same output, no tokens.
Replays agent traces step-by-step to pinpoint exact failure turns automatically.