TLA PreCheck – TS DSL that proves state machines via TLA+
TLA+ model checking without learning TLA+ — build fails if spec and code diverge.
Formal verification for LLM workflows—CTL model checking, Z3 proofs, zero hallucination math.
Enterprise teams building LLM automation systems requiring provable correctness.
LangChain · CrewAI · Pydantic validators for structured output
So I built Aura-State — an open-source Python framework that compiles LLM workflows into formally verified state machines.
Instead of hoping your agent does the right thing, Aura-State proves it does — using algorithms from hardware verification and statistical learning:
CTL Model Checking — proves workflow safety before execution (same technique used in flight control systems) Z3 Theorem Prover — formally verifies every LLM extraction against business rules Conformal Prediction — calibrated 95% confidence intervals on every field MCTS Routing — Monte Carlo Tree Search scores ambiguous transitions Sandboxed AST — zero-hallucination math, compiled from English rules
Live benchmark results (GPT-4o-mini, 10 transcripts): → 100% budget extraction accuracy → 20/20 Z3 proofs passed → 3/3 temporal properties proven → 65 unit tests passing
The gap between "it usually works" and "it provably works" is smaller than people think.
TLA+ model checking without learning TLA+ — build fails if spec and code diverge.
Forces 13B models to solve SWE-bench tasks by making the problem smaller, not the model bigger.
Compile-time generated scheduler beats manual match-loop-state hell for complex state machines.
Formal verification for AI agents before compilation, unlike LangChain or AutoGen.
Formally verified EVM bytecode with zero sorries—actually ships working proofs.
Dafny + Claude Code creates provably correct React logic, but limited to greenfield projects.