ContextSubstrate – Capture, diff, replay AI agent runs (Git agent work)
Git for AI agent runs—pack, diff, replay, and verify agent work with content addressing.
agent-replay is a 100% local, SQLite-powered CLI tool for time-travel debugging AI agents that lets you replay execution traces, diff behavioral changes, fork runs to test fixes, and run AI-powered evaluations or safety guardrails to eliminate hallucinations and production failures.
Replay, fork, diff, eval agent traces locally—like Git for agent behavior, fills a real gap.
AI agent developers, teams running production agentic systems, prompt/model debugging workflows
Langfuse · Arize AI · Weights & Biases Traces
Git for AI agent runs—pack, diff, replay, and verify agent work with content addressing.
Fork from step 8 and replay downstream — saves money when agents fail at step 9.
Replay-first architecture beats LangSmith's static traces for debugging non-deterministic agents.
Deterministic state capture solves the impossible 'reproduce this bug' problem.
Turns an agent run into a verifiable .epi bundle you can hand to auditors or replay locally for debugging. Concrete engineering choices stand out — crash-safe SQLite WAL storage, Ed25519 sealing, and an embedded viewer — though wider integrations (Kubernetes/CICD hooks, verifier tooling) and stronger ecosystem docs will be needed for real adoption.
Execution anchors enable replay-from-step debugging for non-deterministic agent runs.