GitHub Repository

agent-replay is a 100% local, SQLite-powered CLI tool for time-travel debugging AI agents that lets you replay execution traces, diff behavioral changes, fork runs to test fixes, and run AI-powered evaluations or safety guardrails to eliminate hallucinations and production failures.

5 starsTypeScript

Time-travel debugging and side-by-side diffs for AI agents

Name: Time-travel debugging and side-by-side diffs for AI agents
Availability: InStock
Author: hireclay

by hireclay·Feb 28, 2026·1 point·0 comments

Visit Project View on HN

AI Analysis

●●●BangerSolve My ProblemShip ItBig Brain

Replay, fork, diff, eval agent traces locally—like Git for agent behavior, fills a real gap.

Strengths

•Time-travel debugging (replay, fork from any step, change input mid-trace) is genuinely novel for agents
•SQLite local-first architecture means zero cloud dependency, works offline with full trace history
•Automatic evaluations (hallucination detection, guardrails, golden datasets) solve agent quality blind spot

Weaknesses

•Work-in-progress status; unclear if core replay/fork mechanics are fully functional or MVP promises
•No comparison to Langfuse, Arize, or Weights & Biases tracing; positioning vs. observability platforms fuzzy

Post Description

agent-replay provides time-travel debugging and side-by-side diffs to pinpoint exactly where AI agents hallucinate or fail. It replaces manual log diving with a local-first toolkit to replay, fork, and automatically evaluate agent traces for faster iteration. It's a work-in-progress. I'd love any feedback. Thank you.

Similar Projects

Developer Tools●●Solid

ContextSubstrate – Capture, diff, replay AI agent runs (Git agent work)

Git for AI agent runs—pack, diff, replay, and verify agent work with content addressing.

Big BrainWizardry

scalefirst

113mo ago

AI/ML●●●Banger

Time Machine – Debug AI Agents by Forking and Replaying from Any Step

Fork from step 8 and replay downstream — saves money when agents fail at step 9.

Solve My ProblemZero to One

deva00

212mo ago

Developer Tools●●Solid

SafeRun – Replay debugging and inline prevention for AI agents

Replay-first architecture beats LangSmith's static traces for debugging non-deterministic agents.

Ship ItSolve My Problem

Tidianez

1114d ago

AI/ML●●●Banger

SafeRun – Replay debugging and inline prevention for AI agents 3

Deterministic state capture solves the impossible 'reproduce this bug' problem.

Zero to OneSolve My Problem

Tidianez

3013d ago

AI/ML●●Solid

EPI – Cryptographically verifiable execution artifacts for AI agents

Turns an agent run into a verifiable .epi bundle you can hand to auditors or replay locally for debugging. Concrete engineering choices stand out — crash-safe SQLite WAL storage, Ed25519 sealing, and an embedded viewer — though wider integrations (Kubernetes/CICD hooks, verifier tooling) and stronger ecosystem docs will be needed for real adoption.

Niche GemWizardry

afridi_epilabs

103mo ago

Developer Tools●●Solid

A step debugger for AI agents

Execution anchors enable replay-from-step debugging for non-deterministic agent runs.

Big BrainDark Horse

sesquieu

102mo ago