Handler – Open-source local sandboxes and control plane for code agents
Firecracker microVMs isolate coding agents so you can review before merging.
agentctl is a focused v1 control layer for coding agents. It gates a small set of high-risk actions, records structured traces for every decision, and replays prior sessions against a different policy.
Replaying past sessions against stricter policies beats guessing rules before you know the risks.
Developers using autonomous coding agents like Claude Code or Codex
ShieldedAI · LLM Guard
The workflow I keep coming back to: write a permissive policy, let the agent run for a week, then tighten the rules and replay the old sessions to see what would have been blocked. Much better than guessing at policy upfront, and it’s the part of the tool I didn’t expect to use as much as I do.
Every gated decision gets written to jsonl, so you can grep, diff, or feed traces back through a stricter policy without re-running the agent. There’s also a TUI for browsing sessions, inspecting individual gate decisions, and stepping through replays interactively, which makes it easier to spot patterns across runs.
Currently works with Claude Code and MCP-based clients like Codex.
Still a WIP and mostly a project for myself, but figured others experimenting with coding agents might find it interesting.
Firecracker microVMs isolate coding agents so you can review before merging.
Agents install their own sandbox via Docker, solving the dependency hell problem.
Terraform for agent fleets solves the terminal sprawl problem nobody else addressed.
MCP integration for feature flags is timely, but LaunchDarkly owns this category.
Docker sandboxing for agents when most runners just trust whatever the LLM outputs.
Istio-style sidecars for AI agents solving enterprise compliance gaps.