Playwright Test Studio
Local Rainforest alternative using Tauri desktop app and Playwright MCP.
An agentic workflow runtime
Self-healing tests via semantic IDs instead of selectors, but needs proof it beats Copilot test gen.
QA engineers and full-stack developers writing E2E tests
GitHub Copilot (test generation) · Playwright Inspector · Synapse (AI test gen)
AthenaFlow runs a real browser, maps interaction paths, writes a human-readable spec first, and then implements Playwright tests from that spec. The spec step is intentional: it is reviewable before any code runs, and every generated test traces back to a TC-ID. Self-healing works by resolving semantic identifiers rather than selectors, so when the DOM shifts, the identifier survives.
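To make the self-healing idea concrete, here is a minimal sketch of resolving an element by a semantic identifier instead of a selector. It is not AthenaFlow's implementation: the `SnapshotNode` shape, the `role:name` identifier format, and the `resolveSemanticId` helper are all assumptions made up for illustration.

```typescript
// Hypothetical snapshot entry: role and accessible name describe the element
// semantically; `selector` is the brittle part that changes between builds.
interface SnapshotNode {
  role: string;     // e.g. "button", "textbox"
  name: string;     // accessible name shown to users
  selector: string; // current CSS path
}

// Hypothetical identifier format "<role>:<name>" (an assumption, not AthenaFlow's).
function resolveSemanticId(
  semanticId: string,
  snapshot: SnapshotNode[],
): SnapshotNode | undefined {
  const separator = semanticId.indexOf(":");
  if (separator === -1) return undefined;
  const role = semanticId.slice(0, separator);
  const wantedName = semanticId.slice(separator + 1).toLowerCase();
  // Match on meaning (role + name), never on the selector itself.
  return snapshot.find(
    (node) => node.role === role && node.name.toLowerCase() === wantedName,
  );
}

// The same control before and after a layout refactor.
const beforeRefactor: SnapshotNode[] = [
  { role: "button", name: "Place order", selector: "#checkout > div:nth-child(3) > button" },
];
const afterRefactor: SnapshotNode[] = [
  { role: "button", name: "Place order", selector: "form.checkout-v2 footer button.primary" },
];

const orderButtonId = "button:place order";
const oldMatch = resolveSemanticId(orderButtonId, beforeRefactor);
const newMatch = resolveSemanticId(orderButtonId, afterRefactor);

// The identifier resolves in both snapshots even though the selector changed.
console.log(oldMatch?.selector);
console.log(newMatch?.selector);
```

In plain Playwright, the closest built-in equivalent is a role-based locator such as `page.getByRole('button', { name: 'Place order' })`, which likewise targets what the element is rather than where it sits in the DOM.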
Three repos make up the stack:
athena-flow-cli is the workflow runtime. It hooks into Claude Code's event system, receives runtime events over a Unix domain socket as NDJSON, persists sessions to SQLite, and renders a live terminal UI. Sessions are fully resumable. In CI, exec mode emits JSONL with clean exit codes for every failure mode. https://github.com/lespaceman/athena-flow-cli
agent-web-interface is an MCP server that exposes semantic page snapshots to the model instead of raw DOM or full accessibility trees. This keeps context window usage low and action resolution stable across layout changes. https://github.com/lespaceman/agent-web-interface
athena-workflow-marketplace holds the QA domain knowledge as a Claude plugin: composable skills for codebase analysis, coverage planning, browser exploration, spec generation, and test implementation, wired together as a multi-phase workflow with checkpointed progress. https://github.com/lespaceman/athena-workflow-marketplace
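The NDJSON transport described for athena-flow-cli above comes down to one JSON object per line, which means a reader has to cope with chunks that end mid-record. The sketch below shows one way to decode such a stream incrementally. The `RuntimeEvent` shape and the event names are hypothetical, since the actual athena-flow-cli schema is not described here.

```typescript
// Hypothetical event shape; the real schema is an unknown here.
interface RuntimeEvent {
  type: string;
  [key: string]: unknown;
}

// Incrementally splits a text stream into newline-delimited JSON records.
class NdjsonDecoder {
  private buffer = "";

  // Feed one chunk; returns every complete event it finishes.
  push(chunk: string): RuntimeEvent[] {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    // The last element is an unfinished line ("" if the chunk ended on "\n").
    this.buffer = lines.pop() ?? "";
    return lines
      .map((line) => line.trim())
      .filter((line) => line.length > 0)
      .map((line) => JSON.parse(line) as RuntimeEvent);
  }
}

// A record split across two chunks is only emitted once it is complete.
const decoder = new NdjsonDecoder();
const firstBatch = decoder.push('{"type":"tool_start"}\n{"type":"too');
const secondBatch = decoder.push('l_end"}\n');

console.log(firstBatch.map((event) => event.type));
console.log(secondBatch.map((event) => event.type));
```

With Node's `net` module, a decoder like this could be fed from a socket's `data` events after calling `socket.setEncoding("utf8")`, so multi-byte characters are not split across chunks.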
Install: npm install -g @athenaflow/cli
Website: https://athenaflow.in/
Self-healing Playwright tests via AI agent repair, but Playwright Inspector and Cypress already auto-fix selectors.
Pattern guides for Playwright tests, but this is context injection, a known workaround rather than a platform shift.
Excel-to-test automation when Testim and Mabl already dominate this space.
Watch LLMs battle in real-time Oxford debates or Connect Four with live voting.
The write-up zeroes in on a concrete, painful failure mode: MCP setups streaming full DOMs and logs into models and burning token budgets. It shows how playwright-cli keeps browser state external and emits compact element references and YAML flows you can replay into npx playwright test, a realistic pattern for long agent sessions. It is valuable practical guidance for teams already on Playwright, but it is an explainer, not a new system you can drop in without plumbing.