AthenaFlow – it browses your app, then writes Playwright tests

Name: AthenaFlow – it browses your app, then writes Playwright tests
Availability: InStock
Author: nadeem1

by nadeem1·Mar 6, 2026·1 point·0 comments

Visit Project View on HN

AI Analysis

●●SolidBig BrainSolve My Problem

Self-healing tests via semantic IDs instead of selectors, but needs proof it beats Copilot test gen.

Strengths

•Spec-first approach is reviewable before code runs, traces every test to TC-ID
•Self-healing via semantic identifiers survives DOM shifts that break selector-based tests
•Resumable sessions, CI-mode JSONL output, and Unix socket integration show production thinking

Weaknesses

•No public demo or working example; unclear how Claude Code hooks actually work
•Competes with GitHub Copilot, Continue, and v0 test generation—differentiation unproven

Post Description

E2E tests don't break once, they drift. AI tools that generate tests without seeing the app produce code that passes today and fails silently until CI goes red. The real cost isn't writing tests, it's maintaining them.

AthenaFlow runs a real browser, maps interaction paths, writes a human-readable spec first, then implements Playwright tests from that spec. The spec step is intentional, it's reviewable before any code runs, and every generated test traces back to a TC-ID. Self-healing works by resolving semantic identifiers rather than selectors, so when the DOM shifts, the identifier survives.

Three repos make up the stack:

athena-flow-cli is the workflow runtime. It hooks into Claude Code's event system, receives runtime events over a Unix domain socket as NDJSON, persists sessions to SQLite, and renders a live terminal UI. Sessions are fully resumable. In CI, exec mode emits JSONL with clean exit codes for every failure mode. https://github.com/lespaceman/athena-flow-cli

agent-web-interface is an MCP server that exposes semantic page snapshots to the model instead of raw DOM or full accessibility trees. Keeps context window usage low and action resolution stable across layout changes. https://github.com/lespaceman/agent-web-interface

athena-workflow-marketplace holds the QA domain knowledge as a Claude plugin -- composable skills for codebase analysis, coverage planning, browser exploration, spec generation, and test implementation, wired together as a multi-phase workflow with checkpointed progress. https://github.com/lespaceman/athena-workflow-marketplace

npm install -g @athenaflow/cli https://athenaflow.in/