GitHub Repository

An agentic workflow runtime

23 stars · TypeScript

AthenaFlow – it browses your app, then writes Playwright tests

by nadeem1 · Mar 6, 2026 · 1 point · 0 comments

AI Analysis

●● Solid · Big Brain · Solve My Problem

Self-healing tests via semantic IDs instead of selectors, but needs proof it beats Copilot test gen.

Strengths
  • Spec-first approach is reviewable before code runs and traces every test to a TC-ID
  • Self-healing via semantic identifiers survives DOM shifts that break selector-based tests
  • Resumable sessions, CI-mode JSONL output, and Unix socket integration show production thinking
Weaknesses
  • No public demo or working example; unclear how Claude Code hooks actually work
  • Competes with GitHub Copilot, Continue, and v0 test generation—differentiation unproven
Target Audience

QA engineers and full-stack developers writing E2E tests

Similar To

GitHub Copilot (test generation) · Playwright Inspector · Synapse (AI test gen)

Post Description

E2E tests don't break once; they drift. AI tools that generate tests without seeing the app produce code that passes today and fails silently until CI goes red. The real cost isn't writing tests; it's maintaining them.

AthenaFlow runs a real browser, maps interaction paths, writes a human-readable spec first, then implements Playwright tests from that spec. The spec step is intentional: it's reviewable before any code runs, and every generated test traces back to a TC-ID. Self-healing works by resolving semantic identifiers rather than selectors, so when the DOM shifts, the identifier survives.
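To make the self-healing idea concrete, here is a minimal sketch of resolving a semantic identifier against a page snapshot. The `SemanticId` and `SnapshotElement` types and the `resolve` function are hypothetical illustrations, not AthenaFlow's actual data model; the post does not say how identifiers are represented. The sketch assumes an identifier is a role plus an accessible name.

```typescript
// Hypothetical types for illustration; not AthenaFlow's actual data model.
interface SnapshotElement {
  role: string;     // e.g. "button"
  name: string;     // accessible name, e.g. "Sign in"
  selector: string; // concrete CSS selector in the current DOM
}

interface SemanticId {
  role: string;
  name: string;
}

// Resolve a semantic identifier to whatever selector currently matches it.
function resolve(id: SemanticId, snapshot: SnapshotElement[]): string | undefined {
  const wanted = id.name.trim().toLowerCase();
  const match = snapshot.find(
    (el) => el.role === id.role && el.name.trim().toLowerCase() === wanted,
  );
  return match?.selector;
}

const signIn: SemanticId = { role: "button", name: "Sign in" };

// The same page before and after a layout change (made-up selectors).
const before: SnapshotElement[] = [
  { role: "button", name: "Sign in", selector: "#login > button.btn-primary" },
];
const after: SnapshotElement[] = [
  { role: "link", name: "Help", selector: "header a.help" },
  { role: "button", name: "Sign in", selector: "form[data-v2] div.actions > button" },
];

const selectorBefore = resolve(signIn, before);
const selectorAfter = resolve(signIn, after);
```

The test stores only `signIn`; the selector is looked up at run time, so the restructured DOM in `after` still resolves. In plain Playwright, the closest built-in analogue is `page.getByRole("button", { name: "Sign in" })`.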

Three repos make up the stack:

athena-flow-cli is the workflow runtime. It hooks into Claude Code's event system, receives runtime events over a Unix domain socket as NDJSON, persists sessions to SQLite, and renders a live terminal UI. Sessions are fully resumable. In CI, exec mode emits JSONL with clean exit codes for every failure mode. https://github.com/lespaceman/athena-flow-cli
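As a rough illustration of the NDJSON transport, the sketch below decodes newline-delimited JSON from a stream that may split a record across chunks. The `RuntimeEvent` shape and the event type names are assumptions made for the example; the post does not document the real event schema.

```typescript
// Hypothetical event shape; the real AthenaFlow event schema is not given in the post.
interface RuntimeEvent {
  type: string;
  [key: string]: unknown;
}

class NdjsonDecoder {
  private buffer = "";

  // Feed one chunk of text; returns every event completed by this chunk.
  push(chunk: string): RuntimeEvent[] {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    // The last piece is "" (chunk ended on a newline) or a partial record
    // that must wait for the next chunk.
    this.buffer = lines.pop() ?? "";
    return lines
      .map((line) => line.trim())
      .filter((line) => line.length > 0)
      .map((line) => JSON.parse(line) as RuntimeEvent);
  }
}

const decoder = new NdjsonDecoder();
// The second record is split across two chunks, as can happen on a socket.
const first = decoder.push('{"type":"session.start"}\n{"type":"tool');
const second = decoder.push('.call","name":"click"}\n');
```

With Node's `net` module, a decoder like this could be fed from `socket.on("data", ...)` on a server created with `net.createServer(...)` and bound to a socket path via `server.listen(path)`. Buffering the trailing partial line is what keeps a record that straddles two reads from being parsed too early.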

agent-web-interface is an MCP server that exposes semantic page snapshots to the model instead of raw DOM or full accessibility trees. This keeps context window usage low and action resolution stable across layout changes. https://github.com/lespaceman/agent-web-interface
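One way to picture a semantic snapshot is as a filtered, flattened view of the accessibility tree. The sketch below is an assumption about the general technique, not the agent-web-interface implementation: `AxNode`, `SnapshotEntry`, `INTERACTIVE_ROLES`, and `compactSnapshot` are all hypothetical names.

```typescript
// Hypothetical node shape standing in for a full accessibility tree.
interface AxNode {
  role: string;
  name?: string;
  children?: AxNode[];
}

interface SnapshotEntry {
  ref: string; // short per-snapshot handle the model can act on
  role: string;
  name: string;
}

// Assumed set of roles worth showing to the model.
const INTERACTIVE_ROLES = new Set(["button", "link", "textbox", "checkbox", "combobox"]);

// Walk the tree depth-first and keep only named, interactive nodes.
function compactSnapshot(root: AxNode): SnapshotEntry[] {
  const entries: SnapshotEntry[] = [];
  const visit = (node: AxNode): void => {
    if (INTERACTIVE_ROLES.has(node.role) && node.name) {
      entries.push({ ref: `e${entries.length + 1}`, role: node.role, name: node.name });
    }
    for (const child of node.children ?? []) visit(child);
  };
  visit(root);
  return entries;
}

const page: AxNode = {
  role: "main",
  children: [
    { role: "generic", children: [{ role: "heading", name: "Welcome back" }] },
    {
      role: "form",
      children: [
        { role: "textbox", name: "Email" },
        { role: "generic", children: [{ role: "button", name: "Sign in" }] },
      ],
    },
  ],
};

const snapshot = compactSnapshot(page);
```

Here seven tree nodes reduce to two entries, and wrapper elements that come and go with layout changes never reach the model. The `ref` values in this sketch are only valid within one snapshot; any cross-snapshot stability would have to come from the role and name.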

athena-workflow-marketplace holds the QA domain knowledge as a Claude plugin: composable skills for codebase analysis, coverage planning, browser exploration, spec generation, and test implementation, wired together as a multi-phase workflow with checkpointed progress. https://github.com/lespaceman/athena-workflow-marketplace
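Checkpointed progress can be sketched as a loop that skips phases already recorded as done. The phase names below come from the post; the `Checkpoint` shape and `runWorkflow` function are hypothetical, and an in-memory object stands in for whatever persistence the real workflow uses.

```typescript
// Phase names taken from the post; everything else here is hypothetical.
const PHASES = [
  "codebase-analysis",
  "coverage-planning",
  "browser-exploration",
  "spec-generation",
  "test-implementation",
] as const;
type Phase = (typeof PHASES)[number];

interface Checkpoint {
  completed: Phase[];
}

// Runs the remaining phases in order and records each one as it finishes,
// so a later call with the same checkpoint resumes where the last stopped.
function runWorkflow(checkpoint: Checkpoint, runPhase: (phase: Phase) => void): Checkpoint {
  for (const phase of PHASES) {
    if (checkpoint.completed.includes(phase)) continue;
    runPhase(phase); // may throw; earlier phases stay recorded
    checkpoint.completed.push(phase);
  }
  return checkpoint;
}

const checkpoint: Checkpoint = { completed: [] };
const executed: Phase[] = [];

// First attempt: simulate a failure during browser exploration.
try {
  runWorkflow(checkpoint, (phase) => {
    if (phase === "browser-exploration") throw new Error("browser crashed");
    executed.push(phase);
  });
} catch {
  // A real runtime would have persisted the checkpoint by this point.
}

// Second attempt: resumes at the failed phase without redoing earlier work.
runWorkflow(checkpoint, (phase) => executed.push(phase));
```

After the second call, each phase has run exactly once, even though the workflow was started twice.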

npm install -g @athenaflow/cli
https://athenaflow.in/

Similar Projects

Developer Tools · ●● Solid

Playwright Test Studio

Local Rainforest alternative using Tauri desktop app and Playwright MCP.

Ship It · Solve My Problem · Niche Gem
baudehlo
46 · 2mo ago
Developer Tools · ●● Solid

Why Playwright-CLI Beats MCP for AI‑Driven Browser Automation

The write-up zeroes in on a concrete, painful failure mode: MCP setups streaming full DOMs and logs into models and burning token budgets. It shows how playwright-cli keeps browser state external and emits compact element references and YAML flows you can replay into npx playwright test — a realistic pattern for long agent sessions. Valuable practical guidance for teams already on Playwright, but it's an explainer, not a new system you can drop in without plumbing.

Niche Gem · Big Brain
tanmay001
10 · 4mo ago