Back to browse
GitHub Repository

Your smart AI agent to discover all user journeys on a website.

3 starsPython

Web Scout AI – Auto-discover every user journey (zero config)

by apexkid·Mar 1, 2026·2 points·0 comments

AI Analysis

●●●BangerWizardrySolve My Problem

Claude vision + Playwright auto-discovers every user journey without selectors or config.

Strengths
  • Zero-config journey discovery eliminates manual path mapping—Claude vision genuinely understands clickable elements
  • Smart deduplication (recognizes 50 product cards, samples one) reduces API calls and exploration time
  • Outputs replaying JSON files + Mermaid diagrams, making results immediately actionable for QA
Weaknesses
  • Heavy API cost per run (Claude vision calls per screenshot); unclear if viable for large sites
  • Relies on site structure being visually coherent—may struggle with dynamic SPA rendering or JS-heavy sites
Target Audience

QA engineers, test automation teams, product managers documenting user flows

Similar To

Playwright Inspector · Selenium IDE · Gretel.ai (synthetic data)

Post Description

Hi HN, I'm Adi. I built Web Scout AI, an open-source tool that takes a URL and autonomously discovers every user journey on the site — no scripts, no selectors, no config. Point it at your homepage and it finds paths like Homepage → Product Listing → Product Detail → Cart → Checkout on its own.

Repo: https://github.com/apexkid/web-scout-ai

It uses Claude's vision + Playwright to explore a site the way a human would: look at the page, decide what's clickable, click it, repeat. It does this recursively via BFS, building a complete graph of every reachable state. When it finds 50 product cards, it recognizes the pattern and explores one representative instead of all 50. Cookie banners and popups get auto-dismissed. The output is a journey graph, Mermaid diagrams, and a set of replayable JSON files along with network requests fired at every interactions.

Who's can use it and for what:

* QA/Testing — The most obvious use case. Run auto against your staging environment, get a full set of discovered journeys, then replay them after every deploy. No test scripts to write or maintain. When the site changes, re-run discovery instead of fixing selectors. One team told me they went from 2 weeks of manual test writing to a single afternoon of reviewing auto-discovered journeys.

* 3P API auditing — This is the one I didn't expect. The replay engine captures every XHR/fetch request at every step — full request and response bodies. Teams are using this to verify that analytics events (GA4, Segment, etc.) actually fire at the right moments in the right order. "Does our checkout funnel fire the right events at every step?" becomes a replay + grep instead of a manual walkthrough.

* Journey documentation — PMs and designers use the Mermaid diagram output to get a ground-truth map of what users can actually do. Turns out the real journey graph rarely matches what's in the Figma file. Dead ends, loops, and unreachable states show up immediately.

* Post-deploy smoke tests — Run replay all in CI after a deploy. It replays every known journey through a real browser and reports pass/fail per step. No LLM cost, runs in parallel, takes minutes. If a flow breaks, you know which step and which selector failed.

* Competitive analysis — Point it at a competitor's site and get a structured map of their user flows. What journeys do they support? What does their checkout look like? All captured as screenshots and structured JSON.

Similar Projects

AI/ML●●●Banger

Auto GPU Kernel – Autonomous GPU-kernel discovery and optimizer

Autonomous kernel optimizer that won MLSys contest with 34.93x speedup.

WizardryBig BrainBold Bet
dogacel
1019d ago
Developer Tools●●Solid

NitROS – Robot pub/sub in 3 lines, zero config

Zero-config mDNS discovery plus automatic handling of dicts, numpy arrays and PyTorch tensors is a genuinely handy ergonomics win for quick robot prototypes. JPEG/LZ4 heuristics and non-blocking send() show the author thought about real data types (images, point clouds) rather than just passing JSON. It’s not trying to replace ROS2 — and the README is honest about missing TF/actions and scalability limits — so treat this as a pragmatic tool for small-scale workflows, not a production ROS replacement.

Niche GemSolve My Problem
inputnameplz
213mo ago