BrowseBrawl – What if browser agents battled to generate training data?

Name: BrowseBrawl – What if browser agents battled to generate training data?
Availability: InStock
Author: HrubyOnRails

by HrubyOnRails·Mar 4, 2026·30 points·18 comments

Visit Project View on HN

AI Analysis

●●●BangerZero to OneBig BrainWizardry

Self-play for web agents: adversarial interference generates richer training data than static sites.

Strengths

•Applies competitive self-play (AlphaGo / GANs) to a new domain: browser automation. Genuinely novel hypothesis.
•Live website testing (Amazon, Google Flights, TechCrunch) validates agents against real, dynamic DOM—not sandbox.
•Multi-agent framework choice: Playwright MCP, Browser-Use, Stagehand. Shows extensibility and research maturity.

Weaknesses

•Tasks aren't zero-sum; attacker/defender framing breaks down when both can succeed independently.
•Early-stage: hackathon project. No published results, no evidence adversarial data actually improves downstream models yet.

Post Description

I remember watching the AlphaGo documentary in 2017. What stood out to me was that the model got drastically better when it started competing against itself. GANs clicked for me similarly: a generator and discriminator competing, and somehow the competition is what produces something remarkable.

I've been curious whether this principle generalizes to today's agents.

So mehulkalia and I built Browser Brawl at the YC / BrowserUse hackathon last weekend and won first place. It is a fun experiment in which an attacker agent tries to complete tasks on live websites while a defender agent injects JavaScript to sabotage it.

The analogy isn't perfect, because browser tasks aren't zero-sum. But our hypothesis is that an agent faced with an adversary should produce more interesting training data than one navigating clean, static environments.

Try it on: http://browser-brawl.com

GitHub: https://github.com/RichardHruby/browser-brawl

Demo Video: https://youtu.be/NIoFXv-JvBY

(Skip to [0:55](https://www.youtube.com/watch?v=NIoFXv-JvBY&t=55s) to see the agents “brawling” in the arena :), [1:52](https://www.youtube.com/watch?v=NIoFXv-JvBY&t=1m52s) to see the browser traces generated)

Would love to chat with anyone building or training browser agents. Happy to dive in below!