A vision-based AI agent for end-to-end testing
They've traded brittle selector-based scripts for a vision-and-planning loop: describe a test in plain English, the agent visually inspects the UI, plans actions, executes them (including OS-level interactions) and iterates until success or failure. If it actually nails reproducible CI-friendly runs, debuggable artifacts, and edge cases like dynamic content and auth flows, this could be a meaningful shift — but those operational details will make or break it.