Spidra – AI web scraper that adapts to any website
LLM-flavored scraper, but Firecrawl, Jina, and jsoup already handle dynamic extraction.

AI-powered selectors sound good, but Firecrawl, JinaAI, and Bright Data already do this—for less friction.
Data analysts, market researchers, SEO professionals, non-technical users doing one-off scraping
Firecrawl · JinaAI · Bright Data
The problem: Traditional scrapers break when websites change their HTML structure. You spend more time maintaining selectors than actually using the data.
How it works: Send a URL + an AI prompt like "extract all product names and prices as JSON" and the AI reads the page like a human, returning structured data.
Tech stack: FastAPI backend, Gemini 2.5 Flash for extraction, Playwright for rendering, deployed on Google Cloud Run.
Free tier: 500 credits, no credit card required. Would love feedback from HN on the approach and pricing.
LLM-flavored scraper, but Firecrawl, Jina, and jsoup already handle dynamic extraction.
Derivative AI scraper competing with Browse AI, launched with a broken Chrome Store link.
LLM infers schema once, Go does 10k-row extraction—avoids token waste.
LLM infers selectors once, Go extracts 10k rows—smart AI-for-intelligence architecture.
LLM-generated selector caching beats manual scraping, but Jina AI and Beautiful Soup handle this cheaper.
Captures Toulmin argument structure for decisions when most tools just store flat facts.