MessyData – turn messy data into clean tables/CSV
Useful for quick cleanup, but JinaAI and LLMs already handle this natively.

Useful dataset for UK researchers but it's a Kaggle upload, not a reusable tool.
Data analysts, UK property researchers, journalists
Tabula · Camelot · PDFPlumber
Useful for quick cleanup, but JinaAI and LLMs already handle this natively.
Noise-filtered PDF/web extraction for RAG, but already solved by Jina, Firecrawl.
Instance segmentation for floor plans—clever CV work, but niche audience and no monetization path.
Reader-mode extraction to searchable PDFs in Google Drive, one click, zero signup.
OCR-based extraction handles images and PDFs where standard DOM scrapers fail.
Local-first finance without bank API access—but transaction import+categorization is well-solved.