CLI for crawling documentation sites into Markdown with defuddle
No-browser docs crawler using defuddle when Firecrawl and JinaAI already exist.
No-browser doc crawler when JinaAI and Firecrawl already dominate this space.
Developers building RAG pipelines or LLM context datasets
JinaAI · Firecrawl · Crawlee
It is built for static and server-rendered docs sites such as Docusaurus, VitePress, MkDocs, GitBook exports, and Obsidian Publish. It does not run a browser and does not execute page JavaScript.
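To illustrate what "no browser, no page JavaScript" means in practice, here is a minimal Python sketch of link discovery over raw HTML. It is not the tool's actual implementation and does not use defuddle; `LinkCollector`, `same_site_links`, the sample markup, and the `docs.example.com` URL are all hypothetical names chosen for this example. It assumes a crawler that only sees links present in the served markup, so a link injected by a script is never found.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags in raw, unrendered HTML."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def same_site_links(html, page_url):
    """Return absolute, fragment-free, same-host links found in static HTML."""
    collector = LinkCollector()
    collector.feed(html)
    host = urlparse(page_url).netloc
    seen = []
    for href in collector.hrefs:
        # Resolve relative links against the page URL and drop "#fragment".
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        if urlparse(absolute).netloc == host and absolute not in seen:
            seen.append(absolute)
    return seen


# Hypothetical page: two same-site links are in the markup, one external link
# is filtered out, and one link would only exist after the script runs.
SAMPLE = """
<nav><a href="/docs/intro">Intro</a> <a href="guide.html#setup">Guide</a></nav>
<a href="https://other.example/x">External</a>
<script>document.body.innerHTML += '<a href="/docs/hidden">Hidden</a>';</script>
"""

links = same_site_links(SAMPLE, "https://docs.example.com/docs/index.html")
print(links)
```

Because the script body is never executed, `/docs/hidden` does not appear in the result. This is the trade-off of a no-browser crawler: it works well on static and server-rendered sites, while content that only exists after client-side rendering is out of reach.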
Cloudflare /crawl API powers nostalgic newspaper layouts for Hacker News and friends.
DeepWiki scraper for Claude, but Jina and Firecrawl already do this better.
Hybrid BM25 + vector search via MCP beats pure keyword or pure semantic for API docs.
Auto-generates API tests from OpenAPI specs when Schemathesis and Postman already exist.
Local Markdown OCR via CLI and HTTP, though macOS Live Text overlaps heavily.