LiteParse, a fast open-source document parser for AI agents
Beats PyPDF and MarkItDown on accuracy without needing GPUs or cloud APIs.
A fast, helpful, and open-source document parser
Rust rewrite with PDFium delivers 100x speedup over the Python v1.
AI/ML engineers building document processing pipelines
Unstructured · LlamaParse · pdfplumber
2.7ms vs 151ms startup: a pure speed optimization, since the Python thefuck already works.
Rust core beats LangChain's Python bottleneck, but chunking alone won't move the needle.
Beats simd-csv with pclmulqdq trick, but CSV parsing is a solved category.
Local PDF parsing with spatial boxes that rivals LlamaParse without the cloud bill.
Drop-in jscodeshift replacement, 8x faster on real monorepos; API compatibility is the moat.