ReflowPDF – wrote a layout engine because every PDF library failed
Structural reflow engine beats overlay editors like Adobe Acrobat for table editing.

Runs full layout detection (PP-DocLayoutV3 for blocks, titles, and tables; Microsoft's table-transformer for cell structure) entirely in the browser via ONNX Runtime (WASM/WebGPU), so documents never leave the tab. It is impressive technical work, but the models are ~60MB and inference is CPU/GPU-heavy, so expect crashes on phones and slow analysis on older machines. The plugin-first approach (React/Svelte/Vue) and open-source repo make it a usable proof of concept for privacy-first PDF tooling.
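To make the "cell structure" step concrete, here is a minimal sketch of how a cell grid can be derived once a structure model has produced row and column boxes, which is the kind of output table-transformer's structure models emit. The `Box` and `Cell` types and the `buildCellGrid` helper are hypothetical names for illustration, not code from the repo, and spanning cells and header detection are ignored.

```typescript
// Axis-aligned box in page pixel coordinates: [x0, y0, x1, y1].
type Box = [number, number, number, number];

interface Cell {
  row: number; // 0-based row index, top to bottom
  col: number; // 0-based column index, left to right
  box: Box;
}

// Hypothetical helper: intersects every detected row band with every
// detected column band. Assumes rows and columns were already filtered
// by confidence score; spanning cells are not handled.
function buildCellGrid(rows: Box[], cols: Box[]): Cell[] {
  // Sort copies so indices follow reading order regardless of input order.
  const sortedRows = [...rows].sort((a, b) => a[1] - b[1]);
  const sortedCols = [...cols].sort((a, b) => a[0] - b[0]);

  const cells: Cell[] = [];
  sortedRows.forEach((r, row) => {
    sortedCols.forEach((c, col) => {
      // A cell is the overlap of a row band and a column band.
      const x0 = Math.max(r[0], c[0]);
      const y0 = Math.max(r[1], c[1]);
      const x1 = Math.min(r[2], c[2]);
      const y1 = Math.min(r[3], c[3]);
      // Skip pairs that do not overlap at all.
      if (x1 > x0 && y1 > y0) {
        cells.push({ row, col, box: [x0, y0, x1, y1] });
      }
    });
  });
  return cells;
}
```

Each resulting cell box can then be matched against the PDF's text positions to recover the table contents, all without the page leaving the tab.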
Frontend engineers, developers building PDF/document-processing tools, privacy-conscious teams and researchers experimenting with client-side ML
I recently posted my open-source PDF viewer here, and one thing I really value is that it runs completely offline. I started wondering if we could push that further: could we do full ML layout analysis (detecting tables, headers, columns) directly in the browser?
To my surprise, it actually works.
The catch: It is far from production-ready. It crashes on most phones, and on older computers, it can be incredibly slow.
The why: I believe the future of document processing is local. Many users work with sensitive documents (bank statements, legal contracts) and simply do not want to upload them to a cloud endpoint just to parse a table or analyze layout.
This is a proof of concept for that future—where models get smaller, WASM/WebGPU gets faster, and we can keep data entirely on the client side.
Demo: https://www.embedpdf.com/layout-analysis
Repo: https://github.com/embedpdf/embed-pdf-viewer
I'd love to hear your thoughts on the performance and where you think browser-based ML is heading.
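On the performance question, one common mitigation for the phone crashes and slow older machines mentioned in the post is to decide up front whether to run inference at all, and on which backend. The sketch below is an illustration under stated assumptions, not the project's code: `DeviceCaps`, `planInference`, and the 4 GB threshold are all hypothetical. In a browser the capability values would typically come from checks such as `'gpu' in navigator` and, where available, `navigator.deviceMemory`; the backend names mirror the `webgpu` and `wasm` execution providers of ONNX Runtime Web.

```typescript
// Hypothetical capability snapshot, filled in by the caller.
interface DeviceCaps {
  hasWebGPU: boolean;      // e.g. 'gpu' in navigator
  isMobile: boolean;       // however the app chooses to detect this
  deviceMemoryGB?: number; // undefined when the browser does not report it
}

type Plan =
  | { run: true; backend: "webgpu" | "wasm" }
  | { run: false; reason: string };

// Assumed threshold for illustration only; not a measured value.
const MIN_MEMORY_GB = 4;

function planInference(caps: DeviceCaps): Plan {
  // Refuse early on devices that report too little memory.
  if (caps.deviceMemoryGB !== undefined && caps.deviceMemoryGB < MIN_MEMORY_GB) {
    return { run: false, reason: "reported device memory is below the threshold" };
  }
  // Assumption: CPU-only inference on a phone is the likeliest crash case.
  if (caps.isMobile && !caps.hasWebGPU) {
    return { run: false, reason: "mobile device without WebGPU" };
  }
  // Prefer the GPU backend when present, otherwise fall back to WASM.
  return { run: true, backend: caps.hasWebGPU ? "webgpu" : "wasm" };
}
```

When the plan says not to run, the viewer can still display the PDF and simply hide the layout-analysis feature, which keeps the offline guarantee without risking a crashed tab.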
Open models + flat pricing, but ChatGPT/Claude already own privacy-conscious users.
PDF-to-Markdown for LLMs when JinaAI and Firecrawl already exist.
Useful for quick cleanup, but JinaAI and LLMs already handle this natively.
Zero-server processing inside the browser is the clever bit: Web Workers plus pdf-lib/PDF.js let you compress, merge, split, and convert large PDFs without uploading sensitive files. The app also supports offline use and multiple languages, but the repo is early-stage with minimal community traction, so for now it suits cautious users or integrators rather than serving as a drop-in replacement for mature SaaS players.
Passport photo printer, but remove.bg and Passport Photo Online already solve this.