Digest AI vs HN About

GitHub Repository

Snap any image, screenshot, or webpage into plaintext. No GPU. No cloud. One command.

88 starsPython

CPU-only fast OCR for screenshots, images, PDFs, webpages

by mrkn1·May 31, 2026·9 points·8 comments

Visit Project View on HN

AI Analysis

●●SolidSolve My ProblemCozy

CPU-only VLM OCR beats Tesseract accuracy without sending data to the cloud.

Strengths

•Heavily quantized 0.9B ONNX model runs on plain CPUs without CUDA dependencies.
•Clipboard input and output workflow makes screenshot-to-text round trips incredibly fast.
•Single-file portable design allows dropping the tool anywhere without installation requirements.

Weaknesses

•Webpage processing isolates main content but could miss complex multi-image layouts.
•890 MB initial download is heavy compared to standard Tesseract installations.

Category

Developer Tools

Target Audience

Developers needing private, local OCR without GPU dependencies

Similar To

Umi-OCR · Tesseract · Google Cloud Vision

Similar Projects

AI/ML●●Solid

CPU-only OCR for screenshots, images, and webpages

CPU-only VLM OCR beats Tesseract on layout without needing CUDA or cloud APIs.

Solve My ProblemCozy

mrkn1

4920d ago

Productivity●●Solid

Local CPU OCR for images, PDFs, webpages

CPU-only OCR with clipboard in/out beats Tesseract for modern screenshots.

Ship ItSolve My Problem

mrkn1

3015d ago

AI/ML●●Solid

Local-first fast CPU image to text for screenshots, PDFs, webpages

CPU-only OCR with clipboard round-trip when cloud APIs dominate the space.

CozySolve My Problem

mrkn1

19178d ago

AI/ML●Mid

AI-Powered PDF to Markdown Converter

PDF-to-Markdown for LLMs when JinaAI and Firecrawl already exist.

Solve My Problem

QingWu

4510d ago

Developer Tools●●Solid

Pdf2md – 10MB Rust PDF-to-Markdown Tool with a Free API

Rust-based PDF parser that keeps tables intact for LLM ingestion.

Solve My ProblemShip It

johnson_nie

10626d ago

Productivity●●Solid

A free browser extension to extract tables

OCR-based extraction handles images and PDFs where standard DOM scrapers fail.

Solve My ProblemSlick

viniciuscsr

102mo ago