GitHub Repository

Snap any image, screenshot, or webpage into plaintext. No GPU. No cloud. One command.

177 stars · Python

CPU-only OCR for screenshots, images, and webpages

by mrkn1 · May 24, 2026 · 4 points · 9 comments

AI Analysis

●● Solid · Solve My Problem · Cozy

CPU-only VLM OCR beats Tesseract on layout without needing CUDA or cloud APIs.

Strengths

• Quantized ONNX model runs on CPU without CUDA dependencies or cloud API keys.
• Extracts main content image from webpages automatically via readability parsing.
• Single Python module with self-installing dependencies makes deployment trivial for automation.

Weaknesses

• Webpage OCR only targets the main image and does not extract full-page text.
• Zero stars and one commit suggest an early-stage project with unproven maintenance.

Similar Projects

Developer Tools · ●● Solid

CPU-only fast OCR for screenshots, images, PDFs, webpages

CPU-only VLM OCR beats Tesseract accuracy without sending data to the cloud.

Solve My Problem · Cozy

mrkn1

98 · 1mo ago

Productivity · ●● Solid

Local CPU OCR for images, PDFs, webpages

CPU-only OCR with clipboard in/out beats Tesseract for modern screenshots.

Ship It · Solve My Problem

mrkn1

30 · 2mo ago

AI/ML · ●● Solid

Local-first fast CPU image to text for screenshots, PDFs, webpages

CPU-only OCR with clipboard round-trip when cloud APIs dominate the space.

Cozy · Solve My Problem

mrkn1

1917 · 1mo ago

AI/ML · ●● Solid

Open-source alternative to Codex Chronicle, using Apple's local OCR

Privacy-first screen capture for agents when Rewind and ScreenPipe already exist.

Big Brain · Niche Gem

talsraviv

20 · 2mo ago

Developer Tools · ● Mid

Klovr – Convert any webpage to Markdown (Cloudflare covers only 5%)

Nice, focused product: site-specific extraction rules (CSS selectors/metadata overrides), edge-first delivery (<500ms p99) and SDKs for Node/Python make it quick to drop into an LLM pipeline and claim 40–60% token savings. That said, HTML→Markdown is a crowded niche (Pandoc, Jina, Firecrawl and dozens of scrapers already exist), so Klovr needs clearer differentiation — e.g. demonstrable extraction accuracy, enterprise-grade rule sharing, or unique model-aware trimming — to move beyond 'handy utility'.

Solve My Problem · Slick

vaibhavlodha98

21 · 5mo ago

Productivity · ●● Solid

Nicasa – A native macOS image viewer with AI tools, OCR, and annotation

Native macOS image viewer with on-device AI enhancement beats Preview for power users.

Slick · Solve My Problem

terryXyz

3021d ago