RAG-Ready Extractor – Structure-aware ingestion with semantic scoring
Noise-filtered PDF/web extraction for RAG, but already solved by Jina and Firecrawl.
CPU-optimised inference wrapper for Meta's SAM 3: a pipeline with chunked video processing, automatic object detection, and IoU-based tracking.
Memory-aware video chunking with IoU tracking lets SAM 3 run on CPU without GPU memory limits.
Computer vision engineers, developers without GPU access
Hugging Face pipelines · Replicate · Standard SAM wrappers
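To make the IoU-based tracking in the SAM 3 wrapper concrete, here is a minimal sketch of how detections in one frame could be linked to existing tracks by bounding-box overlap. It is not the project's actual code: the names `iou` and `match_tracks`, the `(x1, y1, x2, y2)` box layout, the greedy matching strategy, and the 0.5 threshold are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical box layout: (x1, y1, x2, y2) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_tracks(
    tracks: Dict[int, Box],
    detections: List[Box],
    threshold: float = 0.5,  # assumed cut-off, not taken from the project
) -> Dict[int, Box]:
    """Greedy matching: each detection reuses the best-overlapping
    unclaimed track id, or gets a fresh id if nothing overlaps enough."""
    unclaimed = dict(tracks)
    next_id = max(tracks, default=-1) + 1
    updated: Dict[int, Box] = {}
    for det in detections:
        best_id, best_score = None, threshold
        for track_id, box in unclaimed.items():
            score = iou(box, det)
            if score >= best_score:
                best_id, best_score = track_id, score
        if best_id is None:
            best_id = next_id
            next_id += 1
        else:
            del unclaimed[best_id]
        updated[best_id] = det
    return updated
```

With chunked video, the dictionary returned for the last frame of one chunk would be passed in as `tracks` for the first frame of the next, so object ids survive chunk boundaries while only one chunk is held in memory at a time.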
They stopped pretending that chunking at arbitrary byte offsets was fine. Instead they scan once to build message boundaries, then binary-search for clean split points; that simple change eliminates the OOM-by-design scenario. Couple that with SIMD-aware prefetch tuning (different distances for AVX2 vs AVX-512) and you get practical microarch-aware engineering, not just benchmark stunts. I want this shipped as a library or tool so other firms can stop reinventing the same footguns.
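The scan-then-binary-search idea can be sketched in a few lines. This is an illustration under assumptions, not the reviewed project's implementation: it assumes messages are delimiter-terminated (newline by default), and the names `scan_boundaries` and `split_points` are hypothetical.

```python
from bisect import bisect_right
from typing import List


def scan_boundaries(data: bytes, delimiter: bytes = b"\n") -> List[int]:
    """Single pass over the data: record the offset just past each
    delimiter, i.e. every position where a message ends."""
    boundaries: List[int] = []
    pos = data.find(delimiter)
    while pos != -1:
        boundaries.append(pos + len(delimiter))
        pos = data.find(delimiter, pos + len(delimiter))
    return boundaries


def split_points(boundaries: List[int], total_len: int, chunk_size: int) -> List[int]:
    """For each chunk, binary-search for the last message boundary that
    keeps the chunk at or under chunk_size bytes."""
    points: List[int] = []
    start = 0
    while total_len - start > chunk_size:
        idx = bisect_right(boundaries, start + chunk_size) - 1
        if idx < 0 or boundaries[idx] <= start:
            # A single message is larger than chunk_size: fall back to
            # the next boundary so the message is never cut in half.
            idx = bisect_right(boundaries, start)
            if idx == len(boundaries):
                break
        if boundaries[idx] >= total_len:
            break
        start = boundaries[idx]
        points.append(start)
    return points
```

Because the boundary list is sorted by construction, each split point costs one `bisect_right` call instead of a rescan, and no chunk ever ends in the middle of a message.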
Fact-checking with web citations is clever, but Ollama already offers a local LLM CLI.
Fact-checking with citations and web search runs entirely on your CPU.
One API unifies prompt tuning, code optimization, and black-box search, and beats domain-specific tools.
Core pinning and CCD alignment are real engineering, but game hosting is a crowded market.