Local video search with Qwen3-VL: no API, runs on Apple Silicon, GPUs

Name: Local video search with Qwen3-VL: no API, runs on Apple Silicon, GPUs
Availability: InStock
Author: sohamrj

by sohamrj·Mar 30, 2026·3 points·0 comments

Visit Project View on HN

AI Analysis

●●SolidNiche GemBig Brain

Local Qwen3-VL video embeddings beat API costs for dashcam and security footage.

Strengths

•Offline Qwen3-VL backend eliminates API dependency and recurring costs
•Tesla metadata overlay renders speed and location onto matched clips
•ChromaDB storage with similarity threshold suppresses weak false matches

Weaknesses

•8B model requires 18GB RAM, limiting smaller machine deployment
•Video search category already has Twelve Labs and commercial APIs

Post Description

Last week, I posted SentrySearch, a CLI for semantic video search using Gemini's embedding API. The #1 request was local model support.

Turns out Qwen3-VL-Embedding can natively embed video into the same kind of vector space, no API, fully offline. Runs on Apple Silicon (MPS) and NVIDIA GPUs (CUDA). The 8B model needs ~18GB RAM, or use the 2B model on smaller machines.

sentrysearch index /path --backend local

Also added: similarity threshold to suppress weak matches, and a Tesla metadata overlay that renders speed/location onto matched clips.

Details on the README.

Similar Projects

Infrastructure●●●Banger

IEEE-754-Conformant FP64 on Metal (Apple Silicon)

Bit-exact f64 emulation on Metal GPUs where Apple's native double support is missing.

WizardryNiche Gem

guyfischman

111mo ago

Data●●Solid

Benchmarking Apple Silicon unified mem for GPU-accelerated SQL analysis

The repo does one practical thing well: quantify the real-world impact of Apple Silicon's unified memory on analytics by running six TPC-H queries plus a GPU-favorable QX and shipping the raw charts and code. It's specific and empirical — you get MLX vs NumPy vs DuckDB numbers and PNGs, not just hand-wavy claims — but it's narrowly scoped to M4 hardware and small-ish scales, so its conclusions are useful for experimentation rather than sweeping generalization.

WizardryNiche Gem

sadopc

313mo ago

AI/ML●●Solid