Back to browse
GitHub Repository

Local-first Video Knowledge Base. Index your video library with multi-modal analysis (YOLO, DeepFace, Whisper), search semantically via natural language, Docker-ready.

1,431 starsTypeScript

I made a tool to search your video footage and it's on GitHub

by iliashad·May 8, 2026·1 point·0 comments

AI Analysis

●●●BangerBig BrainSolve My Problem

Local YOLO and Whisper indexing beats Google's $400 API bill for private video search.

Strengths
  • Multi-modal analysis combining transcription, face recognition, and object detection locally.
  • Docker-ready deployment avoids complex local Python environment configuration for ML tools.
  • Semantic search via ChromaDB allows finding specific scenes using natural language queries.
Weaknesses
  • Active development status means features are incomplete and bugs are expected for early users.
  • Heavy resource requirements for running multiple ML models simultaneously on local hardware.
Category
Target Audience

Video editors, content creators, and archivists with large local video libraries

Similar To

Google Video Intelligence API · Twelve Labs

Post Description

I make YouTube videos (~80 so far, most around an hour). Plus, I have 6TB + of personal videos and unpublished. I tried the Google Video Intelligence API. Got a $400 bill for 4 videos (5 minutes average, 4k videos) of analysis (doesn't include video transcription), and I used my GCP startup credits to cover the bill.

I decided to build my own tool that needs to have 3 important things: can transcribe videos, analyse video frames, and everything needs to be done locally.

I don't wanna deal with storing my videos in the cloud because of two concerns: privacy and storage cost.

I've been working for the last couple of months. I have a source available version that can be used for free (personal and commercial use with companies that have fewer than 5 people). Available here (https://github.com/IliasHad/edit-mind), and the project has 1.3k Github stars

Now, I'm building a desktop app with direct NLE integration (Final Cut Pro, DaVinci Resolve, and Adobe Premiere Pro). This includes an editing agent that understands your footage and your editing style. (https://edit-mind.com)

Demo Video: https://youtu.be/jcctyfVg_34

Similar Projects

AI/ML●●●Banger

Google Video API bill for 4 videos. I built my own

Replaces a $400 Google API bill with local YOLO, DeepFace, and Whisper running in Docker.

Solve My ProblemBig BrainShip It
iliashad
301mo ago
AI/ML●●Solid

DocMason – AI Agent Knowledge Base for local complex office files

Preserves document structure instead of flattening to text like most RAG tools.

Solve My ProblemBold Bet
Jet_Xu
232mo ago