Back to browse
GitHub Repository

Local-first Video Knowledge Base. Index your video library with multi-modal analysis (YOLO, DeepFace, Whisper), search semantically via natural language, Docker-ready.

1,433 starsTypeScript

Google Video API bill for 4 videos. I built my own

by iliashad·May 4, 2026·3 points·0 comments

AI Analysis

●●●BangerSolve My ProblemBig BrainShip It

Replaces a $400 Google API bill with local YOLO, DeepFace, and Whisper running in Docker.

Strengths
  • Multi-model pipeline (YOLO, DeepFace, Whisper) extracts rich metadata without cloud uploads.
  • Docker-ready deployment abstracts complex ML dependencies into a single compose file.
  • Semantic search over video frames and transcripts solves a genuine archival pain point.
Weaknesses
  • Explicitly admits non-production status with incomplete features and potential bugs.
  • Heavy local resource consumption for real-time analysis of 4K video libraries.
Category
Target Audience

Video editors, content creators, and privacy-conscious archivists

Similar To

Google Video Intelligence API · Twelve Labs · VideoDB

Post Description

I make YouTube videos (~80 so far, most around an hour). Plus, I have 6TB + of personal videos and unpublished.

I tried the Google Video Intelligence API. Got a $400 bill for 4 videos (5 minutes average, 4k videos) of analysis (doesn't include video transcription), and I used my GCP startup credits to cover the bill.

I decided to build my own tool that needs to have 3 important things: can transcribe videos, analyse video frames, and everything needs to be done locally.

I don't wanna deal with storing my videos in the cloud because of two concerns: privacy and storage cost.

I've been working for the last couple of months. I have a source available version that can be used for free (personal and commercial use with companies that have fewer than 5 people). Available here (https://github.com/IliasHad/edit-mind), and the project has 1.3k Github stars

Now, I'm building a desktop app with direct NLE integration (Final Cut Pro, DaVinci Resolve, and Adobe Premiere Pro). This includes an editing agent that understands your footage and your editing style.

Preview: https://youtu.be/jcctyfVg_34

Happy to answer questions and hear your feedback.

Similar Projects

AI/ML●●●Banger

I made a tool to search your video footage and it's on GitHub

Local YOLO and Whisper indexing beats Google's $400 API bill for private video search.

Big BrainSolve My Problem
iliashad
1026d ago