Back to browse
AI Sees Me – CLIP running in the browser

AI Sees Me – CLIP running in the browser

by jayyvk·Mar 1, 2026·1 point·0 comments

AI Analysis

●●SolidWizardryBig Brain

CLIP embeddings live in the browser, but embedding visualizers already exist.

Strengths
  • Getting CLIP to run at usable speeds in WASM while handling live video is a genuine constraint problem.
  • Makes embedding-space similarity tangible through direct visual feedback—abstract concept made concrete.
  • Zero server, fully local inference—no hidden API calls or data leakage.
Weaknesses
  • Browser-based CLIP tools exist (ml5.js, fast.ai notebooks). This iteration doesn't claim a clear differentiation beyond the specific implementation.
  • No interactive exploration UI beyond the basic text input—feels more like a demo than a tool you'd return to.
Category
Target Audience

Machine learning enthusiasts, developers exploring embeddings and computer vision

Similar To

ml5.js · Teachable Machine · Fast.ai embeddings explorer

Post Description

I built a tool that runs OpenAI's CLIP model entirely in your browser using Transformers.js and ONNX Runtime Web. It encodes your webcam feed into vector embeddings and compares them against any text you type in real-time. No server, no API calls — all inference happens locally. The interesting technical challenge was getting CLIP to run at usable speeds in WASM while processing live video frames. Wanted to make the concept of embeddings and similarity scores tangible rather than abstract. Github: https://github.com/jayyvk/howaiseesme

Similar Projects

Developer Tools●●●Banger

Timber – Ollama for classical ML models, 336x faster than Python

336× faster tree model inference; compiles sklearn/XGBoost to C99, serves like Ollama.

WizardrySolve My Problem
kossisoroyce
207333mo ago