Reverse lookup XKCD comics using Gemini multimodal embeddings
Search XKCD by image or text description using Gemini multimodal embeddings.
httpie for embeddings. Embed text, images, audio, video, and PDFs from the command line.
httpie for embeddings, but it's just a Gemini API wrapper with caching.
Developers building RAG applications or working with embeddings
jina-embeddings-cli · sentence-transformers · Voyage AI CLI
Direct video-to-vector embedding skips transcription entirely: like Twelve Labs, but self-hosted.
PDF-native AI chat beats the copy-paste workflow, but specialized readers are crowding the space fast.
Empirical OCR vs. image embeddings shootout for scientific PDFs reveals complementary bottlenecks.
Local SigLIP embeddings + 68K-term semantic tagging in a single Rust binary, zero cloud.
Multimodal embeddings in one vector space—text queries find images and audio locally.