Contrapunk – Real-time counterpoint harmony from guitar input, in Rust
Real-time guitar-to-MIDI harmony with proper counterpoint rules and sub-10ms latency.
Production-grade RAG API built in Rust. Hybrid search with HNSW dense vectors and BM25 sparse matching, cross-encoder reranking, layout-aware document extraction via Docling, and 94.5% accuracy on Open RAG Bench. Powered by Cerebras, Groq, Milvus, and Jina AI.
Sub-600ms RAG across continents without GPU beats standard vector-DB-plus-LLM stacks.
Engineers building RAG applications who prioritize latency and cost over accuracy; teams without GPU infrastructure
LlamaIndex · LangChain · Haystack
RustyRAG v0.2 hits sub-200ms on localhost and sub-600ms from Azure North Central US to a browser in Brazil. 977 PDFs, 56K chunks in Milvus, 3 sources per response.
Key changes in v0.2: switched to Cerebras/Groq for LLM inference, replaced Cohere with Jina AI local embeddings (v5-text-nano-retrieval), and added optional contextual retrieval via LLM-generated chunk prefixes.
Real-time guitar-to-MIDI harmony with proper counterpoint rules and sub-10ms latency.
Stripped cluster logic from Amgix to build a single-binary Rust engine that crushes latency benchmarks.
Beats Neon's 500ms cold starts by architecting SQLite around S3 constraints.
Go's GC pauses break voice AI conversations — Rust's deterministic latency actually solves this.
Sub-microsecond streaming SQL via zero-alloc hot path; genuine advancement over SQLite+DataFusion.
WASM sandboxes per parser replace $200K legacy gateways at $0.05/MB.