Back to browse
GitHub Repository

A transparent, 100%-local semantic cache for LLM APIs — drop-in proxy, one line to integrate, written in Rust

0 starsRust

Cachet – A drop-in semantic cache for LLM APIs, 100% local, in Rust

by Abhi_2112·Jun 23, 2026·5 points·0 comments

AI Analysis

●●SolidBig BrainShip It

Semantic caching without a vector DB—just swap your base URL.

Strengths
  • Single binary drop-in proxy, no vector database dependencies
  • Real-time dashboard showing hit rates and dollars saved
  • Pure Rust with ~47MB distroless Docker image
Weaknesses
  • Semantic caching for LLMs already exists in multiple tools
  • Only supports OpenAI and Anthropic-compatible APIs currently
Target Audience

Backend developers, AI engineers

Similar To

CacheLLM · LLMCache · Portkey

Similar Projects