A proxy to hide PII information from LLM requests
Yet another PII redaction proxy when Lakera and Portkey already dominate this space.
Privacy middleware for LLM & RAG pipelines - consistent pseudonymization, encrypted vault, SSE streaming rehydration.
Consistent pseudonymization beats redaction when RAG embeddings must survive.
Developers building RAG pipelines with privacy requirements
Presidio · Microsoft Purview · Privacera
When you send real documents or customer data to LLMs, you face a painful tradeoff:
- Send raw text → privacy disaster - Redact with [REDACTED] → embeddings break, RAG retrieval fails, multi-turn chats become useless, and the model often refuses to answer questions about the redacted entities.
The practical solution is consistent pseudonymization: the same real entity always maps to the same token (e.g. “Tata Motors” → ORG_7 everywhere). This preserves semantic meaning for vector search and reasoning, then you rehydrate the response so the provider never sees actual names, numbers or addresses.
I got fed up fighting this with Presidio + custom glue (truncated RAG chunks, declension in Indian languages, fuzzy merging for typos/siblings, LLM confusion, percentages breaking math). So I built Cloakpipe as a tiny single-binary Rust proxy.
It does: • Multi-layer detection (regex + financial rules + optional GLiNER2 ONNX NER + custom TOML) • Consistent reversible mapping in an AES-256-GCM encrypted vault (memory zeroized) • Smart rehydration that survives truncated chunks like [[ADDRESS:A00 • Built-in fuzzy resolution for typos and similar names • Numeric reasoning mode so percentages still work for calculations
Fully open source (MIT), zero Python dependencies, <5 ms overhead.
Repo: https://github.com/rohansx/cloakpipe Demo & quick start: https://app.cloakpipe.co/demo
Would love feedback from anyone who has audited their RAG data flow or is struggling with the redaction-vs-semantics problem — especially in legal, fintech, or non-English workflows.
What approaches have you landed on?
Yet another PII redaction proxy when Lakera and Portkey already dominate this space.
Agent-aware sleep prevention beats manual caffeine tools for long-running CLI tasks.
Zero-code LLM firewall; heuristics under 1ms, optional Groq semantic layer.
Rust core beats LangChain's Python bottleneck, but chunking alone won't move the needle.
Agents never see real keys, but Vault already does secret injection.
Pre-flight checks catch blockers before bed, stop hooks prevent 3am failures.