Back to browse
GitHub Repository

Ultra-low-latency reverse proxy that repairs truncated & malformed JSON in LLM streaming responses (OpenAI, Anthropic, Vertex AI, Bedrock) — fixes JSONDecodeError / serde_json EOF on truncated tool calls.

1 starsRust

Suture – a reverse proxy that repairs truncated JSON in LLM streams

by nabucodonosor·Jun 4, 2026·1 point·0 comments

AI Analysis

●●●BangerBig BrainSolve My ProblemShip It

Fixes truncated JSON on the wire in ~10µs without SDK changes or retries.

Strengths
  • SSE-aware repair of reassembled tool-call arguments across delta events, not raw bytes
  • Transparent gzip/brotli/deflate handling with ~10µs overhead per chunk
  • Supports OpenAI, Anthropic, Vertex AI, and Bedrock without provider-specific code
Weaknesses
  • Only fixes truncation errors, doesn't help with malformed JSON from the model itself
  • Rust implementation may limit adoption for teams without Rust deployment experience
Target Audience

Backend developers building LLM applications with streaming tool calls

Similar To

LiteLLM · OpenRouter

Similar Projects

Developer Tools●●Solid

Agent Firewall – Go proxy to kill LLM death spirals

Wire-protocol circuit breaker for agents when LangSmith costs too much.

Solve My ProblemShip It
wuweiaxin
212mo ago