GitHub Repository

Graphsignal Profiler

207 stars · Python

CUDA Profiler for Production Inference

by npgraph · Jun 23, 2026 · 6 points · 0 comments

AI Analysis

●● Solid · Solve My Problem · Slick

LLM inference profiling with per-token timing, but Arize and Langfuse already own this space.

Strengths
  • vLLM integration via CLI wrapper means zero code changes to existing deployments
  • Per-step LLM generation tracing with token throughput breakdowns is genuinely useful
  • CUDA 12/13 support keeps pace with current GPU hardware
Weaknesses
  • Cloud-only service with a required API key limits use in self-hosted and air-gapped deployments
  • The LLM observability space is crowded, with Arize, Langfuse, and Datadog already well funded
Target Audience

ML engineers running production LLM inference

Similar To

Arize · Langfuse · PyTorch Profiler
