Deepgram releases Deepgram CLI (`dg`), an agent-aware CLI

by lukeocodes · Apr 16, 2026 · 4 points · 0 comments

AI Analysis

●● Solid · Slick · Ship It

Agent-aware speech CLI with MCP support, but wraps an existing API service.

Strengths
  • Auto-detects Claude Code, Aider, Codex — switches to JSON output without flags
  • UNIX philosophy done right: stdout for data, stderr for diagnostics, proper exit codes
  • MCP server mode enables direct integration with AI agent tooling ecosystems
Weaknesses
  • Still fundamentally a CLI wrapper around Deepgram's existing API — no new capability
  • AssemblyAI, Whisper CLI, and Google Cloud Speech already serve this category
Target Audience

Backend developers, AI engineers building speech workflows

Similar To

AssemblyAI CLI · Whisper CLI · Google Cloud Speech CLI

Post Description

We launched the Deepgram CLI, a command-line interface for transcription, speech synthesis, text analysis, account management, and MCP-based AI workflows.

The main idea was to make Deepgram feel native in the terminal instead of forcing people into one-off curl snippets or small throwaway apps just to test a workflow. It also features agent-aware heuristics. We looked at the fingerprints agents leave in the sandbox or environment of the apps they run, and decided that detecting those was better than defaulting to agent mode. If we miss one, the standard --help output includes a catch-all tip for any agent parsing it.
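To make the heuristic idea concrete, here is a minimal sketch of how a CLI might detect an agent from its environment and switch output format. It is an illustration only, not how `dg` is implemented: the environment variable names (`EXAMPLE_AGENT_A`, `EXAMPLE_AGENT_B`), the helper names, and the sample result are all hypothetical placeholders.

```python
import json
import sys

# Hypothetical marker variables. These are placeholders for illustration,
# not the fingerprints `dg` actually checks.
AGENT_ENV_MARKERS = {
    "EXAMPLE_AGENT_A": "agent-a",
    "EXAMPLE_AGENT_B": "agent-b",
}


def detect_agent(env):
    """Return the name of the first agent whose marker variable is set, else None."""
    for var, name in AGENT_ENV_MARKERS.items():
        if env.get(var):
            return name
    return None


def render(result, fmt):
    """Serialize a result dict as JSON (for agents) or as plain text (for humans)."""
    if fmt == "json":
        return json.dumps(result, sort_keys=True)
    return "\n".join(f"{key}: {value}" for key, value in result.items())


def run(env, out=sys.stdout, err=sys.stderr):
    """Write data to `out`, diagnostics to `err`, and return an exit code."""
    agent = detect_agent(env)
    fmt = "json" if agent else "text"
    if agent:
        # Diagnostics go to stderr so they never pollute the parsed output.
        print(f"detected agent: {agent}", file=err)
    # Placeholder result standing in for a real transcription response.
    print(render({"transcript": "hello world"}, fmt), file=out)
    return 0
```

Calling `run(os.environ)` from an entry point (after `import os`) would print plain text in an ordinary shell and JSON when one of the placeholder variables is set. Keeping detection in a function that takes the environment as an argument makes the heuristic easy to test without touching the real process environment.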

It supports local files, URLs, microphone input, stdin pipelines, structured output, named profiles, shell completion, setting up skills for your project (also found at https://github.com/deepgram/skills), and MCP server mode for AI tools.

Docs: https://developers.deepgram.com/cli/getting-started
Launch site: https://cli.deepgram.com
Repo: https://github.com/deepgram/cli

Happy to answer questions about the architecture, plugin system, or MCP support.

Similar Projects

Give a Voice to Your AI Agent

The project is a pragmatic, no-friction way to route MCP client output to macOS TTS — you get a runnable speak_server.py, ready-made CLI snippets for Gemini and Claude, and persona profiles that alter spoken behavior. Small but thoughtful extras like dynamic AGENTS.MD and persona presets make it useful for prototyping voice-first agents. The downside is obvious: it’s macOS-only and targets a narrow audience, but for that audience it removes a lot of friction.

Niche Gem · Ship It
pcbmaker20
103mo ago