Connect Deepgram to your AI agent

AI Tools · 8 actions available

Deepgram provides AI-powered speech recognition and understanding services, offering APIs for real-time and pre-recorded audio transcription, text-to-speech, and audio intelligence.

We set up the connection using your own Deepgram account, with keys you control, and keep it running. Your agent picks it up and starts doing the work.

What your agent can do in Deepgram

Each item below is a real action the agent can take on its own, the same kind of thing a person clicking around Deepgram could do. Connections are read-only by default; write actions are checked against your policy.

  • Get Model by ID: Retrieve metadata for a specific Deepgram model by its UUID. Returns detailed model information including name, architecture, supported languages, version, and capabilities. Works for both STT (speech-to-text) and TTS (…
  • Get Public Models: Retrieve metadata on all public Deepgram models (speech-to-text and text-to-speech). Returns comprehensive model information including supported languages, architectures, versions, and capabilities. Set include_outdated…
  • List Deepgram Projects: List all Deepgram projects. Use after authenticating with your API key.
  • Get Project Usage Summary: Retrieve aggregated usage statistics for a Deepgram project, including total audio duration, billable duration, number of requests, channels processed, and confidence/relevance scores. Returns both overall totals and br…
  • List Project Scopes: List all permission scopes for a specified Deepgram project.
  • List Think Models: List the think models available for AI agent processing and voice agent configuration.
  • Transcribe pre-recorded audio: Transcribe a pre-recorded audio file into text, given a publicly accessible audio file URL. The primary transcript is at `results.channels[0].alternatives[0].transcript` in the response. Silent…
  • Text to speech (REST): Convert text into natural-sounding speech. Use when you need TTS audio from text input.
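As an illustration of the transcription action, here is a minimal Python sketch of what sits behind it: building a request for a hosted audio file and reading the transcript from the path quoted above. It assumes Deepgram's public REST endpoint `https://api.deepgram.com/v1/listen` with `Token` authorization; the function names are hypothetical, the request is built but not sent, and returning an empty string when the path is missing is a choice made for this sketch, not documented behavior of the connection.

```python
import json
import urllib.request

# Assumed endpoint: Deepgram's public pre-recorded transcription API.
DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_transcription_request(api_key: str, audio_url: str) -> urllib.request.Request:
    """Build (but do not send) a POST request to transcribe a publicly hosted file."""
    body = json.dumps({"url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        DEEPGRAM_LISTEN_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
    )


def extract_transcript(response: dict) -> str:
    """Return results.channels[0].alternatives[0].transcript.

    Falls back to "" if any step of the path is missing (a choice made
    for this sketch only).
    """
    try:
        return response["results"]["channels"][0]["alternatives"][0]["transcript"]
    except (KeyError, IndexError, TypeError):
        return ""
```

To send the request, pass the result of `build_transcription_request` to `urllib.request.urlopen`, parse the body with `json.load`, and hand the resulting dictionary to `extract_transcript`.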

How we connect it

  1. Connect your account

    You create an API key in your Deepgram account and paste it in once. It lives in a secrets store on your server, not with us.

  2. Set the guardrails

    Read-only by default. You choose which write actions the agent may take, and anything outside that policy gets confirmed with you first.

  3. We keep it running

    Health checks on every connection, updates handled for you, and we watch the first week of activity to make sure the work lands.
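The guardrail step can be pictured as a small policy check. The Python sketch below is illustrative only, not the actual implementation: the action identifiers, the `Policy` class, and the split between read-only and other actions are all assumptions made for the example.

```python
from dataclasses import dataclass

# Hypothetical identifiers for actions treated as read-only in this sketch.
READ_ONLY_ACTIONS = frozenset({
    "get_model_by_id",
    "get_public_models",
    "list_projects",
    "get_project_usage_summary",
    "list_project_scopes",
    "list_think_models",
})


@dataclass(frozen=True)
class Policy:
    """Write actions the account owner has pre-approved (empty by default)."""
    allowed_writes: frozenset = frozenset()

    def decide(self, action: str) -> str:
        """Return "allow" to run the action, or "confirm" to ask the owner first."""
        if action in READ_ONLY_ACTIONS or action in self.allowed_writes:
            return "allow"
        # Anything outside the policy is confirmed with the owner first.
        return "confirm"
```

With the default `Policy()`, `decide("list_projects")` returns `"allow"` and `decide("text_to_speech")` returns `"confirm"`. Once `"text_to_speech"` is added to `allowed_writes`, it is allowed without a prompt.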

Deepgram questions, answered.

With a key you create and control. You paste it in once, it is stored in a secrets store on your server, permissions are scoped to the minimum the agent needs, and you can revoke it at any time.
The actions Deepgram's API allows, the same things a person clicking around the app could do. Connections start read-only by default; write actions are confirmed against the policy you set before the agent takes them.
Connections are priced per tool on top of the base plan. Some are included, some are premium. See pricing for how connection charges work.
Standard tools are ready inside 7 business days of the setup call. We test the connection end to end, walk you through how the agent uses it, and watch the first week of activity.

Ready to put Deepgram to work?

Tell us what your team runs on. We set up the connection, secure it, and your agent takes it from there.

All product names, logos, and brands are property of their respective owners; used for identification only. ZeroToClaw is not affiliated with or endorsed by Deepgram.