About
What is Deepgram?
Deepgram offers enterprise-grade voice AI solutions, including Speech-to-Text (STT), Text-to-Speech (TTS), and Voice Agent APIs. It provides highly accurate, real-time transcription and synthesis, supporting over 45 languages with advanced features like Speaker Diarization, Smart Formatting, and Automatic Language Detection. Deepgram unifies STT, TTS, and LLM orchestration into a single Voice Agent API, reducing complexity and latency. The platform supports both real-time streaming and pre-recorded audio processing at the same low rate. Additionally, it offers Audio Intelligence features such as Summarization, Topic Detection, and Sentiment Analysis. Deepgram is available in cloud and self-hosted deployments, with options for custom models and enterprise-level compliance like SOC 2 Type 2 and HIPAA.
Best used for
Ideal for developers and product teams who need to integrate advanced speech recognition, text-to-speech, and conversational AI into their applications. Especially valuable for enterprises requiring scalable, accurate, and compliant voice solutions for contact centers, speech analytics, and conversational AI platforms.
Common actions
audio intelligenceBusiness applicationsaispeech-to-texttext to speechAPItranscriptionaudio editingstartupscontact centers+ 2 more
Capabilities
Key features
- Speech-to-Text API
- Text-to-Speech API
- Voice Agent API
- Audio Intelligence
- Real-time transcription
- Multilingual support
- Custom models
Target Audience
content creatordata scientistpodcaster
Integrations
amazon-connect
Pricing & Plans
Freemium ยท Paid ยท Usage-based ยท Enterprise
FAQs
How much does Deepgram Speech-to-Text cost per hour?
Deepgram's Pay-As-You-Go pricing for the standard Nova model is approximately $0.46 per hour ($0.0077/min). For high-volume users on the Growth plan, the cost drops to roughly $0.39 per hour, significantly lower than many major cloud providers.
Does Deepgram charge for silence or round up audio time?
No, Deepgram uses true per-second billing. You only pay for the exact duration of your audio file. Many competitors round up to the nearest 15 seconds or full minute, which can inflate invoices by 15-20%.
What is included in the $200 free credit?
Every new account receives $200 in free credit, equivalent to approximately 43,000 minutes (over 700 hours) of transcription using the Nova model. This credit does not expire, allowing ample time for prototyping without pressure.
How does pricing work for the Voice Agent API?
The Voice Agent API is billed based on the duration of the conversation, covering Speech-to-Text, LLM processing, and Text-to-Speech orchestration. If you use your own LLM, you only pay Deepgram for the audio components and orchestration, potentially lowering costs.
Is Deepgram HIPAA and SOC 2 compliant?
Yes, Deepgram is SOC 2 Type 2 certified and HIPAA compliant. They can sign a Business Associate Agreement (BAA) for Enterprise customers handling sensitive healthcare data, ensuring data security and privacy.