
AssemblyAI - Speech to Text API
- Features
- Ease of use
- Ease of management
- Quality of support
- Affordability
- Market presence
- Media and communications
- Arts, entertainment, and recreation
- Information technology and software
What is AssemblyAI - Speech to Text API
Developer-first API integration
Speech intelligence add-ons
Scales for production workloads
Cloud dependency and data handling
Accuracy varies by domain
Cost at high volumes
Plan & Pricing
Pricing model: Pay-as-you-go Free tier/trial: $50 in credits on sign-up (equivalent to up to 185 hours pre-recorded or 333 hours streaming as stated on the official Pricing page). Free credit available to new accounts; LLM Gateway not available on the free tier.
Example costs (key Speech-to-Text models & add-ons) — billed per hour, prorated to the second:
-
Pre-recorded Speech-to-Text:
- Universal-3 Pro — $0.21 / hr.
- Universal-2 — $0.15 / hr.
- Prompting (add-on) — $0.05 / hr.
- Keyterms Prompting (add-on) — $0.05 / hr.
- Speaker Diarization (add-on) — $0.02 / hr.
-
Streaming Speech-to-Text:
- Universal-Streaming — $0.15 / hr.
- Universal-Streaming Multilingual — $0.15 / hr.
- Keyterms Prompting (streaming add-on) — $0.04 / hr.
-
Speech Understanding (audio intelligence) — (examples):
- Speaker Identification — $0.02 / hr.
- Translation — $0.06 / hr.
- Custom Formatting — $0.03 / hr.
- Entity Detection — $0.08 / hr.
- Sentiment Analysis — $0.02 / hr.
- Auto Chapters — $0.08 / hr.
- Key Phrases — $0.01 / hr.
- Topic Detection — $0.15 / hr.
- Summarization — $0.03 / hr.
-
Guardrails / Safety features (examples):
- Profanity Filtering — $0.01 / hr.
- PII Audio Redaction — $0.05 / hr.
- PII Redaction (text) — $0.08 / hr.
- Content Moderation — $0.15 / hr.
-
LLM Gateway (tokenized pricing examples):
- GPT-5.2 — $1.75 / 1M input tokens; $14.00 / 1M output tokens.
- GPT-5.1 — $1.25 / 1M input; $10.00 / 1M output.
- (Multiple other LLMs listed with per-1M-token input/output rates on the pricing page.)
Discounts / enterprise: Volume discounts and enterprise/tiered pricing are available by contacting AssemblyAI sales (custom pricing, rate limits, enhanced concurrency, self-hosting options).
Billing notes: Rates are listed per hour but pro-rated to the second. Multichannel audio is billed per channel (each channel transcribed and billed separately).