Speech-to-Text

Experience industry-leading speech-to-text accuracy with Speech AI models at the cutting edge of AI research, accessible through a simple API.

Use our API Try our API for free

Universal-2

State-of-the-art multilingual speech-to-text model

>92.5%

Accuracy*

30.4s

Latency on 30 min audio file

12.5M

Hours of multilingual training data

Industry’s lowest Word Error Rate (WER)

See how Universal-2 performs against other Automatic Speech Recognition providers.
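For readers unfamiliar with the metric, Word Error Rate is conventionally the number of word substitutions, deletions, and insertions needed to turn the model's transcript into the reference transcript, divided by the number of words in the reference. The sketch below computes it with a standard word-level edit distance. The function name, the lowercase-and-split normalization, and the sample sentences are illustrative assumptions, not the methodology behind the benchmark figures quoted on this page.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / words in reference."""
    # Assumption: naive normalization (lowercase, whitespace split).
    # Real benchmarks usually apply a more careful text normalizer.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Illustrative sentences only: one substitution plus one deletion
# against a five-word reference.
wer = word_error_rate("the world record is gone", "the world records gone")
print(f"WER: {wer:.2f}")
```

A lower value is better, and a value of 0 means the transcript matches the reference word for word.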

Read our research

See it in action

Away. First time. Good start from. From Bolt. Bulk lead in the moment and going away. Gay trying to go with him. And he's going. Being dragged through to second place, but he's going to win it by 2 meters. 9.58. The world record's gone. That's more like it. Sub nine six.

Try our playground

*Benchmark performed across 11 datasets, including 8 academic datasets & 3 internally curated datasets representing real world English audio.

Harness best-in-class accuracy and powerful Speech AI capabilities

International Language Support

Transcribe audio in 99+ languages and counting, including Global English (English and all of its accents).

See how in docs

Speaker Diarization

Detect the number of speakers in your audio file, with each word in the text associated with its speaker.
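As a rough illustration of what per-word speaker labels make possible, the sketch below groups consecutive words from the same speaker into turns and counts the distinct speakers. The input shape (a list of dictionaries with `text` and `speaker` keys) and the sample words are hypothetical, chosen only for this example; consult the docs for the actual response format.

```python
from itertools import groupby

# Hypothetical per-word output: each word carries a speaker label.
words = [
    {"text": "Hello", "speaker": "A"},
    {"text": "there.", "speaker": "A"},
    {"text": "Hi,", "speaker": "B"},
    {"text": "how", "speaker": "B"},
    {"text": "are", "speaker": "B"},
    {"text": "you?", "speaker": "B"},
    {"text": "Fine.", "speaker": "A"},
]


def speaker_turns(words):
    """Merge runs of consecutive words with the same speaker into turns."""
    turns = []
    for speaker, run in groupby(words, key=lambda w: w["speaker"]):
        turns.append((speaker, " ".join(w["text"] for w in run)))
    return turns


turns = speaker_turns(words)
speaker_count = len({w["speaker"] for w in words})

print(f"Speakers detected: {speaker_count}")
for speaker, text in turns:
    print(f"Speaker {speaker}: {text}")
```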

See how in docs

Automatic Language Detection

Automatically detect if the dominant language of the spoken audio is supported by our API and route it to the appropriate model for transcription.

See how in docs

Word Timestamps

View word-by-word timestamps across the entire transcript text.
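Word-level timestamps are typically used to jump to a moment in the audio or to build captions. The sketch below formats each word with its start time. It assumes, purely for illustration, that each word arrives as a dictionary with `text`, `start`, and `end` keys and that times are in milliseconds; the field names, units, and sample values are assumptions rather than the documented response format.

```python
def format_ms(ms: int) -> str:
    """Render a millisecond offset as MM:SS.mmm."""
    minutes, rest = divmod(ms, 60_000)
    seconds, millis = divmod(rest, 1000)
    return f"{minutes:02d}:{seconds:02d}.{millis:03d}"


# Hypothetical word timings (start and end offsets in milliseconds).
words = [
    {"text": "The", "start": 61_250, "end": 61_400},
    {"text": "world", "start": 61_400, "end": 61_790},
    {"text": "record's", "start": 61_790, "end": 62_300},
    {"text": "gone.", "start": 62_300, "end": 62_810},
]

lines = [f"[{format_ms(w['start'])}] {w['text']}" for w in words]
print("\n".join(lines))
```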

See how in docs

Profanity Filtering

Detect and replace profanity in the transcription text with ease.

See how in docs

Auto Punctuation and Casing

Automatically add punctuation and proper-noun casing to the transcription text.

See how in docs

Custom Vocabulary

Boost accuracy for vocabulary that is unique or custom to your specific use case or product.

See how in docs

Confidence Scores

Get a confidence score for each word in the transcript.
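One common use of per-word confidence is to flag uncertain words for human review. The sketch below does that with a simple threshold. The input shape, the assumption that scores fall between 0 and 1, the threshold of 0.8, and the sample values are all hypothetical choices for this example.

```python
# Hypothetical per-word output: confidence assumed to lie in [0, 1].
words = [
    {"text": "The", "confidence": 0.99},
    {"text": "world", "confidence": 0.98},
    {"text": "record's", "confidence": 0.61},
    {"text": "gone", "confidence": 0.93},
]


def low_confidence_words(words, threshold=0.8):
    """Return the text of every word scored below the threshold."""
    return [w["text"] for w in words if w["confidence"] < threshold]


def mean_confidence(words):
    """Average confidence across all words, or 0.0 for an empty list."""
    if not words:
        return 0.0
    return sum(w["confidence"] for w in words) / len(words)


flagged = low_confidence_words(words)
average = mean_confidence(words)
print(f"Needs review: {flagged}")
print(f"Mean confidence: {average:.2f}")
```

The right threshold depends on the audio quality and on how costly a missed error is for the application, so it is worth tuning against a sample of reviewed transcripts.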

See how in docs

See all in docs

Continuously up-to-date and secure

Regular enhancements

Explore our changelog for detailed updates on the most recent product enhancements and improvements.

Enterprise-grade security

AssemblyAI is committed to the highest standards of security practices to keep your data and your customers' data safe.

Read more about our security

AssemblyAI's accuracy is better than any other tools in the market (and we have tried them all).

Vedant Maheshwari, Co-Founder and CEO

Explore more

Streaming Speech-to-Text

Transcribe audio streams in real time with high accuracy and low latency.

Speech Understanding

Extract maximum value from voice data with Audio Intelligence, and leverage Large Language Models with LeMUR.

Turn voice data into unparalleled product experiences

Partner with the leader in Speech AI to build powerful products with breakthrough industry impact.

Try our API for free Contact sales