What languages are supported for Streaming Speech-to-text?

Available Models and Language Support

Model	Languages Supported
Universal-3.5 Pro Streaming	English, Spanish, German, French, Portuguese, Italian, Turkish, Dutch, Swedish, Norwegian, Danish, Finnish, Hindi, Vietnamese, Arabic, Hebrew, Japanese, Mandarin (native code switching across 18 languages)
Universal-Streaming English	English only
Universal-Streaming Multilingual	English, Spanish, German, French, Portuguese, Italian (per turn)

Choosing the Right Model

If you need the highest accuracy with multilingual code switching across up to 18 languages, use Universal-3.5 Pro Streaming (universal-3-5-pro).

If you only need English transcription at the lowest cost, use Universal-Streaming English (universal-streaming-english).

If you need multilingual support at a lower cost, use Universal-Streaming Multilingual (universal-streaming-multilingual).

For more details on model selection, see the Model selection page.

Difference From Pre-recorded STT

It’s important to note that the language_code parameter mentioned in some AssemblyAI documentation applies to the Pre-recorded STT feature, not the Streaming Transcription feature. For real-time STT, you specify the model using the speech_model parameter.

To stay informed about new features and improvements, including language support updates, you can follow our Changelog.

​Available Models and Language Support

​Choosing the Right Model

​Difference From Pre-recorded STT

Available Models and Language Support

Choosing the Right Model

Difference From Pre-recorded STT