The speech_model connection parameter lets you specify which model to use for streaming transcription. It’s optional; if omitted, the session defaults to universal-3-5-pro.

Available models

| Name | Parameter | Description | Best for |
| --- | --- | --- | --- |
| Universal-3.5 Pro Streaming (Recommended) | "universal-3-5-pro" | The next generation of our flagship streaming model. More languages, improved prompting, and enhanced conversational context features. | Real-time voice agents needing the latest accuracy, broad language support, and conversational context. |
| Universal-Streaming English | "universal-streaming-english" | An English transcription model offering a good balance of speed and cost-effectiveness. | Cost-effective English real-time transcription, English-only real-time apps |
| Universal-Streaming Multilingual | "universal-streaming-multilingual" | A multilingual transcription model offering a good balance of speed and cost-effectiveness. | Cost-effective multilingual streaming across EN/ES/DE/FR/PT/IT |

Choosing a model

| Feature | Universal-3.5 Pro Streaming (Recommended) | Universal-Streaming English | Universal-Streaming Multilingual |
| --- | --- | --- | --- |
| Latency | Fastest | Fast | Fast |
| Supported languages | EN, ES, DE, FR, PT, IT, TR, NL, SV, NO, DA, FI, HI, VI, AR, HE, JA, ZH | EN | EN, ES, DE, FR, PT, IT |
| Partial transcripts | Yes | Yes | Yes |
| Multilingual | Native code switching | No | Per turn |
| Entity accuracy | Best | Okay | Okay |
| Disfluencies & filler words | Yes | No | No |
| Language detection | Yes | No | Yes |
| Non-speech tags | No | No | No |
| Customization | Keyterms prompting (known context) + Native prompting (unknown context) | Keyterms prompting (known context) | Keyterms prompting (known context) |

For detailed setup and configuration of Universal-3.5 Pro Streaming, see the Universal-3.5 Pro Streaming page. For prompting guidance, see the Prompting guide.
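
The comparison above can be turned into a small selection routine. The following is a minimal sketch, not part of the API: `choose_speech_model` and its arguments are hypothetical names introduced here for illustration, and the language sets are copied from the "Supported languages" row. It returns the recommended model unless the caller opts into the cost-effective models and their language coverage is sufficient.

```python
# Language sets copied from the "Supported languages" row of the comparison table.
PRO_LANGS = {
    "en", "es", "de", "fr", "pt", "it", "tr", "nl", "sv",
    "no", "da", "fi", "hi", "vi", "ar", "he", "ja", "zh",
}
MULTILINGUAL_LANGS = {"en", "es", "de", "fr", "pt", "it"}


def choose_speech_model(languages, needs_code_switching=False, prefer_low_cost=False):
    """Hypothetical helper: map requirements to a speech_model value."""
    langs = {code.lower() for code in languages}

    # Universal-3.5 Pro has the widest coverage, so anything outside it is unsupported.
    unsupported = langs - PRO_LANGS
    if unsupported:
        raise ValueError(f"No streaming model listed covers: {sorted(unsupported)}")

    # Only the Pro model is listed with native code switching.
    if prefer_low_cost and not needs_code_switching:
        if langs <= {"en"}:
            return "universal-streaming-english"
        if langs <= MULTILINGUAL_LANGS:
            return "universal-streaming-multilingual"

    # Default to the recommended model.
    return "universal-3-5-pro"
```

For example, `choose_speech_model(["en", "es"], prefer_low_cost=True)` selects the multilingual model, while adding `needs_code_switching=True` falls back to Universal-3.5 Pro.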

Set the model

Set the speech_model connection parameter when you open the WebSocket.
CONNECTION_PARAMS = {
    "sample_rate": 16000,
    "speech_model": "universal-3-5-pro",  # or "universal-streaming-english" / "universal-streaming-multilingual"
}
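
Connection parameters are typically passed as a query string on the WebSocket URL. The sketch below shows one way to encode them with the standard library. The host in `API_ENDPOINT_BASE_URL` is a placeholder and `build_streaming_url` is a hypothetical helper; substitute the real streaming endpoint from the API reference.

```python
from urllib.parse import urlencode

# Placeholder endpoint (assumption): replace with the documented streaming URL.
API_ENDPOINT_BASE_URL = "wss://streaming.example.com/ws"

CONNECTION_PARAMS = {
    "sample_rate": 16000,
    "speech_model": "universal-3-5-pro",
}


def build_streaming_url(params, base_url=API_ENDPOINT_BASE_URL):
    """Hypothetical helper: append connection parameters as a query string."""
    return f"{base_url}?{urlencode(params)}"


url = build_streaming_url(CONNECTION_PARAMS)
```

Omitting `speech_model` from the dictionary leaves the choice to the server-side default described at the top of this page.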