Getting started

Introducing Universal-3 Pro

Learn how to transcribe audio using Universal-3 Pro.

Overview

Universal-3 Pro is our most powerful Voice AI model yet, designed to capture the “hard stuff” that traditional ASR models struggle with. The model out of the box outperforms all ASR models on the market on accuracy, especially as it pertains to entities and rare words. With prompting, you can get an entirely customized transcription output that rivals near-human-level transcription.

Universal-3 Pro is available for both pre-recorded (async) and streaming use cases. Configuration and settings differ between the two because streaming is optimized for real-time audio utterances typically under 10 seconds, with special efficiencies built into the model for low-latency turn detection and voice agent workflows.

Based on your use case, navigate to the appropriate guide below: