Realtime Speech-to-Text API

Real-time transcription your notetaker, agents, and captions can depend on

Universal-3 Pro Streaming

Your transcriptions will show here...

Runway
Dovetail
Granola
Supernormal
Ashby
Jiminny
Calabrio
JotPsych
EdgeTier
Genio
WhatConverts
Earmark
Grain
Loop
CallRail
Happy Scribe
Veed.io
Delphi
Runway
Dovetail
Granola
Supernormal
Ashby
Jiminny
Calabrio
JotPsych
EdgeTier
Genio
WhatConverts
Earmark
Grain
Loop
CallRail
Happy Scribe
Veed.io
Delphi
Runway
Dovetail
Granola
Supernormal
Ashby
Jiminny
Calabrio
JotPsych
EdgeTier
Genio
WhatConverts
Earmark
Grain
Loop
CallRail
Happy Scribe
Veed.io
Delphi
Runway
Dovetail
Granola
Supernormal
Ashby
Jiminny
Calabrio
JotPsych
EdgeTier
Genio
WhatConverts
Earmark
Grain
Loop
CallRail
Happy Scribe
Veed.io
Delphi
Models

Pick the model that fits your workload

Real-time transcription fast enough for voice agents, accurate enough for production.

Compare features

Model
U-3 Pro Streaming Voice agents, AI scribes
Universal Streaming AI notetakers, call centers
Univ. Multilingual Global contact centers
Price
$0.45 /hr
$0.15 /hr
$0.15 /hr
Languages
EN, ES, FR, DE, IT, PT
English
EN, ES, FR, DE, IT, PT
Natural language prompting
Up to ~1,500 words
Keyterm prompting
Up to ~100 words
Up to ~100 words
Up to ~100 words
Code-switching
Speaker diarization
10+ speakers
10+ speakers
10+ speakers
Medical terminology
Medical mode add-on
Medical mode add-on
Medical mode add-on
HIPAA BAA
On request
On request
On request
Unlimited concurrency
Use cases

Built for every voice workflow

Real-time transcription powers every application where you stream audio.

Common questions