domain connection parameter to "medical-v1". No other changes to your existing pipeline are required.
Quickstart
Set domain to "medical-v1" as a connection parameter when you open the WebSocket.
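As a minimal sketch of the step above, the snippet below builds a WebSocket URL that carries domain as a query-string connection parameter. The endpoint `wss://streaming.example.com/v3/ws` is a placeholder and `sample_rate` is an assumed extra parameter; neither comes from this page, so substitute the URL and parameters from your own existing pipeline.

```python
from urllib.parse import urlencode

# Placeholder endpoint (assumption): replace with your streaming WebSocket URL.
BASE_URL = "wss://streaming.example.com/v3/ws"


def build_streaming_url(base_url: str, **params) -> str:
    """Append connection parameters to the WebSocket URL as a query string."""
    return f"{base_url}?{urlencode(params)}"


# sample_rate is an assumed, illustrative parameter; domain is the one
# described in this guide.
url = build_streaming_url(BASE_URL, sample_rate=16000, domain="medical-v1")
print(url)
```

Pass the resulting URL to whichever WebSocket client you already use; the rest of the pipeline stays unchanged.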
Example output
Without Medical Mode:

Use cases
Medical Mode is designed for healthcare AI applications where accurate medical terminology is critical:
- Ambient clinical documentation — Capture medication names, dosages, and clinical terms correctly during live patient encounters.
- Real-time medical scribes — Deliver accurate transcripts to clinicians during or immediately after a consult.
- Front-office voice agents — Handle drug names, provider names, and clinic-specific terminology in scheduling calls and insurance verification.
- Medical contact centers — Transcribe calls with correct medical vocabulary for downstream processing and quality assurance.
Combine with other features
Medical Mode works alongside other streaming features. You can combine it with:
- Streaming Diarization to identify who said what in clinical conversations
- Keyterms Prompting to further boost accuracy for specific medical terms unique to your use case
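A hedged sketch of combining these features in a single connection follows. The parameter names `speaker_labels` and `keyterms_prompt`, the JSON encoding of the term list, the example drug names, and the placeholder endpoint are all assumptions for illustration; this page does not specify them, so check the API reference for the exact names and formats.

```python
import json
from urllib.parse import urlencode

# Placeholder endpoint (assumption): replace with your streaming WebSocket URL.
BASE_URL = "wss://streaming.example.com/v3/ws"

params = {
    "domain": "medical-v1",  # Medical Mode, as described in this guide
    # Assumed parameter name for Streaming Diarization.
    "speaker_labels": "true",
    # Assumed parameter name and encoding for Keyterms Prompting.
    "keyterms_prompt": json.dumps(["metoprolol", "atorvastatin"]),
}

combined_url = f"{BASE_URL}?{urlencode(params)}"
print(combined_url)
```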
Configuration for medical audio
Medical conversations, such as clinical dictation, patient encounters, and ambient scribes, have different speech patterns than typical voice agent interactions. Clinicians often pause mid-sentence to think, review a chart, or formulate a diagnosis. The default turn detection settings are optimized for fast-paced voice agent dialogues and can incorrectly fragment these natural pauses into separate turns. To prevent premature turn boundaries in medical audio, increase the silence thresholds:

| Parameter | Default | Recommended for Medical | Why |
|---|---|---|---|
| min_turn_silence | 100 ms (U3 Pro) / 400 ms (Universal Streaming) | 800 ms | Gives clinicians time to pause mid-sentence without triggering a speculative end-of-turn check. |
| max_turn_silence | 1000 ms (U3 Pro) / 1280 ms (Universal Streaming) | 3600 ms | Allows extended pauses for chart review or thinking without forcing a turn boundary. |

These values match the Conservative quick start configuration on the turn detection page. You can further adjust them based on your specific workflow. For example, a real-time medical scribe may benefit from a lower max_turn_silence (around 2000 ms) than a dictation application.
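The recommended thresholds can be sketched as connection parameters like this. The helper name `medical_turn_params` and the placeholder endpoint are hypothetical, and passing the thresholds in the query string alongside domain is an assumption; the values themselves (800 ms, 3600 ms, and roughly 2000 ms for a scribe) come from the table and text above.

```python
from urllib.parse import urlencode

# Placeholder endpoint (assumption): replace with your streaming WebSocket URL.
BASE_URL = "wss://streaming.example.com/v3/ws"


def medical_turn_params(max_turn_silence_ms: int = 3600) -> dict:
    """Return Medical Mode parameters with the recommended silence thresholds.

    Hypothetical helper: values are in milliseconds.
    """
    return {
        "domain": "medical-v1",
        "min_turn_silence": 800,
        "max_turn_silence": max_turn_silence_ms,
    }


# Dictation: allow long pauses for chart review or thinking.
dictation_url = f"{BASE_URL}?{urlencode(medical_turn_params())}"

# Real-time scribe: a lower ceiling so turns close sooner.
scribe_url = f"{BASE_URL}?{urlencode(medical_turn_params(2000))}"

print(dictation_url)
print(scribe_url)
```

Keeping the thresholds in one helper makes it easier to tune max_turn_silence per workflow without touching the rest of the connection setup.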