Supported models: u3-rt-pro, universal-streaming-english, universal-streaming-multilingual. Languages: en, es, de, fr. Regions: US & EU.
Medical Mode is an add-on that enhances streaming transcription accuracy for medical terminology — including medication names, procedures, conditions, and dosages. It is optimized for medical entity recognition to correct terms that other models frequently get wrong.
Medical Mode can be used with all of our Streaming STT models.
Enable Medical Mode by setting the domain connection parameter to "medical-v1". No other changes to your existing pipeline are required.
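As a minimal sketch of the setting described above, the snippet below appends `domain=medical-v1` to a streaming connection URL. The endpoint URL, the helper name `build_streaming_url`, and the `sample_rate` parameter are assumptions for illustration; only the `domain` parameter and its `"medical-v1"` value come from this page.

```python
from urllib.parse import urlencode

# Assumed endpoint for illustration; use the URL from the streaming docs.
BASE_URL = "wss://streaming.assemblyai.com/v3/ws"


def build_streaming_url(base_url: str = BASE_URL, **params) -> str:
    """Return a connection URL with Medical Mode enabled via `domain`."""
    # Keep any existing connection parameters and add the domain setting.
    query = {**params, "domain": "medical-v1"}
    return f"{base_url}?{urlencode(query)}"


# `sample_rate` stands in for whatever parameters your pipeline already sends.
url = build_streaming_url(sample_rate=16000)
print(url)
```

The rest of the pipeline (audio capture, WebSocket handling, message parsing) would stay as it is; only the connection parameters change.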
Medical Mode is billed as a separate add-on. See the pricing page for details.
Without Medical Mode:
With Medical Mode, "lispro humalog" is updated to "Lispro (Humalog)", following the standard medical convention of writing the generic name first, with the brand name in parentheses.
Medical Mode is designed for healthcare AI applications where accurate medical terminology is critical.
Medical Mode works alongside other streaming features. You can combine it with:
Medical conversations — such as clinical dictation, patient encounters, and ambient scribes — have different speech patterns than typical voice agent interactions. Clinicians often pause mid-sentence to think, review a chart, or formulate a diagnosis. The default turn detection settings are optimized for fast-paced voice agent dialogues and can incorrectly fragment these natural pauses into separate turns.
To prevent premature turn boundaries in medical audio, increase the silence thresholds:
These values match the Conservative quick start configuration on the turn detection page. You can further adjust them based on your specific workflow — for example, a real-time medical scribe may benefit from a lower max_turn_silence (around 2000 ms) than a dictation application.
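One way to keep per-workflow tuning explicit is a small preset table, sketched below. The helper `with_turn_silence` and the preset names are hypothetical. Only the roughly 2000 ms scribe value comes from the text above; the dictation value is a placeholder and is not the documented Conservative setting, so replace it with the value from the turn detection page.

```python
# Hypothetical presets, in milliseconds.
# "scribe" follows the ~2000 ms suggestion in the text above.
# "dictation" is a placeholder, NOT the documented Conservative value.
MAX_TURN_SILENCE_MS = {
    "scribe": 2000,
    "dictation": 3000,
}


def with_turn_silence(params: dict, workflow: str) -> dict:
    """Return a copy of params with max_turn_silence set for the workflow."""
    if workflow not in MAX_TURN_SILENCE_MS:
        raise ValueError(f"unknown workflow: {workflow!r}")
    return {**params, "max_turn_silence": MAX_TURN_SILENCE_MS[workflow]}


base = {"domain": "medical-v1"}
scribe_params = with_turn_silence(base, "scribe")
dictation_params = with_turn_silence(base, "dictation")
```

Returning a copy rather than mutating `base` lets the same starting parameters be reused across workflows.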
If you are using a Universal Streaming model (not U3 Pro), do not set end_of_turn_confidence_threshold to 0. This completely disables semantic turn detection and forces a turn boundary at every silence, which is especially harmful for medical audio where mid-sentence pauses are common. See Turn detection for details.
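A client-side guard can catch this misconfiguration before a connection is opened. The sketch below is illustrative only: `check_turn_threshold` is a hypothetical helper, and it assumes Universal Streaming model identifiers begin with `universal-streaming`, as in the model names listed at the top of this page.

```python
def check_turn_threshold(speech_model: str, params: dict) -> None:
    """Raise ValueError if a Universal Streaming model uses a threshold of 0."""
    threshold = params.get("end_of_turn_confidence_threshold")
    # Assumption: Universal Streaming identifiers share this prefix.
    is_universal = speech_model.startswith("universal-streaming")
    if is_universal and threshold is not None and float(threshold) == 0.0:
        raise ValueError(
            "end_of_turn_confidence_threshold=0 disables semantic turn "
            f"detection for {speech_model}; use a non-zero value"
        )


# Passes silently: the restriction does not apply to U3 Pro.
check_turn_threshold("u3-rt-pro", {"end_of_turn_confidence_threshold": 0})

# Raises ValueError for a Universal Streaming model.
try:
    check_turn_threshold(
        "universal-streaming-english",
        {"end_of_turn_confidence_threshold": 0},
    )
except ValueError as err:
    print(f"rejected: {err}")
```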
AssemblyAI offers a Business Associate Agreement (BAA) for customers who need to process Protected Health Information (PHI). AssemblyAI is SOC 2 Type 2, ISO 27001:2022, and PCI DSS v4.0 certified. Medical Mode does not change existing data handling or retention policies.
For BAA setup or enterprise pricing, contact our sales team.