For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
PlaygroundChangelogSign In
OverviewAPI ReferencePre-recorded STTStreaming STTVoice AgentsSpeech UnderstandingGuardrailsLLM GatewayFAQ
OverviewAPI ReferencePre-recorded STTStreaming STTVoice AgentsSpeech UnderstandingGuardrailsLLM GatewayFAQ
  • Getting started
    • Transcribe a pre-recorded audio file
    • Model selection
    • View model benchmarks
    • Evaluate model accuracy
    • Cloud endpoints & data residency
    • Manage concurrent requests
    • Webhooks
  • Models
    • Medical Mode
  • Features
    • Boost specific terms
    • Label speakers
    • Transcribe multiple audio channels
    • Transcribe audio with mixed languages
    • Correct spelling of terms
    • Include filler words
    • Search for words in transcript
    • Set the start and end of the transcript
  • Guides
LogoLogo
PlaygroundChangelogSign In
On this page
  • Quickstart
  • Example output
  • Use cases
  • Combine with other features
  • HIPAA compliance
Models

Medical Mode

Was this page helpful?
Previous

Keyterms Prompting

Next
Built with
Supported models
Universal-3 Prouniversal-3-pro
Universal-2universal-2

Supported languages
Englishen
Spanishes
Germande
Frenchfr

Supported regions

US & EU

Medical Mode is an add-on that enhances transcription accuracy for medical terminology — including medication names, procedures, conditions, and dosages. It is optimized for medical entity recognition to correct terms that other models frequently get wrong.

Medical Mode can be used with all of our Pre-recorded STT models.

Enable Medical Mode by setting the domain parameter to "medical-v1". No changes to your existing pipeline are required.

Medical Mode is billed as a separate add-on. See the pricing page for details.

Medical Mode supports English, Spanish, German, and French.

If you use Medical Mode with an unsupported language, the API ignores the domain parameter and returns a warning indicating that Medical Mode was not applied: "Skipped medical-v1 domain correction because the language is not supported"

Your transcript is still returned using standard transcription, and you will not be charged for Medical Mode.

Quickstart

Python
Python SDK
JavaScript
JavaScript SDK

To enable Medical Mode, set domain to "medical-v1" in the POST request body:

1import requests
2import time
3
4base_url = "https://api.assemblyai.com"
5headers = {"authorization": "<YOUR_API_KEY>"}
6
7data = {
8 "audio_url": "https://assembly.ai/lispro",
9 "language_detection": True,
10 "speech_models": ["universal-3-pro", "universal-2"],
11 "domain": "medical-v1"
12}
13
14response = requests.post(base_url + "/v2/transcript", headers=headers, json=data)
15
16if response.status_code != 200:
17 print(f"Error: {response.status_code}, Response: {response.text}")
18 response.raise_for_status()
19
20transcript_response = response.json()
21transcript_id = transcript_response["id"]
22polling_endpoint = f"{base_url}/v2/transcript/{transcript_id}"
23
24while True:
25 transcript = requests.get(polling_endpoint, headers=headers).json()
26 if transcript["status"] == "completed":
27 print(transcript["text"])
28 break
29 elif transcript["status"] == "error":
30 raise RuntimeError(f"Transcription failed: {transcript['error']}")
31 else:
32 time.sleep(3)
Your browser does not support the audio element. Download the audio.

Example output

Without Medical Mode:

1I have here insulin to be used for both prandial mealtime and sliding scale is
2insulin lisprohumalog subcutaneously.

With Medical Mode, lisprohumalog is updated to Lispro (Humalog) - following the standard medical convention of writing the generic name first, with the brand name in parentheses.

1I have here insulin to be used for both prandial mealtime and sliding scale is
2insulin Lispro (Humalog) subcutaneously.

Use cases

Medical Mode is designed for healthcare AI applications where accurate medical terminology is critical:

  • Ambient clinical documentation — Capture medication names, dosages, and clinical terms correctly in real-time scribing workflows.
  • AI-powered clinical notes — Generate clean transcripts for downstream LLMs producing SOAP notes, discharge summaries, and referral letters.
  • Front-office automation — Handle drug names, provider names, and clinic-specific terminology in scheduling calls, insurance verification, and voice agents.
  • Multi-speaker clinical conversations — Combine with Speaker Diarization for provider/patient separation in telehealth, therapy documentation, and clinical settings.

Combine with other features

Medical Mode works alongside other transcription features. You can combine it with:

  • Speaker Diarization to identify who said what in clinical conversations
  • Keyterms Prompting to further boost accuracy for specific medical terms unique to your use case
  • PII Redaction to redact sensitive patient information from transcripts
Python
Python SDK
JavaScript
JavaScript SDK
1data = {
2 "audio_url": "<YOUR_AUDIO_URL>",
3 "speech_models": ["universal-3-pro", "universal-2"],
4 "language_detection": True,
5 "domain": "medical-v1",
6 "speaker_labels": True,
7 "keyterms_prompt": ["Lisinopril", "Metformin", "Humalog"]
8}

HIPAA compliance

AssemblyAI offers a Business Associate Agreement (BAA) for customers who need to process Protected Health Information (PHI). AssemblyAI is SOC 2 Type 2, ISO 27001:2022, and PCI DSS v4.0 certified. Medical Mode does not change existing data handling or retention policies.

For BAA setup or enterprise pricing, contact our sales team.