Multichannel Transcription | AssemblyAI

Supported Languages, Regions, and Models

Multichannel transcription is supported for all languages, regions, and models.

If you have a multichannel audio file with multiple speakers, you can transcribe each channel separately.

The response includes an audio_channels property with the number of channels, and an utterances property containing a list of turn-by-turn utterances.

Each utterance identifies the channel it came from. Channel numbering starts at 1.

Additionally, each word in the words array contains the channel identifier.
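As a rough illustration of how the per-word channel identifier can be used, the sketch below regroups words into one transcript string per channel. It works on a hand-written dictionary rather than a live API response: the sample data is invented, and the key names ("audio_channels", "utterances", "words", "channel", "text") are assumptions based on the description above, so check them against an actual response before relying on them.

```python
from collections import defaultdict

# Hypothetical sample shaped like the response described above.
# Key names and values are assumptions for illustration only.
sample_response = {
    "audio_channels": 2,
    "utterances": [
        {
            "channel": "1",
            "text": "Hello there.",
            "words": [
                {"text": "Hello", "channel": "1"},
                {"text": "there.", "channel": "1"},
            ],
        },
        {
            "channel": "2",
            "text": "Hi.",
            "words": [{"text": "Hi.", "channel": "2"}],
        },
    ],
}


def text_by_channel(response):
    """Join the words of every utterance into one string per channel."""
    words = defaultdict(list)
    for utterance in response["utterances"]:
        for word in utterance["words"]:
            words[word["channel"]].append(word["text"])
    return {channel: " ".join(parts) for channel, parts in words.items()}


print(text_by_channel(sample_response))
```

Grouping by the word-level identifier rather than the utterance-level one keeps the helper usable even if only the words array is available.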

Quickstart

import assemblyai as aai

aai.settings.api_key = "<YOUR_API_KEY>"

# Use either a local file path or a publicly accessible URL.
# audio_file = "./local_file.mp3"
audio_file = "https://assembly.ai/wildfires.mp3"

# Enable multichannel so each audio channel is transcribed separately.
config = aai.TranscriptionConfig(multichannel=True)

transcript = aai.Transcriber(config=config).transcribe(audio_file)

if transcript.status == "error":
    raise RuntimeError(f"Transcription failed: {transcript.error}")

# Print each utterance together with its channel.
for utterance in transcript.utterances:
    print(f"Channel {utterance.speaker}: {utterance.text}")

Multichannel audio increases the transcription time by approximately 25%.

Per-channel diarization

If you have a multichannel audio file where individual channels may contain multiple speakers, you can combine multichannel and speaker_labels to perform diarization within each channel.

When both parameters are enabled:

Channels are labeled numerically (1, 2, 3, etc.)
Speakers within each channel are labeled alphabetically (A, B, C, etc.)
The combined speaker label format is {channel}{speaker} (e.g., “1A”, “1B”, “2A”)

For example, if channel 1 has two speakers and channel 2 has one speaker, the labels would be:

First speaker on channel 1: 1A
Second speaker on channel 1: 1B
First speaker on channel 2: 2A

import assemblyai as aai

aai.settings.api_key = "<YOUR_API_KEY>"

# Use either a local file path or a publicly accessible URL.
# audio_file = "./local_file.mp3"
audio_file = "https://assembly.ai/wildfires.mp3"

# Combine multichannel with speaker_labels to diarize within each channel.
config = aai.TranscriptionConfig(multichannel=True, speaker_labels=True)

transcript = aai.Transcriber(config=config).transcribe(audio_file)

if transcript.status == "error":
    raise RuntimeError(f"Transcription failed: {transcript.error}")

# Speaker labels use the combined {channel}{speaker} format, e.g. "1A".
for utterance in transcript.utterances:
    print(f"Speaker {utterance.speaker}: {utterance.text}")
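Because the combined label packs the channel and the speaker into a single string, downstream code may need to split it again. The helper below is a minimal sketch of one way to do that, assuming labels always follow the {channel}{speaker} format described above (digits followed by uppercase letters). The function name split_speaker_label is hypothetical and not part of the AssemblyAI SDK.

```python
import re

# Digits for the channel, then uppercase letters for the speaker.
# This pattern is an assumption derived from the documented label format.
LABEL_PATTERN = re.compile(r"(\d+)([A-Z]+)")


def split_speaker_label(label):
    """Split a combined label such as "1A" into (channel, speaker)."""
    match = LABEL_PATTERN.fullmatch(label)
    if match is None:
        raise ValueError(f"Unexpected speaker label: {label!r}")
    return int(match.group(1)), match.group(2)


for label in ["1A", "1B", "2A"]:
    channel, speaker = split_speaker_label(label)
    print(f"{label}: channel {channel}, speaker {speaker}")
```

Raising an error on anything that does not match makes it obvious when a transcript was produced without both parameters enabled, since those labels would not have the combined form.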