Our Speech-to-Text model enables you to transcribe pre-recorded audio into written text.

On top of the transcription, you can enable other features and models, such as Speaker Diarization, by adding additional parameters to the same transcription request.

Choose Model Class

You can use the optional speech_model parameter to specify the class of models. To learn more, see Select the speech model.

Quickstart

The following example transcribes an audio file from a local file.

1 import assemblyai as aai
2 
3 aai.settings.api_key = "<YOUR_API_KEY>"
4 
5 audio_file = "./local_file.mp3"
6 # audio_file = "https://assembly.ai/wildfires.mp3"
7 
8 transcript = aai.Transcriber().transcribe(audio_file)
9 
10 if transcript.status == "error":
11   raise RuntimeError(f"Transcription failed: {transcript.error}")
12 
13 print(transcript.text)

Example output

1 Smoke from hundreds of wildfires in Canada is triggering air quality alerts
2 throughout the US. Skylines from Maine to Maryland to Minnesota are gray and
3 smoggy. And...

API reference

You can find the API reference here.