Filler Words

This page covers using the disfluencies parameter with the Universal-2 model. To learn about verbatim transcription and disfluencies with Universal-3-Pro, see Verbatim transcription and disfluencies.

Global Englishen
Australian Englishen_au
British Englishen_uk
US Englishen_us

Universal-2universal-2

US & EU

The following filler words are removed by default:

  • “um”
  • “uh”
  • “hmm”
  • “mhm”
  • “uh-huh”
  • “ah”
  • “huh”
  • “hm”
  • “m”

If you want to keep filler words in the transcript, you can set the disfluencies to true in the transcription config.

1import assemblyai as aai
2
3aai.settings.api_key = "<YOUR_API_KEY>"
4
5# audio_file = "./local_file.mp3"
6audio_file = "https://assembly.ai/wildfires.mp3"
7
8config = aai.TranscriptionConfig(
9 speech_models=["universal-2"],
10 language_detection=True,
11 disfluencies=True
12)
13
14transcript = aai.Transcriber(config=config).transcribe(audio_file)
15
16if transcript.status == "error":
17 raise RuntimeError(f"Transcription failed: {transcript.error}")
18
19print(transcript.text)