Word-level timestamps

The response also includes an array with information about each word:

1import assemblyai as aai
2
3aai.settings.api_key = "<YOUR_API_KEY>"
4
5# audio_file = "./local_file.mp3"
6audio_file = "https://assembly.ai/wildfires.mp3"
7
8config = aai.TranscriptionConfig()
9
10transcript = aai.Transcriber().transcribe(audio_file, config)
11
12for word in transcript.words:
13 print(f"Word: {word.text}, Start: {word.start}, End: {word.end}, Confidence: {word.confidence}")

API Reference