Create Chapter Summaries with LeMURs Custom Text Input Parameter | AssemblyAI

In this guide, we’ll show you how to use AssemblyAI’s LeMUR (Leveraging Large Language Models to Understand Recognized Speech) framework to process an audio file and summarize it into chapters by sending in the timestamped transcript via LeMUR’s input_text parameter.

Calling LeMUR using transcript_ids is preferred as default. Depending on your use case, you can alternatively use the input_text parameter to call LeMUR with custom formatted transcript data including edited transcripts, speaker-labelled transcripts and more.

Quickstart

1 import assemblyai as aai
2 
3 aai.settings.api_key = "YOUR_API_KEY"
4 transcriber = aai.Transcriber()
5 
6 transcript = transcriber.transcribe(
7     "https://github.com/AssemblyAI-Examples/audio-examples/raw/main/20230607_me_canadian_wildfires.mp3"
8 )
9 
10 paragraphs = transcript.get_paragraphs()
11 combined_paragraphs = []
12 step = 2  # Adjust as needed if you want combined paragraphs to be shorter or longer in length.
13 
14 # Combine paragraphs into groups, finding the appropriate timestamps and combining all their text into one string.
15 for i in range(0, len(paragraphs), step):
16     paragraph_group = paragraphs[i : i + step]
17     start = paragraph_group[0].start
18     end = paragraph_group[-1].end
19     text = ""
20     for paragraph in paragraph_group:
21         text += f"{paragraph.text} "
22     combined_paragraphs.append(f"Paragraph: {text} Start: {start} End: {end}")
23 
24 results = []
25 
26 for paragraph in combined_paragraphs:
27     result = aai.Lemur().task(
28         prompt="Summarize this text as a whole and provide start and end timestamps.",
29         input_text=paragraph,
30         final_model=aai.LemurModel.claude3_5_sonnet,
31     )
32     results.append(result.response)
33 
34 for result in results:
35     print(f"{result}\n")

Get Started

Before we begin, make sure you have an AssemblyAI account and an API key. You can sign up for an account and get your API key from your dashboard.

LeMUR features are currently only available to paid users, at two pricing tiers: LeMUR and LeMUR Basic. See pricing for more details.

Step-by-Step Instructions

First, let’s install the AssemblyAI SDK.

$ pip install -U assemblyai

Import the assemblyai package and set your API key.

1 import assemblyai as aai
2 
3 aai.settings.api_key = "YOUR_API_KEY"

Use the Transcriber object’s transcribe method and parse the audio file URL path as a parameter. The transcribe method will save the results of the transcription to the Transcriber object’s transcript attribute.

1 transcriber = aai.Transcriber()
2 
3 transcript = transcriber.transcribe(
4     "https://github.com/AssemblyAI-Examples/audio-examples/raw/main/20230607_me_canadian_wildfires.mp3"
5 )

Next we’ll use the SDK to fetch all of the paragraphs generated out of this transcript and combine them into groups. We set a step variable that controls how many paragraphs we combine into one overall paragraph to help LeMUR have more context to create better summaries.

We also extract the appropriate start and end timestamps, and save all of our combined paragraphs in string form to send into LeMUR in a later step.

1 paragraphs = transcript.get_paragraphs()
2 combined_paragraphs = []
3 step = 2  # Adjust as needed if you want combined paragraphs to be shorter or longer in length.
4 
5 # Combine paragraphs into groups, finding the appropriate timestamps and combining all their text into one string.
6 for i in range(0, len(paragraphs), step):
7     paragraph_group = paragraphs[i : i + step]
8     start = paragraph_group[0].start
9     end = paragraph_group[-1].end
10     text = ""
11     for paragraph in paragraph_group:
12         text += f"{paragraph.text} "
13     combined_paragraphs.append(f"Paragraph: {text} Start: {start} End: {end}")

Now we’ll use LeMUR’s task endpoint in conjuction with the input_text parameter to send in all of our combined_paragraphs to create summaries for each one.

The summary for each paragraph then gets saved to a results array so we can output all of them at the same time.

1 results = []
2 
3 for paragraph in combined_paragraphs:
4     result = aai.Lemur().task(
5         prompt="Summarize this text as a whole and provide start and end timestamps.",
6         input_text=paragraph,
7         final_model=aai.LemurModel.claude3_5_sonnet,
8     )
9     results.append(result.response)
10 
11 for result in results:
12     print(f"{result}\n")

The output will look similar to the example below.

Summary:
This transcript discusses the widespread impact of Canadian wildfires on air quality across the United States. Smoke from these fires is causing hazy conditions and air quality alerts in multiple states, prompting warnings to stay indoors in some areas. Peter DeCarlo, an environmental health expert from Johns Hopkins University, explains that dry conditions and specific weather patterns are channeling the smoke southward, affecting the mid-Atlantic and Northeast regions.
Start timestamp: 240
End timestamp: 60890
Summary:
The transcript discusses the unhealthy air quality in Baltimore due to high levels of particulate matter. These microscopic particles can affect respiratory, cardiovascular, and neurological systems. The concentration of particulate matter has reached dangerous levels, measuring 150 micrograms per cubic meter, which is 10 times higher than the annual average and 4 times higher than the recommended 24-hour average.
Start timestamp: 62270
End timestamp: 113214
Summary:
The text discusses the health impacts of high levels of air pollution, likely due to smoke from wildfires. It explains that the concentration of particles in the air is much higher than usual, leading to various health problems. The most vulnerable groups are identified as children, the elderly, and those with pre-existing health conditions, particularly respiratory or heart issues. The situation is severe enough that outdoor activities are being canceled in places like New York City, despite it being early summer.
Start timestamp: 113342
End timestamp: 158870
Summary:
The text discusses air quality issues related to smoke from wildfires. It mentions that some areas, like New York, are experiencing higher concentrations of smoke, but this will change as the air moves. The impact will vary across different areas over the next few days. The speaker doesn't expect concentrations to increase significantly even if more fires start. The duration of the smoke's impact on the US depends on weather system changes, while the fires themselves are expected to continue burning for some time.
Start timestamp: 162090
End timestamp: 203856
Summary:
The transcript discusses the impact of weather systems on smoke from wildfires affecting the Mid-Atlantic and Northeast regions. It predicts that changing weather patterns will soon push the smoke away from these areas. The speaker also addresses the connection between climate change and increased wildfires, suggesting that such air quality issues may become more frequent in the future, particularly in the western United States.
Start timestamp: 203968
End timestamp: 258010
Summary:
Peter DeCarlo, an associate professor at Johns Hopkins University, discusses how climate change is likely to increase the frequency of extreme weather events in the eastern United States. He suggests that while such events are currently unusual for the region, they may become more common in the future due to climate change.
Start timestamp: 258130
End timestamp: 280290