Audio Intelligence

Auto Chapters

The Auto Chapters model summarizes audio data over time into chapters. Chapters makes it easy for users to navigate and find specific information.

Each chapter contains the following:

  • Summary
  • One-line gist
  • Headline
  • Start and end timestamps
Auto Chapters and Summarization

You can only enable one of the Auto Chapters and Summarization models in the same transcription.

Quickstart

Enable Auto Chapters by setting AutoChapters to true in the transcription parameters. Punctuate must be enabled to use Auto Chapters (Punctuate is enabled by default).

1using AssemblyAI;
2using AssemblyAI.Transcript;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 AutoChapters = true
11});
12
13foreach (var chapter in transcript.Chapters!)
14{
15 Console.WriteLine($"{chapter.Start}-{chapter.End}: {chapter.Headline}");
16}

Example output

1250-28840: Smoke from hundreds of wildfires in Canada is triggering air quality alerts across US
229610-280340: High particulate matter in wildfire smoke can lead to serious health problems
Auto Chapters Using LeMUR

Check out this cookbook Creating Chapter Summaries for an example of how to leverage LeMUR’s custom text input parameter for chapter summaries.

For the full API reference, see the API reference section on the Auto Chapters page.

Content Moderation

The Content Moderation model lets you detect inappropriate content in audio files to ensure that your content is safe for all audiences.

The model pinpoints sensitive discussions in spoken data and their severity.

Quickstart

Enable Content Moderation by setting ContentSafety to true in the transcription parameters.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 ContentSafety = true
11});
12
13var safetyLabels = transcript.ContentSafetyLabels!;
14
15foreach (var result in safetyLabels.Results)
16{
17 Console.WriteLine(result.Text);
18 Console.WriteLine($"Timestamp: {result.Timestamp.Start} - {result.Timestamp.End}");
19
20 foreach (var label in result.Labels)
21 {
22 Console.WriteLine($"{label.Label} - {label.Confidence} - {label.Severity}");
23 }
24
25 Console.WriteLine();
26}
27
28foreach (var summary in safetyLabels.Summary)
29{
30 Console.WriteLine($"{summary.Value * 100}% confident that the audio contains {summary.Key}");
31}
32
33Console.WriteLine();
34
35foreach (var severitySummary in safetyLabels.SeverityScoreSummary)
36{
37 Console.WriteLine(
38 $"{severitySummary.Value.Low * 100}% confident that the audio contains low-severity {severitySummary.Key}");
39 Console.WriteLine(
40 $"{severitySummary.Value.Medium * 100}% confident that the audio contains medium-severity {severitySummary.Key}");
41 Console.WriteLine(
42 $"{severitySummary.Value.High * 100}% confident that the audio contains high-severity {severitySummary.Key}");
43}

Example output

1Smoke from hundreds of wildfires in Canada is triggering air quality alerts throughout the US. Skylines...
2Timestamp: 250 - 28920
3disasters - 0.8141 - 0.4014
4
5So what is it about the conditions right now that have caused this round of wildfires to...
6Timestamp: 29290 - 56190
7disasters - 0.9217 - 0.5665
8
9So what is it in this haze that makes it harmful? And I'm assuming it is...
10Timestamp: 56340 - 88034
11health_issues - 0.9358 - 0.8906
12
13...
14
1599.42% confident that the audio contains disasters
1692.70% confident that the audio contains health_issues
17
1857.43% confident that the audio contains low-severity disasters
1942.56% confident that the audio contains mid-severity disasters
200.0% confident that the audio contains high-severity disasters
2123.57% confident that the audio contains low-severity health_issues
2230.22% confident that the audio contains mid-severity health_issues
2346.19% confident that the audio contains high-severity health_issues

Adjust the confidence threshold

The confidence threshold determines how likely something is to be flagged as inappropriate content. A threshold of 50% (which is the default) means any label with a confidence score of 50% or greater is flagged.

To adjust the confidence threshold for your transcription, include ContentSafetyConfidence in the transcription parameters.

1// Setting the content safety confidence threshold to 60%.
2var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
3{
4 AudioUrl = "https://assembly.ai/wildfires.mp3",
5 ContentSafety = true,
6 ContentSafetyConfidence = 60
7});

For the full API reference, as well as the supported labels and FAQs, refer to the full Content Moderation page.

Entity Detection

The Entity Detection model lets you automatically identify and categorize key information in transcribed audio content.

Here are a few examples of what you can detect:

  • Names of people
  • Organizations
  • Addresses
  • Phone numbers
  • Medical data
  • Social security numbers

For the full list of entities that you can detect, see Supported entities.

Supported languages

Entity Detection is available in multiple languages. See Supported languages.

Quickstart

Enable Entity Detection by setting EntityDetection to true in the transcription parameters.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 EntityDetection = true
11});
12
13foreach (var entity in transcript.Entities!) {
14 Console.WriteLine(entity.Text);
15 Console.WriteLine(entity.EntityType);
16 Console.WriteLine($"Timestamp: {entity.Start} - ${entity.End}\n");
17}

Example output

1Canada
2location
3Timestamp: 2548 - 3130
4
5the US
6location
7Timestamp: 5498 - 6350
8
9...

For the full API reference, as well as the supported entities and FAQs, refer to the full Entity Detection page.

Key Phrases

The Key Phrases model identifies significant words and phrases in your transcript and lets you extract the most important concepts or highlights from your audio or video file.

Quickstart

Enable Key Phrases by setting AutoHighlights to true in the transcription parameters.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 AutoHighlights = true
11});
12
13foreach (var result in transcript.AutoHighlightsResult!.Results)
14{
15 var timestamps = string.Join(", ", result.Timestamps.Select(timestamp =>
16 $"[Timestamp(start={timestamp.Start}, end={timestamp.End})]"
17 ));
18 Console.WriteLine($"Highlight: {result.Text}, Count: {result.Count}, Rank {result.Rank}, Timestamps: {timestamps}");
19}

Example output

1Highlight: air quality alerts, Count: 1, Rank: 0.08, Timestamps: [Timestamp(start=3978, end=5114)]
2Highlight: wide ranging air quality consequences, Count: 1, Rank: 0.08, Timestamps: [Timestamp(start=235388, end=238838)]
3Highlight: more fires, Count: 1, Rank: 0.07, Timestamps: [Timestamp(start=184716, end=185186)]
4...

For the full API reference and FAQs, refer to the full Key Phrases page.

PII Redaction

The PII Redaction model lets you minimize sensitive information about individuals by automatically identifying and removing it from your transcript.

Personal Identifiable Information (PII) is any information that can be used to identify a person, such as a name, email address, or phone number.

When you enable the PII Redaction model, your transcript will look like this:

  • With hash substitution: Hi, my name is ####!
  • With entity_name substitution: Hi, my name is [PERSON_NAME]!

You can also Create redacted audio files to replace sensitive information with a beeping sound.

Supported languages

PII Redaction is available in multiple languages. See Supported languages.

Redacted properties

PII only redacts words in the text property. Properties from other features may still include PII, such as entities from Entity Detection or summary from Summarization.

Quickstart

Enable PII Redaction by setting RedactPii to true in the transcription parameters.

Use RedactPiiPolicies to specify the information you want to redact. For the full list of policies, see PII policies.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 RedactPii = true,
11 RedactPiiPolicies = [
12 PiiPolicy.PersonName,
13 PiiPolicy.Organization,
14 PiiPolicy.Occupation
15 ]
16});
17
18Console.WriteLine(transcript.Text);

Example output

1Smoke from hundreds of wildfires in Canada is triggering air quality alerts
2throughout the US. Skylines from Maine to Maryland to Minnesota are gray and
3smoggy. And in some places, the air quality warnings include the warning to stay
4inside. We wanted to better understand what's happening here and why, so we
5called ##### #######, an ######### ######### in the ########## ## #############
6###### ### ########### at ##### ####### ##########. Good morning, #########.
7Good morning. So what is it about the conditions right now that have caused this
8round of wildfires to affect so many people so far away? Well, there's a couple
9of things. The season has been pretty dry already, and then the fact that we're
10getting hit in the US. Is because there's a couple of weather systems that ...

Create redacted audio files

In addition to redacting sensitive information from the transcription text, you can also generate a version of the original audio file with the PII “beeped” out.

To create a redacted version of the audio file, set RedactPiiAudio to true in the transcription config. Use RedactPiiAudioQuality to specify the quality of the redacted audio file.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
5{
6 AudioUrl = "https://assembly.ai/wildfires.mp3",
7 RedactPii = true,
8 RedactPiiPolicies = [
9 PiiPolicy.PersonName,
10 PiiPolicy.Organization,
11 PiiPolicy.Occupation
12 ],
13 RedactPiiAudio = true,
14 RedactPiiAudioQuality = RedactPiiAudioQuality.Wav // Optional. Defaults to "Mp3".
15});
16
17var redactionResult = await client.Transcripts.GetRedactedAudioAsync(transcript.Id);
18
19Console.WriteLine($"Status: {redactionResult.Status}, " +
20 $"Redacted audio URL: {redactionResult.RedactedAudioUrl}");

You can also retrieve the redacted audio file as a stream using the GetRedactedAudioFileAsync method. The following code stores the redacted audio file locally as redacted-audio.wav.

1await using var redactedAudioFileStream = await client.Transcripts.GetRedactedAudioFileAsync(transcript.Id);
2await using var fileStream = File.OpenWrite("./redacted_audio.wav");
3redactedAudioFileStream.CopyTo(fileStream);
Supported languages

You can only create redacted audio files for transcriptions in English and Spanish.

Maximum audio file size

You can only create redacted versions of audio files if the original file is smaller than 1 GB.

Example output

1https://s3.us-west-2.amazonaws.com/api.assembly.ai.usw2/redacted-audio/ac06721c-d1ea-41a7-95f7-a9463421e6b1.mp3?AWSAccessKeyId=...

For the full API reference, as well as the supported policies and FAQs, refer to the full PII Redaction page.

Sentiment Analysis

The Sentiment Analysis model detects the sentiment of each spoken sentence in the transcript text. Use Sentiment Analysis to get a detailed analysis of the positive, negative, or neutral sentiment conveyed in the audio, along with a confidence score for each result.

Quickstart

Enable Sentiment Analysis by setting SentimentAnalysis to true in the transcription parameters.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 SentimentAnalysis = true
11});
12
13foreach (var result in transcript.SentimentAnalysisResults!)
14{
15 Console.WriteLine(result.Text);
16 Console.WriteLine(result.Sentiment); // POSITIVE, NEUTRAL, or NEGATIVE
17 Console.WriteLine(result.Confidence);
18 Console.WriteLine($"Timestamp: {result.Start} - {result.End}");
19}

Example output

1Smoke from hundreds of wildfires in Canada is triggering air quality alerts throughout the US.
2SentimentType.negative
30.8181032538414001
4Timestamp: 250 - 6350
5...
Sentiment Analysis Using LeMUR

Check out this cookbook LeMUR for Customer Call Sentiment Analysis for an example of how to leverage LeMUR’s QA feature for sentiment analysis.

Add speaker labels to sentiments

To add speaker labels to each sentiment analysis result, using Speaker Diarization, enable SpeakerLabels in the transcription parameters.

Each sentiment result will then have a Speaker field that contains the speaker label.

1var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
2{
3 AudioUrl = audioUrl,
4 SentimentAnalysis = true,
5 SpeakerLabels = true
6});
7
8// ...
9
10foreach (var result in transcript.SentimentAnalysisResults!)
11{
12 // ...
13 Console.WriteLine(result.Speaker);
14}

For the full API reference and FAQs, refer to the full Sentiment Analysis page.

Summarization

Distill important information by summarizing your audio files.

The Summarization model generates a summary of the resulting transcript. You can control the style and format of the summary using Summary models and Summary types.

Summarization and Auto Chapters

You can only enable one of the Summarization and Auto Chapters models in the same transcription.

Quickstart

Enable Summarization in the transcription parameters. Use SummaryModel and SummaryType to change the summary format.

If you specify one of SummaryModel and SummaryType, then you must specify the other.

The following example returns an informative summary in a bulleted list.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 Summarization = true,
11 SummaryModel = SummaryModel.Informative,
12 SummaryType = SummaryType.Bullets
13});
14
15Console.WriteLine(transcript.Summary);

Example output

1- Smoke from hundreds of wildfires in Canada is triggering air quality alerts throughout the US. Skylines from Maine to Maryland to Minnesota are gray and smoggy. In some places, the air quality warnings include the warning to stay inside.
2- Air pollution levels in Baltimore are considered unhealthy. Exposure to high levels can lead to a host of health problems. With climate change, we are seeing more wildfires. Will we be seeing more of these kinds of wide ranging air quality consequences?
Custom Summaries Using LeMUR

If you want more control of the output format, see how to generate a Custom summary using LeMUR.

For the full API reference, as well as the supported summary models/types and FAQs, refer to the full Summarization page.

Topic Detection

The Topic Detection model lets you identify different topics in the transcript. The model uses the IAB Content Taxonomy, a standardized language for content description which consists of 698 comprehensive topics.

Quickstart

Enable Topic Detection by setting IabCategories to true in the transcription config.

1using AssemblyAI;
2using AssemblyAI.Transcripts;
3
4var client = new AssemblyAIClient("<YOUR_API_KEY>");
5
6var transcript = await client.Transcripts.TranscribeAsync(new TranscriptParams
7{
8 // For local files see our Getting Started guides.
9 AudioUrl = "https://assembly.ai/wildfires.mp3",
10 IabCategories = true
11});
12
13// Get the parts of the transcript that were tagged with topics
14foreach (var result in transcript.IabCategoriesResult!.Results)
15{
16 Console.WriteLine(result.Text);
17 Console.WriteLine($"Timestamp: {result.Timestamp?.Start} - {result.Timestamp?.End}");
18
19 foreach (var label in result.Labels!)
20 {
21 Console.WriteLine($"{label.Label} ({label.Relevance})");
22 }
23}
24
25// Get a summary of all topics in the transcript
26foreach (var summary in transcript.IabCategoriesResult.Summary)
27{
28 Console.WriteLine($"Audio is {summary.Value * 100} relevant to {summary.Key}");
29}

Example output

1Smoke from hundreds of wildfires in Canada is triggering air quality alerts throughout the US. Skylines...
2Timestamp: 250 - 28920
3Home&Garden>IndoorEnvironmentalQuality (0.9881)
4NewsAndPolitics>Weather (0.5561)
5MedicalHealth>DiseasesAndConditions>LungAndRespiratoryHealth (0.0042)
6...
7Audio is 100.0% relevant to NewsAndPolitics>Weather
8Audio is 93.78% relevant to Home&Garden>IndoorEnvironmentalQuality
9...
Topic Detection Using LeMUR

Check out this cookbook Custom Topic Tags for an example of how to leverage LeMUR for custom topic detection.

For the full API reference, as well as the full list of supported topics and FAQs, refer to the full Topic Detection page.