Create speech understanding

curl --request POST \
  --url https://llm-gateway.assemblyai.com/v1/understanding \
  --header 'Authorization: <api-key>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "transcript_id": "12345",
  "speech_understanding": {
    "request": {
      "translation": {
        "target_languages": [
          "es",
          "de"
        ],
        "formal": true,
        "match_original_utterance": true
      }
    }
  }
}
'

{
  "speech_understanding": {
    "request": {
      "translation": {
        "target_languages": [
          "es",
          "de"
        ],
        "formal": true,
        "match_original_utterance": true
      },
      "speaker_identification": {
        "speaker_type": "name",
        "speakers": [
          {
            "name": "Michel Martin",
            "description": "Hosts the program and interviews the guests",
            "company": "NPR",
            "title": "Host Morning Edition"
          },
          {
            "name": "Peter DeCarlo",
            "description": "Answers questions from the interview",
            "company": "Johns Hopkins University",
            "title": "Professor and Vice Chair of Environmental Health and Engineering"
          }
        ]
      },
      "custom_formatting": {
        "date": "mm/dd/yyyy",
        "phone_number": "(xxx)xxx-xxxx",
        "email": "username@domain.com"
      },
      "summarization": {
        "summary_type": "bullets"
      }
    },
    "response": {
      "summarization": {
        "status": "success",
        "effort": "low",
        "summary": [
          {
            "start": 0,
            "end": 1000,
            "bullets": [
              "Host introduces Dr Peter DeCarlo",
              "Asks for a quick intro"
            ]
          },
          {
            "start": 1050,
            "end": 3500,
            "bullets": [
              "Identifies key issues with the climate"
            ]
          }
        ]
      },
      "translation": {
        "status": "success"
      },
      "speaker_identification": {
        "mapping": {
          "A": "Michel Martin",
          "B": "Peter DeCarlo"
        },
        "status": "success"
      },
      "custom_formatting": {
        "status": "success",
        "mapping": {
          "2024-12-25": "12/25/2024",
          "555-1234-5678": "(555)123-45678"
        },
        "formatted_text": "Call me at (555)123-45678 on 12/25/2024",
        "formatted_utterances": [
          {
            "confidence": 0.92,
            "start": 0,
            "end": 2500,
            "text": "Hi, I'm the interviewer. Call me at (555)123-45678 on 12/25/2024",
            "speaker": "interviewer"
          },
          {
            "confidence": 0.95,
            "start": 2500,
            "end": 5000,
            "text": "Thanks! I'll reach out then.",
            "speaker": "candidate"
          }
        ]
      }
    }
  },
  "translated_texts": {
    "es": "Hola, soy el entrevistador. Llámame al cinco cinco cinco uno dos tres cuatro cinco seis siete ocho el veinticinco de diciembre de dos mil veinticuatro. ¡Gracias! Me pondré en contacto entonces.",
    "de": "Hallo, ich bin der Interviewer. Rufen Sie mich an unter fünf fünf fünf eins zwei drei vier fünf sechs sieben acht am fünfundzwanzigsten Dezember zweitausendvierundzwanzig. Danke! Ich werde mich dann melden."
  },
  "utterances": [
    {
      "confidence": 0.92,
      "start": 0,
      "end": 2500,
      "text": "Hi, I'm the interviewer. Call me at five five five one two three four five six seven eight on December twenty fifth twenty twenty four",
      "speaker": "interviewer",
      "translated_texts": {
        "es": "Hola, soy el entrevistador. Llámame al cinco cinco cinco uno dos tres cuatro cinco seis siete ocho el veinticinco de diciembre de dos mil veinticuatro",
        "de": "Hallo, ich bin der Interviewer. Rufen Sie mich an unter fünf fünf fünf eins zwei drei vier fünf sechs sieben acht am fünfundzwanzigsten Dezember zweitausendvierundzwanzig"
      }
    },
    {
      "confidence": 0.95,
      "start": 2500,
      "end": 5000,
      "text": "Thanks! I'll reach out then.",
      "speaker": "candidate",
      "translated_texts": {
        "es": "¡Gracias! Me pondré en contacto entonces.",
        "de": "Danke! Ich werde mich dann melden."
      }
    }
  ],
  "words": []
}

{
  "code": 123,
  "message": "<string>",
  "request_id": "3c90c3cc-0d44-4b50-8888-8dd25736052a",
  "metadata": {
    "errors": [
      "<string>"
    ]
  }
}

POST

understanding

curl --request POST \
  --url https://llm-gateway.assemblyai.com/v1/understanding \
  --header 'Authorization: <api-key>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "transcript_id": "12345",
  "speech_understanding": {
    "request": {
      "translation": {
        "target_languages": [
          "es",
          "de"
        ],
        "formal": true,
        "match_original_utterance": true
      }
    }
  }
}
'

{
  "speech_understanding": {
    "request": {
      "translation": {
        "target_languages": [
          "es",
          "de"
        ],
        "formal": true,
        "match_original_utterance": true
      },
      "speaker_identification": {
        "speaker_type": "name",
        "speakers": [
          {
            "name": "Michel Martin",
            "description": "Hosts the program and interviews the guests",
            "company": "NPR",
            "title": "Host Morning Edition"
          },
          {
            "name": "Peter DeCarlo",
            "description": "Answers questions from the interview",
            "company": "Johns Hopkins University",
            "title": "Professor and Vice Chair of Environmental Health and Engineering"
          }
        ]
      },
      "custom_formatting": {
        "date": "mm/dd/yyyy",
        "phone_number": "(xxx)xxx-xxxx",
        "email": "username@domain.com"
      },
      "summarization": {
        "summary_type": "bullets"
      }
    },
    "response": {
      "summarization": {
        "status": "success",
        "effort": "low",
        "summary": [
          {
            "start": 0,
            "end": 1000,
            "bullets": [
              "Host introduces Dr Peter DeCarlo",
              "Asks for a quick intro"
            ]
          },
          {
            "start": 1050,
            "end": 3500,
            "bullets": [
              "Identifies key issues with the climate"
            ]
          }
        ]
      },
      "translation": {
        "status": "success"
      },
      "speaker_identification": {
        "mapping": {
          "A": "Michel Martin",
          "B": "Peter DeCarlo"
        },
        "status": "success"
      },
      "custom_formatting": {
        "status": "success",
        "mapping": {
          "2024-12-25": "12/25/2024",
          "555-1234-5678": "(555)123-45678"
        },
        "formatted_text": "Call me at (555)123-45678 on 12/25/2024",
        "formatted_utterances": [
          {
            "confidence": 0.92,
            "start": 0,
            "end": 2500,
            "text": "Hi, I'm the interviewer. Call me at (555)123-45678 on 12/25/2024",
            "speaker": "interviewer"
          },
          {
            "confidence": 0.95,
            "start": 2500,
            "end": 5000,
            "text": "Thanks! I'll reach out then.",
            "speaker": "candidate"
          }
        ]
      }
    }
  },
  "translated_texts": {
    "es": "Hola, soy el entrevistador. Llámame al cinco cinco cinco uno dos tres cuatro cinco seis siete ocho el veinticinco de diciembre de dos mil veinticuatro. ¡Gracias! Me pondré en contacto entonces.",
    "de": "Hallo, ich bin der Interviewer. Rufen Sie mich an unter fünf fünf fünf eins zwei drei vier fünf sechs sieben acht am fünfundzwanzigsten Dezember zweitausendvierundzwanzig. Danke! Ich werde mich dann melden."
  },
  "utterances": [
    {
      "confidence": 0.92,
      "start": 0,
      "end": 2500,
      "text": "Hi, I'm the interviewer. Call me at five five five one two three four five six seven eight on December twenty fifth twenty twenty four",
      "speaker": "interviewer",
      "translated_texts": {
        "es": "Hola, soy el entrevistador. Llámame al cinco cinco cinco uno dos tres cuatro cinco seis siete ocho el veinticinco de diciembre de dos mil veinticuatro",
        "de": "Hallo, ich bin der Interviewer. Rufen Sie mich an unter fünf fünf fünf eins zwei drei vier fünf sechs sieben acht am fünfundzwanzigsten Dezember zweitausendvierundzwanzig"
      }
    },
    {
      "confidence": 0.95,
      "start": 2500,
      "end": 5000,
      "text": "Thanks! I'll reach out then.",
      "speaker": "candidate",
      "translated_texts": {
        "es": "¡Gracias! Me pondré en contacto entonces.",
        "de": "Danke! Ich werde mich dann melden."
      }
    }
  ],
  "words": []
}

{
  "code": 123,
  "message": "<string>",
  "request_id": "3c90c3cc-0d44-4b50-8888-8dd25736052a",
  "metadata": {
    "errors": [
      "<string>"
    ]
  }
}

Authorizations

Authorization

string

header

required

Body

application/json

Request body for speech understanding tasks.

transcript_id

string

required

The ID of the transcript to process.

speech_understanding

object

required

The speech understanding task to perform. Supports Translation, Speaker Identification, and Custom Formatting. Click into the request object below to see the available options.

Show child attributes

Response

Successful response containing the speech understanding results.

Option 1
Option 2
Option 3
Option 4

speech_understanding

object

Show child attributes

translated_texts

object

Translated text keyed by language code (e.g., {"es": "Texto traducido"})

Show child attributes

utterances

object[]

Array of utterances with translations (when match_original_utterance is true)

Show child attributes

words

object[]

Create a chat completion

Generate streaming token

⌘I