Universal-Streaming
Handshake
Headers
Use your API key for authentication, or alternatively generate a temporary token and pass it via the token query parameter.
Query parameters
A list of words and phrases to improve recognition accuracy for. See Keyterms Prompting for more details.
The maximum amount of silence in milliseconds allowed in a turn before end of turn is triggered. See Turn Detection for configuration details.
The minimum amount of silence in milliseconds required to detect end of turn when confident. See Turn Detection for configuration details.
API token for authentication (if using a temporary token).
The confidence threshold (0.0 to 1.0) for classifying audio frames as silence. Frames with VAD confidence below this value are considered silent. Increase for noisy environments to reduce false speech detection.
The confidence threshold (0.0 to 1.0) to use when determining if the end of a turn has been reached. See Turn Detection for configuration details.
Send
Receive
Receive a formatted turn-based transcription result.