Can I use speaker diarization with Streaming Speech-to-Text?

Yes. Streaming Diarization is supported on all streaming models, including u3-rt-pro, universal-streaming-english, universal-streaming-multilingual, and whisper-rt. Enable it by adding speaker_labels: true to your connection parameters. See our Streaming Diarization documentation to learn more.
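As a minimal sketch of what "adding speaker_labels: true to your connection parameters" could look like when the parameters are passed as a query string on the connection URL: the host below is a placeholder, and `sample_rate` and the helper name `build_streaming_url` are illustrative assumptions, not taken from this page. Only `speaker_labels` comes from the text above. Check the Streaming Diarization documentation for the actual endpoint and parameter set.

```python
from urllib.parse import urlencode

# Assumption: placeholder endpoint, not the real streaming URL.
HYPOTHETICAL_BASE_URL = "wss://streaming.example.com/ws"


def build_streaming_url(base_url: str, **params) -> str:
    """Append connection parameters to a WebSocket URL as a query string.

    Python booleans are lowercased so that True is sent as "true".
    """
    encoded = {
        key: str(value).lower() if isinstance(value, bool) else value
        for key, value in params.items()
    }
    return f"{base_url}?{urlencode(encoded)}"


# speaker_labels is the parameter named in the text; sample_rate is an
# illustrative extra parameter (assumption).
url = build_streaming_url(
    HYPOTHETICAL_BASE_URL,
    sample_rate=16000,
    speaker_labels=True,
)
print(url)
```

The resulting URL would then be handed to whichever WebSocket client or SDK you use to open the streaming session.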

Alternatively, if each speaker is on a separate audio channel (multichannel audio), you can separate speakers by creating a separate streaming session for each channel. See the Multichannel streaming audio section for details on this approach.
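The per-channel approach needs one mono stream per speaker. The sketch below shows one way to split interleaved multichannel audio into per-channel buffers before sending each to its own session. It assumes 16-bit PCM samples in the host's native byte order, and the function name `split_channels` is hypothetical. Opening the sessions themselves is omitted.

```python
from array import array


def split_channels(interleaved: bytes, num_channels: int = 2) -> list[bytes]:
    """Split interleaved 16-bit PCM audio into one mono buffer per channel.

    Assumption: samples are signed 16-bit integers in native byte order
    (little-endian on most machines).
    """
    samples = array("h")  # "h" = signed 16-bit integer
    samples.frombytes(interleaved)
    if len(samples) % num_channels != 0:
        raise ValueError("audio length is not a whole number of frames")
    # Channel c owns samples c, c + num_channels, c + 2 * num_channels, ...
    return [samples[ch::num_channels].tobytes() for ch in range(num_channels)]


# Three stereo frames: left channel carries 1, 2, 3; right carries -1, -2, -3.
stereo = array("h", [1, -1, 2, -2, 3, -3]).tobytes()
left, right = split_channels(stereo, num_channels=2)
```

Each returned buffer can then be streamed to its own session, so every transcript maps to exactly one speaker.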
