Voice AI infrastructure for builders

Trusted by millions of developers, AssemblyAI delivers industry-leading pre-recorded and real-time speech-to-text and voice agent APIs.

Universal-3.5 Pro Realtime

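import os

import assemblyai as aai
from assemblyai.streaming.v3 import (
    StreamingClient,
    StreamingClientOptions,
    StreamingEvents,
    StreamingParameters,
    TurnEvent,
)

# Assumption: the API key is read from an ASSEMBLYAI_API_KEY environment variable.
API_KEY = os.environ["ASSEMBLYAI_API_KEY"]


def on_turn(self, event: TurnEvent):
    # Called for each Turn event: prints the transcript so far
    # and whether the speaker's turn has ended.
    print(f"{event.transcript} ({event.end_of_turn})")


client = StreamingClient(
    StreamingClientOptions(
        api_key=API_KEY,
        api_host="streaming.assemblyai.com",
    )
)

# Register the handler before connecting.
client.on(StreamingEvents.Turn, on_turn)

client.connect(
    StreamingParameters(
        speech_model="u3-rt-pro",
        sample_rate=16000,
        continuous_partials=True,
    )
)

try:
    # Stream microphone audio at the same sample rate passed to connect().
    client.stream(aai.extras.MicrophoneStream(sample_rate=16000))
finally:
    # Always end the session, even if streaming raises.
    client.disconnect(terminate=True)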
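
The handler above prints every Turn event as it arrives. The following is a minimal sketch of one way to keep only finished turns, assuming events expose the `transcript` and `end_of_turn` fields used above and that each partial event carries the full text of the turn so far. `FakeTurn` and `TurnCollector` are hypothetical names for illustration; `FakeTurn` stands in for the SDK's `TurnEvent` so the sketch runs without a connection or an API key.

```python
from dataclasses import dataclass, field


@dataclass
class FakeTurn:
    """Hypothetical stand-in for TurnEvent; only the two fields used above."""
    transcript: str
    end_of_turn: bool


@dataclass
class TurnCollector:
    """Keeps the latest partial text and a list of completed turns."""
    partial: str = ""
    finished: list = field(default_factory=list)

    def handle(self, event) -> None:
        if event.end_of_turn:
            # The turn is complete: store it and clear the partial buffer.
            self.finished.append(event.transcript)
            self.partial = ""
        else:
            # Still mid-turn: remember the most recent partial transcript.
            self.partial = event.transcript


collector = TurnCollector()
for ev in [
    FakeTurn("hello", False),
    FakeTurn("hello world", True),
    FakeTurn("how are", False),
]:
    collector.handle(ev)

print(collector.finished)
print(collector.partial)
```

With the real client, a handler registered through `client.on(StreamingEvents.Turn, ...)` could forward each event to `collector.handle`.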

36%

improvement in close rate

See case study

“The new Universal-3.5 Pro speech model from AssemblyAI is best so far in terms of accuracy, latency, and language switching.”

80%

increase in customer satisfaction

“Assembly has saved us countless hours managing models, and provided exceptional accuracy.”

Platform

Everything you need to build with Voice AI.

Transcribe speech with unmatched accuracy
Understand context, intent, and meaning
Power agentic workflows in real time
Scale securely, from MVP to production

Pre-recorded Speech-to-Text API

Get clean, customizable transcripts in 99 languages with industry-leading accuracy and natural language prompting.

Universal-3.5 Pro, Universal-2

Realtime Speech-to-Text API

Stream transcripts in real time with async-level accuracy, so your agent responds fast without mishearing the user.

Universal-3.5 Pro Realtime, Universal-Streaming, Universal-Streaming Multilingual

Sync Speech-to-Text API

Send a short clip and get a finished, flagship-accuracy transcript back in the same response — no polling, no WebSocket.

Voice Agent API

Build production-ready voice agents with built-in turn detection and interruption handling, so you ship fast without the complexity.

Speech Understanding API

Go beyond basic transcription. Extract speaker ID, sentiment, chapters, and summaries from a single API call.

Guardrails

Redact PII and moderate content inline on audio and transcripts, so sensitive data never hits your logs or your LLM.
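
To make the redact-before-logging idea concrete, here is a small sketch that masks two kinds of PII in a transcript string before it is printed or stored. It is not how Guardrails works internally; the regular expressions, the `PATTERNS` table, and the `redact` function are assumptions chosen for illustration, and real PII detection needs much broader coverage than two patterns.

```python
import re

# Illustrative patterns only (assumption): one for emails, one for
# US-style phone numbers written as 3-3-4 digit groups.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}


def redact(text: str) -> str:
    """Replace each match with a bracketed label such as [EMAIL]."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


transcript = "Reach me at jane.doe@example.com or 555-867-5309."
safe = redact(transcript)
print(safe)  # Reach me at [EMAIL] or [PHONE].
```

Only `safe` would then be passed on to logs or to an LLM, so the raw values never leave the redaction step.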

LLM Gateway

Route between every LLM from one endpoint with built-in fallback, so you swap models and survive outages without touching your code.

GPT, Claude, Gemini, Community Models
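
As a rough picture of what built-in fallback means, the sketch below tries a list of model backends in order and returns the first successful reply. It illustrates the general pattern only, not the LLM Gateway's API or implementation; `complete_with_fallback`, `flaky_model`, and `backup_model` are hypothetical names, and the two model functions are stand-ins rather than real provider calls.

```python
from typing import Callable, Sequence


def complete_with_fallback(
    prompt: str,
    providers: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try each provider in order; return (name, reply) from the first success."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # a real router would catch narrower errors
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


# Hypothetical stand-ins for model backends.
def flaky_model(prompt: str) -> str:
    raise TimeoutError("upstream timed out")


def backup_model(prompt: str) -> str:
    return f"echo: {prompt}"


used, reply = complete_with_fallback(
    "Summarize the call",
    [("primary", flaky_model), ("backup", backup_model)],
)
print(used, reply)
```

When the routing lives behind a single endpoint, the calling code stays the same whichever backend ends up answering.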

Explore the platform

Models, APIs, and infrastructure in one place.

Learn more

Infrastructure you can build a business on

Global redundancy, enterprise-grade uptime, and 2 million hours of audio processed every day. The Voice AI infrastructure your product can depend on at any scale.

Explore Enterprise

Pricing that doesn't turn against you at scale

No concurrency limits, no throttles, no forced commitments — the same platform scales from your first 100 hours to 400,000 a month.

See pricing

Less time configuring tools, more time shipping

AssemblyAI lets you choose which parts of the Voice AI stack you need, build quickly, and scale what works — with an Applied AI team that becomes an extension of yours.

Explore product overview

Playground

We're not playing around, but you can

Put our Voice AI models to the test in our no-code playground.

Try it out