AssemblyAI Documentation

Build with our leading Speech AI models

Industry-leading models on a developer-first API

Your AI product strategy depends on the foundation that powers it. Make sure you build on the best.

Quickstart

Transcribe an audio file

Learn how to transcribe audio files with our SDK

Transcribe streaming audio

Learn how to transcribe live audio from a microphone

Apply LLMs to audio

Learn how to analyze audio content with LLMs

Cookbooks

Get started quickly with our use-case-specific cookbooks

Products

Speech-to-Text

Models for converting audio files, video files, and live speech into text.

LeMUR

LeMUR is a framework for applying Large Language Models (LLMs) to spoken data.

Audio Intelligence

Models for interpreting audio for business and personal workflows.

