Announcing the AssemblyAI integration for LiveKit
LiveKit allows you to build real-time audio and video applications - now you can build with AssemblyAI's Streaming Speech-to-Text in LiveKit.
Senior Developer Educator
Announcements
LiveKit allows you to build real-time audio and video applications - now you can build with AssemblyAI's Streaming Speech-to-Text in LiveKit.
Tutorial
LiveKit allows you to build real-time audio and video applications - learn how to add real-time Speech-to-Text to your LiveKit application in this tutorial.
Industry
Universal-2 is solving problems in Conversational Intelligence by optimizing Speech-to-Text for real-world use cases
Tutorials
Zoom allows you to record each participant's audio track separately. Learn how to combine this with AssemblyAI's multichannel transcription for accurate meeting transcripts.
Case Studies
Learn how we built our AI data Lakehouse to allow for rapid research iteration while maintaining cohesive, secure, and deduplicated datasets.
Tutorials
Learn how to set up a Google Cloud project to transcribe both local and remote audio files using Google's Speech-to-Text API and Python
Tutorials
Learn how to make a free, GPU-powered Whisper API for transcribing audio files
Tutorials
Learn how to use Python to perform speaker diarization on audio and video files to identify "who said what when"
Industry
Learn the differences between speaker diarization and speaker recognition, as well as speaker verification and speaker identification in audio analysis
Deep Learning
Microsoft's Florence-2 is a foundational image model that can perform almost every common task in computer vision. Learn how Florence-2 works and how to use it in this guide.
Announcements
Announcing several improvements to our Speaker Diarization service, yielding a more accurate model that's available in more languages.
Tutorials
Modern AI models make it easy to automatically detect the presence of sensitive topics in speech data. Learn how to perform configurable content moderation with Python in this tutorial.
Tutorials
Learn how to filter profanity out of audio and video files with fewer than 10 lines of code in this tutorial
Tutorials
In this tutorial, we’ll learn how to automatically redact Personal Identifiable Information (PII) from audio and video files in 5 minutes using Python and AssemblyAI.
Tutorials
Learn how to transcribe a phone call in real-time using Python, AssemblyAI, ngrok, and Twilio
Announcements
We’re excited to introduce major improvements to our API’s inference latency, with the majority of audio files now completing in well under 45 seconds regardless of audio duration.
Tutorials
Learn how to automatically extract insights from customer calls with Large Language Models (LLMs) and Python.
Tutorials
In this tutorial, we will learn how to automatically determine video sections, how to generate section titles with LLMs, and how to format the information for YouTube chapters.
Tutorials
Learn how to perform real-time transcription on audio streams using Python in this tutorial.
Deep Learning
How does OpenAI's groundbreaking DALL-E 2 model actually work? Check out this detailed guide to learn the ins and outs of DALL-E 2.
Tutorials
Retrieval Augmented Generation (RAG) allows you to add relevant documents as context when querying LLMs. Learn how to perform RAG on audio data using LangChain and Chroma in this tutorial.
Tutorials
In this tutorial, we'll learn how to get Zoom transcripts using the Zoom API using Python.
Tutorials
Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.
Tutorials
In this tutorial, we’ll learn how to build an application that automatically summarizes a lecture and lets you ask questions about the lecture material.
Deep Learning
RLHF is the key method used to train AI assistants like ChatGPT, but it has strong limitations and can produce harmful outputs. RLAIF improves upon RLHF by using AI feedback. Learn the differences between the two methods and what these differences mean in practice in this guide.