Patrick Loeber

Senior Developer Advocate

Using Multichannel and Speaker Diarization
Using Multichannel and Speaker Diarization

Learn how Multichannel transcription and Speaker Diarization work, what their outputs look like, when to use each feature, and how you can implement them.

Universal-2 vs OpenAI's Whisper: Comparing Speech-to-Text models in real-world use cases
Universal-2 vs OpenAI's Whisper: Comparing Speech-to-Text models in real-world use cases

Comparing Universal-2, Universal-1, and Whispers models at proper noun and alphanumeric detection tasks, text formatting, and hallucinations.

Speech recognition in the browser using Web Speech API
Speech recognition in the browser using Web Speech API

Learn how to set up speech recognition in your browser using the Web Speech API and JavaScript.

Introducing the AssemblyAI integration for Langflow
Introducing the AssemblyAI integration for Langflow

AssemblyAI is now integrated with Langflow, a powerful low-code platform for building generative AI applications.

Speech-to-Text with Django
Speech-to-Text with Django

Learn how to integrate Speech-to-Text functionality into Django and build an example app.

How to identify languages in audio data using Python
How to identify languages in audio data using Python

Learn how to use Python to automatically identify languages in audio files.

The Best Audio File Formats for Speech-to-Text: A Guide
The Best Audio File Formats for Speech-to-Text: A Guide

Learn about the best audio and video formats for speech-to-text applications, as well as best practices for audio post-processing techniques.

Get started using Claude 3.5 Sonnet with audio data
Get started using Claude 3.5 Sonnet with audio data

Learn how to use the Claude 3 models with audio and video data in Python.

Build an AI-powered video conferencing app with Next.js and Stream
Build an AI-powered video conferencing app with Next.js and Stream

Learn how to build a Next.js video conferencing app that supports video calls with live transcriptions and an LLM-powered meeting assistant.

A New Free Python Course to Build Real-World Audio AI Apps
A New Free Python Course to Build Real-World Audio AI Apps

Learn how to build a real-world application working with audio data in this free video course.

How to do Speech-To-Text with Go
How to do Speech-To-Text with Go

This article shows how Speech Recognition can be integrated into your Go application in only a few lines of code.

How to Create VTT Files for Videos in Python
How to Create VTT Files for Videos in Python

Learn how to create VTT subtitle files for videos using Python in this easy-to-follow guide.

How to Create SRT Files for Videos in Python
How to Create SRT Files for Videos in Python

Learn how to create SRT subtitle files for videos using Python in this easy-to-follow guide.

How to Get YouTube Video Transcripts
How to Get YouTube Video Transcripts

Learn how to get the transcript of a YouTube video with this easy-to-follow Python tutorial. We'll use yt-dlp to download YouTube videos, and automatically transcribe them with AssemblyAI.

Key phrase detection in audio files using Python
Key phrase detection in audio files using Python

Learn how to identify key phrases and important words using Python and AssemblyAI.

How to use audio data in LlamaIndex with Python
How to use audio data in LlamaIndex with Python

Learn how to incorporate audio files into LlamaIndex and build an LLM-powered query engine in this step-by-step tutorial.

How to use audio data in LangChain with Python
How to use audio data in LangChain with Python

Learn how to incorporate audio files into LangChain and build an LLM app on top of spoken data in this step-by-step tutorial.

Introducing the AssemblyAI Creators Program
Introducing the AssemblyAI Creators Program

We are excited to announce our AssemblyAI Creators program, a community of creators in the AI space who grow together and give back to the developer community.

How To Convert Voice To Text Using JavaScript
How To Convert Voice To Text Using JavaScript

This article shows how Real-Time Speech Recognition from a microphone recording can be integrated into your JavaScript application in only a few lines of code.

Learn How To Get Started with OpenAI API and GPT-3
Learn How To Get Started with OpenAI API and GPT-3

Learn how to get started with the OpenAI API and GPT-3 in Python.

How to Build a Python Project that Summarizes Your Lectures
How to Build a Python Project that Summarizes Your Lectures

Learn how to build a Python app that lets you study faster by automatically summarizing lectures!

Reinforcement Learning With (Deep) Q-Learning Explained
Reinforcement Learning With (Deep) Q-Learning Explained

In this video, we learn about Reinforcement Learning and (Deep) Q-Learning.

Unsupervised Machine Learning For Beginners
Unsupervised Machine Learning For Beginners

In this video tutorial, we learn about Unsupervised Machine Learning.

Supervised Machine Learning For Beginners
Supervised Machine Learning For Beginners

In this video, we learn about Supervised Machine Learning, which is arguably the most important type of Machine Learning.

Recurrent Neural Networks (RNNs) Explained
Recurrent Neural Networks (RNNs) Explained

In this video, we learn what Recurrent Neural Networks (RNNs) are and how they work.