Tutorials

Browse guided tutorials on NLP, Machine Learning, AI, Deep Learning, coding, and related topics.

Using Multichannel and Speaker Diarization
Using Multichannel and Speaker Diarization

Learn how Multichannel transcription and Speaker Diarization work, what their outputs look like, when to use each feature, and how you can implement them.

How to transcribe Zoom participant recordings (multichannel)
How to transcribe Zoom participant recordings (multichannel)

Zoom allows you to record each participant's audio track separately. Learn how to combine this with AssemblyAI's multichannel transcription for accurate meeting transcripts.

JavaScript and Node.js Speech-to-Text
JavaScript and Node.js Speech-to-Text

Learn how to convert audio and video files from speech to text using AssemblyAI's API and JavaScript.

Transcribe audio and video files with Python and Universal-1
Transcribe audio and video files with Python and Universal-1

Learn how to transcribe audio and video files in your Python applications with AssemblyAI's Universal-1 speech recognition model.

How to use Google's Speech-to-Text API to transcribe audio in Python
How to use Google's Speech-to-Text API to transcribe audio in Python

Learn how to set up a Google Cloud project to transcribe both local and remote audio files using Google's Speech-to-Text API and Python

How to build a free Whisper API with GPU backend
How to build a free Whisper API with GPU backend

Learn how to make a free, GPU-powered Whisper API for transcribing audio files

How to convert speech to text in Java
How to convert speech to text in Java

Learn how to convert speech to text in Java using AssemblyAI.

Speech recognition in the browser using Web Speech API
Speech recognition in the browser using Web Speech API

Learn how to set up speech recognition in your browser using the Web Speech API and JavaScript.

Speech-to-Text with Django
Speech-to-Text with Django

Learn how to integrate Speech-to-Text functionality into Django and build an example app.

Speech-to-Text in Ruby using Universal-1
Speech recognition with Ruby using Universal-1

Learn how to transcribe audio and video files in your Ruby applications with AssemblyAI's Universal-1 speech recognition model.

Build a Real-Time AI Voice Bot Using Python, AssemblyAI, and ElevenLabs
Build a Real-Time AI Voice Bot Using Python, AssemblyAI, and ElevenLabs

Learn how to build a real-time AI voice bot using Python, AssemblyAI, OpenAI, and ElevenLabs for transcription, intelligent responses, and voice synthesis.

How to perform Speaker Diarization in Python
How to perform Speaker Diarization in Python

Learn how to use Python to perform speaker diarization on audio and video files to identify "who said what when"

Build a Discord Voice Bot to Add ChatGPT to Your Voice Channel
Build a Discord Voice Bot to Add ChatGPT to Your Voice Channel

Build a sophisticated Discord voice bot that leverages AssemblyAI for speech transcription, OpenAI's GPT-3.5 Turbo AI model for intelligent processing, and ElevenLabs for speech synthesis.

Analyze Audio from Zoom Calls with AssemblyAI and Node.js
Analyze Audio from Zoom Calls with AssemblyAI and Node.js

Learn how to analyze audio from Zoom calls using AssemblyAI and Node.js.

Generate subtitles with AssemblyAI and Zapier
Generate subtitles with AssemblyAI and Zapier

Generate subtitles for your videos using the AssemblyAI app for Zapier.

Get started using Claude 3.5 Sonnet with audio data
Get started using Claude 3.5 Sonnet with audio data

Learn how to use the Claude 3 models with audio and video data in Python.

How to Create a Real-Time Language Translation Service with AssemblyAI and DeepL in JavaScript
How to Create a Real-Time Language Translation Service with AssemblyAI and DeepL in JavaScript

Learn how to translate speech in real-time in JavaScript with AssemblyAI and DeepL.

Create Multi-Lingual Subtitles with AssemblyAI and DeepL
Create Multi-Lingual Subtitles with AssemblyAI and DeepL

Learn how to build a web app in Go that'll use AssemblyAI to transcribe an uploaded video file and generate subtitles.

Build an AI-powered video conferencing app with Next.js and Stream
Build an AI-powered video conferencing app with Next.js and Stream

Learn how to build a Next.js video conferencing app that supports video calls with live transcriptions and an LLM-powered meeting assistant.

How to Do Hotword Detection with Streaming Speech-to-Text and Go
How to Do Hotword Detection with Streaming Speech-to-Text and Go

In this tutorial, you'll learn how to respond to hotwords in voice data using Streaming Speech-to-Text in Go.

JavaScript code to retrieve information from a YouTube video
Transcribe and generate subtitles for YouTube videos with Node.js

Learn how to transcribe YouTube videos and generate SRT subtitles with Node.js and AssemblyAI in this easy-to-follow guide.

JavaScript code to transcribe a video and generate WebVTT subtitles files.
How to Create WebVTT Files for Videos in Node.js

Learn how to create WebVTT subtitle files for videos using Node.js in this easy-to-follow guide.

JavaScript code to transcribe a video and generate SRT subtitles files.
How to Create SRT Files for Videos in Node.js

Learn how to create SRT subtitle files for videos using Node.js in this easy-to-follow guide.

JavaScript code to transcribe audio with PII redaction enabled using the AssemblyAI SDK
Redact Personally Identifiable Information (PII) from audio with Node.js

In this tutorial, you will learn how to automatically redact Personal Identifiable Information from audio and video files with Node.js and the AssemblyAI API.

Code to summarize audio using LeMUR with the AssemblyAI JavaScript SDK.
Summarize audio with LLMs in Node.js

Learn how to automatically summarize audio with Node.js and the AssemblyAI API.