AssemblyAI | AI Tutorials

Retrieval Augmented Generation on audio data with LangChain and Chroma

Tutorials

Sep 26, 2023

Retrieval Augmented Generation on audio data with LangChain and Chroma

Retrieval Augmented Generation (RAG) allows you to add relevant documents as context when querying LLMs. Learn how to perform RAG on audio data using LangChain and Chroma in this tutorial.

Ryan O'Connor

Senior Developer Educator

Lemur

Sep 20, 2023

Build a podcast question & answer application using Rivet and AssemblyAI

Build an AI application that can answer questions about podcast episodes using Rivet's AI IDE and AssemblyAI's Rivet plugin for audio transcription and LeMUR.

Niels Swimberghe

Developer Educator

How to get Zoom Transcripts with the Zoom API

Tutorials

Sep 14, 2023

How to get Zoom Transcripts with the Zoom API

In this tutorial, we'll learn how to get Zoom transcripts using the Zoom API using Python.

Ryan O'Connor

Senior Developer Educator

Convert Speech to Text in Python in 5 Minutes

Tutorials

Sep 6, 2023

Convert Speech to Text in Python in 5 Minutes

Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.

Ryan O'Connor

Senior Developer Educator

How to use audio data in LangChain with Python

Tutorials

Aug 31, 2023

How to use audio data in LangChain with Python

Learn how to incorporate audio files into LangChain and build an LLM app on top of spoken data in this step-by-step tutorial.

Patrick Loeber

Senior Developer Advocate

How to build an interactive lecture summarization app

Tutorials

Aug 31, 2023

How to build an interactive lecture summarization app

In this tutorial, we’ll learn how to build an application that automatically summarizes a lecture and lets you ask questions about the lecture material.

Ryan O'Connor

Senior Developer Educator

How to integrate spoken audio into LangChain.js using AssemblyAI

Tutorials

Aug 15, 2023

How to integrate spoken audio into LangChain.js using AssemblyAI

Learn how to apply LLMs to spoken audio with AssemblyAI's new integration for LangChain.js, using TypeScript and Node.js.

Niels Swimberghe

Developer Educator

Automatic summarization with LLMs in Python

Tutorials

Aug 15, 2023

Automatic summarization with LLMs in Python

Learn how to perform automatic summarization with Python using LLMs in this easy-to-follow tutorial.

Ryan O'Connor

Senior Developer Educator

Python Speech-to-Text with Punctuation, Casing, and Formatting

Tutorials

May 25, 2023

Python Speech-to-Text with Punctuation, Casing, and Formatting

Learn how to transcribe audio and video files into text that contains punctuation, casing and formatting using the AssemblyAI Python SDK.

Matt Makai

VP of Developer Relations & Experience

Build a free Stable Diffusion app with a GPU backend

Tutorials

Jan 19, 2023

Build a free Stable Diffusion app with a GPU backend

In this tutorial, we'll build a completely free-to-use web app that allows you to generate images with Stable Diffusion (on GPU) in seconds.

Ryan O'Connor

Senior Developer Educator

Stable Diffusion in Keras - A Simple Tutorial

Tutorials

Nov 30, 2022

Stable Diffusion in Keras - A Simple Tutorial

Learn how to generate and inpaint images with Stable Diffusion in Keras, and how XLA can boost Stable Diffusion's inference speed, in this easy-to-follow guide.

Ryan O'Connor

Senior Developer Educator

An Introduction to Poisson Flow Generative Models

Deep Learning

Oct 26, 2022

An Introduction to Poisson Flow Generative Models

Poisson Flow Generative Models (PFGMs) are a new type of generative Deep Learning model, taking inspiration from physics much like Diffusion Models. Learn the theory behind PFGMs and how to generate images with them in this easy-to-follow guide.

Ryan O'Connor

Senior Developer Educator

How to Run OpenAI’s Whisper Speech Recognition Model

Tutorials

Sep 22, 2022

How to Run OpenAI’s Whisper Speech Recognition Model

OpenAI's Whisper model can perform Speech Recognition on a wide selection of languages. We'll learn how to run Whisper before checking out a performance analysis in this simple guide.

Ryan O'Connor

Senior Developer Educator

Getting Started with Hugging Face's Gradio

Tutorials

Sep 21, 2022

Getting Started with Hugging Face's Gradio

Gradio allows you to easily create shareable apps using only Python. Learn how to build a dashboard for Audio Intelligence Analysis in this easy-to-follow tutorial.

Ryan O'Connor

Senior Developer Educator

How to Automatically Transcribe Zoom Calls in Real-Time

Tutorials

Aug 31, 2022

How to Automatically Transcribe Zoom Calls in Real-Time

Learn how to automatically transcribe a Zoom meeting in real-time with this simple tutorial.

Ryan O'Connor

Senior Developer Educator

How to Run Stable Diffusion Locally to Generate Images

Tutorials

Aug 23, 2022

How to Run Stable Diffusion Locally to Generate Images

Stable Diffusion is a text-to-image model with recently-released open-sourced weights. Learn how to generate an image of a scene given only a description of it in this simple tutorial.

Ryan O'Connor

Senior Developer Educator

MinImagen - Build Your Own Imagen Text-to-Image Model

Deep Learning

Aug 17, 2022

MinImagen - Build Your Own Imagen Text-to-Image Model

Text-to-Image models have made great strides this year, from DALL-E 2 to the more recent Imagen model. In this tutorial learn how to build a minimal Imagen implementation - MinImagen.

Ryan O'Connor

Senior Developer Educator

Tutorials

Jun 6, 2022

Getting Started with ESPnet

ESPnet is the premier end-to-end, open-source speech processing toolkit. This easy-to-follow guide will help you get started using ESPnet for Speech Recognition.

Ryan O'Connor

Senior Developer Educator

How to Build a JavaScript Audio Transcript Application

Tutorials

Apr 12, 2022

How to Build a JavaScript Audio Transcript Application

Learn how to build a JavaScript Audio Transcript application using Node.js and the AssemblyAI JavaScript SDK with this step-by-step beginner's guide.

Stefan Rosanitsch

Contributor

Tutorials

Apr 7, 2022

MediaPipe for Dummies

With just a few lines of code, MediaPipe allows you to incorporate State-of-the-Art Machine Learning capabilities into your applications. Learn about MediaPipe and how to use its simple APIs in this beginner's guide.

Ryan O'Connor

Senior Developer Educator

JavaScript Text-to-Speech - The Easy Way

Tutorials

Apr 4, 2022

JavaScript Text-to-Speech - The Easy Way

Learn how to build a simple JavaScript Text-to-Speech application using JavaScript's Web Speech API in this step-by-step beginner's guide.

Stefan Rosanitsch

Contributor

Tutorials

Mar 30, 2022

React Text to Speech - Simplified!

Learn how to create a simple React Text-to-Speech application with this step-by-step beginner's guide.

Stefan Rosanitsch

Contributor

A Beginner's Guide to TorchStudio, The PyTorch IDE

Tutorials

Mar 28, 2022

A Beginner's Guide to TorchStudio, The PyTorch IDE

Learn how to build, train, and compare models with TorchStudio - the IDE built specifically for PyTorch.

Ryan O'Connor

Senior Developer Educator

Tutorials

Mar 17, 2022

Automate Meeting Notes with Python

Let's build a web app that receives the audio recording of a meeting and automatically generates meeting notes using AssemblyAI's Speech-to-Text API.

Mısra Turp

Developer Educator

React Speech Recognition with React Hooks

Tutorials

Mar 16, 2022

React Speech Recognition with React Hooks

Learn how to build a React Speech Recognition app that transcribes your voice using the AssemblyAI API.

Stefan Rosanitsch

Contributor