Extract phone call insights with LLMs in Python
Learn how to automatically extract insights from customer calls with Large Language Models (LLMs) and Python.
Tutorials
Learn how to get the transcript of a YouTube video with this easy-to-follow Python tutorial. We'll use yt-dlp to download YouTube videos, and automatically transcribe them with AssemblyAI.
Tutorials
In this tutorial, we will learn how to automatically determine video sections, how to generate section titles with LLMs, and how to format the information for YouTube chapters.
Tutorials
Learn how to identify key phrases and important words using Python and AssemblyAI.
Deep Learning
A new approach towards multi-speaker speech processing integrates Speaker Diarization and Automatic Speech Recognition in a unified framework. We discuss the key insights from this recent exciting development in Speech AI research.
Learn how to incorporate audio files into LlamaIndex and build an LLM-powered query engine in this step-by-step tutorial.
Tutorials
Learn how to perform real-time transcription on audio streams using Python in this tutorial.
Deep Learning
How does OpenAI's groundbreaking DALL-E 2 model actually work? Check out this detailed guide to learn the ins and outs of DALL-E 2.
Tutorials
Retrieval Augmented Generation (RAG) allows you to add relevant documents as context when querying LLMs. Learn how to perform RAG on audio data using LangChain and Chroma in this tutorial.
Deep Learning
Text-to-Music Models are advancing rapidly with the recent release of new platforms for AI-generated music. This guide focuses on MusicLM, MusicGen, and Stable Audio, exploring the technical breakthroughs and challenges in creating music with AI.
Tutorials
In this tutorial, we'll learn how to get Zoom transcripts with the Zoom API and Python.
Tutorials
Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.
Deep Learning
Neural Audio Compression methods based on Residual Vector Quantization are reshaping the landscape of modern audio codecs. In this guide, learn the basic ideas behind RVQ and how it enhances Neural Compression.
Tutorials
Learn how to incorporate audio files into LangChain and build an LLM app on top of spoken data in this step-by-step tutorial.
Tutorials
In this tutorial, we’ll learn how to build an application that automatically summarizes a lecture and lets you ask questions about the lecture material.
Deep Learning
RLHF is the key method used to train AI assistants like ChatGPT, but it has strong limitations and can produce harmful outputs. RLAIF improves upon RLHF by using AI feedback. Learn the differences between the two methods and what these differences mean in practice in this guide.
Deep Learning
What’s the difference between Language Models and Large Language Models? Let’s understand AI development trends and the difficulties of integrating LLMs into real-world applications.
Tutorials
Learn how to perform automatic summarization with Python using LLMs in this easy-to-follow tutorial.
Deep Learning
Large Language Models like ChatGPT are trained with Reinforcement Learning from Human Feedback (RLHF) to learn human preferences. Let’s uncover how RLHF works and survey its most significant current limitations.
Deep Learning
Reinforcement Learning from AI Feedback (RLAIF) is a supervision technique that uses a "constitution" to make AI assistants like ChatGPT safer. Learn everything you need to know about RLAIF in this guide.
Deep Learning
The spotlight has been on language and images for Generative AI, but there's been a lot of recent progress in the audio domain. Learn everything you need to know about generative audio models in this article.
Deep Learning
Speech Recognition models are key in extracting useful information from audio data. Learn how to properly evaluate speech recognition models in this easy-to-follow guide.
Tutorials
Learn how to transcribe audio and video files into text that contains punctuation, casing and formatting using the AssemblyAI Python SDK.
Deep Learning
A Product Manager's guide to understanding Large Language Models and the building blocks of Conversational AI.
Deep Learning
Modern Generative AI models for images are powering a range of creative applications and changing the way we work. This guide covers everything you need to know about these models and how they work.