no-chatbot

Extract phone call insights with LLMs in Python
Extract phone call insights with LLMs in Python

Learn how to automatically extract insights from customer calls with Large Language Models (LLMs) and Python.

How to Get YouTube Video Transcripts
How to Get YouTube Video Transcripts

Learn how to get the transcript of a YouTube video with this easy-to-follow Python tutorial. We'll use yt-dlp to download YouTube videos, and automatically transcribe them with AssemblyAI.

Automatically determine video sections with AI using Python
Automatically determine video sections with AI using Python

In this tutorial, we will learn how to automatically determine video sections, how to generate section titles with LLMs, and how to format the information for YouTube chapters.

Key phrase detection in audio files using Python
Key phrase detection in audio files using Python

Learn how to identify key phrases and important words using Python and AssemblyAI.

Combining Speech Recognition and Diarization in one model
Combining Speech Recognition and Diarization in one model

A new approach towards multi-speaker speech processing integrates Speaker Diarization and Automatic Speech Recognition in a unified framework. We discuss the key insights from this recent exciting development in Speech AI research.

How to use audio data in LlamaIndex with Python
How to use audio data in LlamaIndex with Python

Learn how to incorporate audio files into LlamaIndex and build an LLM-powered query engine in this step-by-step tutorial.

Real-time transcription in Python
Real-time transcription in Python

Learn how to perform real-time transcription on audio streams using Python in this tutorial.

How DALL-E 2 Actually Works
How DALL-E 2 Actually Works

How does OpenAI's groundbreaking DALL-E 2 model actually work? Check out this detailed guide to learn the ins and outs of DALL-E 2.

Retrieval Augmented Generation on audio data with LangChain and Chroma
Retrieval Augmented Generation on audio data with LangChain and Chroma

Retrieval Augmented Generation (RAG) allows you to add relevant documents as context when querying LLMs. Learn how to perform RAG on audio data using LangChain and Chroma in this tutorial.

What AI Music Generators Can Do (And How They Do It)
What AI Music Generators Can Do (And How They Do It)

Text-to-Music Models are advancing rapidly with the recent release of new platforms for AI-generated music. This guide focuses on MusicLM, MusicGen, and Stable Audio, exploring the technical breakthroughs and challenges in creating music with AI.

How to get Zoom Transcripts with the Zoom API
How to get Zoom Transcripts with the Zoom API

In this tutorial, we'll learn how to get Zoom transcripts using the Zoom API using Python.

Convert Speech to Text in Python in 5 Minutes
Convert Speech to Text in Python in 5 Minutes

Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.

Residual Vector Quantization RVQ for Neural Compression
What is Residual Vector Quantization?

Neural Audio Compression methods based on Residual Vector Quantization are reshaping the landscape of modern audio codecs. In this guide, learn the basic ideas behind RVQ and how it enhances Neural Compression.

How to use audio data in LangChain with Python
How to use audio data in LangChain with Python

Learn how to incorporate audio files into LangChain and build an LLM app on top of spoken data in this step-by-step tutorial.

How to build an interactive lecture summarization app
How to build an interactive lecture summarization app

In this tutorial, we’ll learn how to build an application that automatically summarizes a lecture and lets you ask questions about the lecture material.

RLHF vs RLAIF for language model alignment
RLHF vs RLAIF for language model alignment

RLHF is the key method used to train AI assistants like ChatGPT, but it has strong limitations and can produce harmful outputs. RLAIF improves upon RLHF by using AI feedback. Learn the differences between the two methods and what these differences mean in practice in this guide.

Why Language Models Became Large Language Models And The Hurdles In Developing LLM-based Applications
Why Language Models Became Large Language Models And The Hurdles In Developing LLM-based Applications

What’s the difference between Language Models and Large Language Models? Let’s understand AI development trends and the difficulties of integrating LLMs into real-world applications.

Automatic summarization with LLMs in Python
Automatic summarization with LLMs in Python

Learn how to perform automatic summarization with Python using LLMs in this easy-to-follow tutorial.

How RLHF Models Works - Reinforcement Learning From Human Feedback
How RLHF Preference Model Tuning Works (And How Things May Go Wrong)

Large Language Models like ChatGPT are trained with Reinforcement Learning From Human Feedback (RLHF) to learn human preferences. Let’s uncover how RLHF works and survey its current strongest limitations.

How Reinforcement Learning from AI Feedback works
How Reinforcement Learning from AI Feedback works

Reinforcement Learning from AI Feedback (RLAIF) is a supervision technique that uses a "constitution" to make AI assistants like ChatGPT safer. Learn everything you need to know about RLAIF in this guide.

Recent developments in Generative AI for Audio
Recent developments in Generative AI for Audio

The spotlight has been on language and images for Generative AI, but there's been a lot of recent progress in the audio domain. Learn everything you need to know about generative audio models in this article.

How to evaluate Speech Recognition models
How to evaluate Speech Recognition models

Speech Recognition models are key in extracting useful information from audio data. Learn how to properly evaluate speech recognition models in this easy-to-follow guide.

Python Speech-to-Text with Punctuation, Casing, and Formatting
Python Speech-to-Text with Punctuation, Casing, and Formatting

Learn how to transcribe audio and video files into text that contains punctuation, casing and formatting using the AssemblyAI Python SDK.

Large Language Models for Product Managers: 5 Things to Know
Large Language Models for Product Managers: 5 Things to Know

A Product Manager's guide to understanding Large Language Models and the building blocks of Conversational AI.

Modern Generative AI for images
Modern Generative AI for images

Modern Generative AI models for images are powering a range of creative applications and changing the way we work. This guide will overview everything you need to know about these models and how they work.