no-chatbot - News, Tutorials, AI Research

Announcing the AssemblyAI integration for LiveKit

Announcements

Dec 18, 2024

Announcing the AssemblyAI integration for LiveKit

LiveKit allows you to build real-time audio and video applications - now you can build with AssemblyAI's Streaming Speech-to-Text in LiveKit.

Ryan O'Connor

Senior Developer Educator

How to build a LiveKit app with real-time Speech-to-Text

Tutorial

Dec 18, 2024

How to build a LiveKit app with real-time Speech-to-Text

LiveKit allows you to build real-time audio and video applications - learn how to add real-time Speech-to-Text to your LiveKit application in this tutorial.

Ryan O'Connor

Senior Developer Educator

Universal-2 in Action: Transforming Conversational Data Across Industries

Industry

Dec 9, 2024

Universal-2 in Action: Transforming Conversational Data Across Industries

Universal-2 is solving problems in Conversational Intelligence by optimizing Speech-to-Text for real-world use cases

Ryan O'Connor

Senior Developer Educator

How to transcribe Zoom participant recordings (multichannel)

Tutorials

Nov 25, 2024

How to transcribe Zoom participant recordings (multichannel)

Zoom allows you to record each participant's audio track separately. Learn how to combine this with AssemblyAI's multichannel transcription for accurate meeting transcripts.

Ryan O'Connor

Senior Developer Educator

Transcribe audio and video files with Python and Universal-1

Tutorials

Nov 24, 2024

Transcribe audio and video files with Python and Universal-1

Learn how to transcribe audio and video files in your Python applications with AssemblyAI's Universal-1 speech recognition model.

Matt Makai

VP of Developer Relations & Experience

Case Studies

Nov 19, 2024

How we built our AI Lakehouse

Learn how we built our AI data Lakehouse to allow for rapid research iteration while maintaining cohesive, secure, and deduplicated datasets.

Ahmed Etefy, Ryan O'Connor

Tech Lead - Data Infrastructure, Senior Developer Educator

Tutorial

Nov 15, 2024

Talk to ChatGPT on a Phone Call

Learn how to build a Speech AI app that lets you talk to ChatGPT over the phone.

Artem Oppermann

Featured writer

How to use Google's Speech-to-Text API to transcribe audio in Python

Tutorials

Nov 12, 2024

How to use Google's Speech-to-Text API to transcribe audio in Python

Learn how to set up a Google Cloud project to transcribe both local and remote audio files using Google's Speech-to-Text API and Python

Ryan O'Connor

Senior Developer Educator

Auto-generate subtitles with Python and AssemblyAI

Tutorial

Nov 5, 2024

Auto-generate subtitles with Python and AssemblyAI

Stop manually creating subtitles for your videos, and learn how to auto-generate them with Python and AssemblyAI in this tutorial.

Marcus Olsson

Senior Developer Educator

How to build a free Whisper API with GPU backend

Tutorials

Oct 22, 2024

How to build a free Whisper API with GPU backend

Learn how to make a free, GPU-powered Whisper API for transcribing audio files

Ryan O'Connor

Senior Developer Educator

Tutorials

Sep 27, 2024

Speech-to-Text with Django

Learn how to integrate Speech-to-Text functionality into Django and build an example app.

Patrick Loeber

Senior Developer Advocate

How to identify languages in audio data using Python

Tutorial

Sep 12, 2024

How to identify languages in audio data using Python

Learn how to use Python to automatically identify languages in audio files.

Patrick Loeber

Senior Developer Advocate

How to perform Speaker Diarization in Python

Tutorials

Sep 10, 2024

How to perform Speaker Diarization in Python

Learn how to use Python to perform speaker diarization on audio and video files to identify "who said what when"

Ryan O'Connor

Senior Developer Educator

Speaker diarization vs speaker recognition - what's the difference?

Industry

Sep 9, 2024

Speaker diarization vs speaker recognition - what's the difference?

Learn the differences between speaker diarization and speaker recognition, as well as speaker verification and speaker identification in audio analysis

Ryan O'Connor

Senior Developer Educator

Build a Discord Voice Bot to Add ChatGPT to Your Voice Channel

Tutorials

Sep 5, 2024

Build a Discord Voice Bot to Add ChatGPT to Your Voice Channel

Build a sophisticated Discord voice bot that leverages AssemblyAI for speech transcription, OpenAI's GPT-3.5 Turbo AI model for intelligent processing, and ElevenLabs for speech synthesis.

Michael Nyamande

Featured writer

Analyze Audio from Zoom Calls with AssemblyAI and Node.js

Tutorials

Aug 28, 2024

Analyze Audio from Zoom Calls with AssemblyAI and Node.js

Learn how to analyze audio from Zoom calls using AssemblyAI and Node.js.

David Ekete

Featured writer

Decoding Strategies: How LLMs Choose The Next Word

Deep Learning

Aug 21, 2024

Decoding Strategies: How LLMs Choose The Next Word

Large Language Models are trained to guess the next word. But when generating text, the combination of their probability estimates with algorithms known as decoding strategies is what determines how they actually choose words. Learn how decoding strategies work in this article.

Marco Ramponi

Developer Educator

The Best Audio File Formats for Speech-to-Text: A Guide

Automatic Speech Recognition

Aug 9, 2024

The Best Audio File Formats for Speech-to-Text: A Guide

Learn about the best audio and video formats for speech-to-text applications, as well as best practices for audio post-processing techniques.

Patrick Loeber

Senior Developer Advocate

Get started using Claude 3.5 Sonnet with audio data

Tutorials

Jul 19, 2024

Get started using Claude 3.5 Sonnet with audio data

Learn how to use the Claude 3 models with audio and video data in Python.

Patrick Loeber

Senior Developer Advocate

Florence-2: How it works and how to use it

Deep Learning

Jul 15, 2024

Florence-2: How it works and how to use it

Microsoft's Florence-2 is a foundational image model that can perform almost every common task in computer vision. Learn how Florence-2 works and how to use it in this guide.

Ryan O'Connor

Senior Developer Educator

How to Create a Real-Time Language Translation Service with AssemblyAI and DeepL in JavaScript

Tutorials

Jul 12, 2024

How to Create a Real-Time Language Translation Service with AssemblyAI and DeepL in JavaScript

Learn how to translate speech in real-time in JavaScript with AssemblyAI and DeepL.

Aniket Bhattacharyea

Featured writer

Create Multi-Lingual Subtitles with AssemblyAI and DeepL

Tutorials

Jul 8, 2024

Create Multi-Lingual Subtitles with AssemblyAI and DeepL

Learn how to build a web app in Go that'll use AssemblyAI to transcribe an uploaded video file and generate subtitles.

Aniket Bhattacharyea

Featured writer

Build an AI-powered video conferencing app with Next.js and Stream

Tutorials

Jul 2, 2024

Build an AI-powered video conferencing app with Next.js and Stream

Learn how to build a Next.js video conferencing app that supports video calls with live transcriptions and an LLM-powered meeting assistant.

Patrick Loeber, Stefan Blos

Senior Developer Advocate, Developer Advocate at Stream

How to Do Hotword Detection with Streaming Speech-to-Text and Go

Tutorials

Jun 25, 2024

How to Do Hotword Detection with Streaming Speech-to-Text and Go

In this tutorial, you'll learn how to respond to hotwords in voice data using Streaming Speech-to-Text in Go.

Yasoob Khalid

Featured writer

Content moderation on audio files with Python

Tutorials

May 27, 2024

Content moderation on audio files with Python

Modern AI models make it easy to automatically detect the presence of sensitive topics in speech data. Learn how to perform configurable content moderation with Python in this tutorial.

Ryan O'Connor

Senior Developer Educator