Convert Speech to Text in Python in 5 Minutes
Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.
Browse guided tutorials on NLP, Machine Learning, AI, Deep Learning, coding, and related topics.
Tutorials
Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.
Tutorials
Learn how to incorporate audio files into LangChain and build an LLM app on top of spoken data in this step-by-step tutorial.
Tutorials
In this tutorial, we’ll learn how to build an application that automatically summarizes a lecture and lets you ask questions about the lecture material.
Tutorials
Learn how to apply LLMs to spoken audio with AssemblyAI's new integration for LangChain.js, using TypeScript and Node.js.
Tutorials
Learn how to perform automatic summarization with Python using LLMs in this easy-to-follow tutorial.
Tutorials
Learn how to transcribe audio and video files into text that contains punctuation, casing and formatting using the AssemblyAI Python SDK.
Tutorials
In this tutorial, we'll build a completely free-to-use web app that allows you to generate images with Stable Diffusion (on GPU) in seconds.
Tutorials
Learn how to generate and inpaint images with Stable Diffusion in Keras, and how XLA can boost Stable Diffusion's inference speed, in this easy-to-follow guide.
Deep Learning
Poisson Flow Generative Models (PFGMs) are a new type of generative Deep Learning model, taking inspiration from physics much like Diffusion Models. Learn the theory behind PFGMs and how to generate images with them in this easy-to-follow guide.
Tutorials
OpenAI's Whisper model can perform Speech Recognition on a wide selection of languages. We'll learn how to run Whisper before checking out a performance analysis in this simple guide.
Tutorials
Gradio allows you to easily create shareable apps using only Python. Learn how to build a dashboard for Audio Intelligence Analysis in this easy-to-follow tutorial.
Tutorials
Learn how to automatically transcribe a Zoom meeting in real-time with this simple tutorial.
Tutorials
Stable Diffusion is a text-to-image model with recently-released open-sourced weights. Learn how to generate an image of a scene given only a description of it in this simple tutorial.
Deep Learning
Text-to-Image models have made great strides this year, from DALL-E 2 to the more recent Imagen model. In this tutorial learn how to build a minimal Imagen implementation - MinImagen.
Tutorials
ESPnet is the premier end-to-end, open-source speech processing toolkit. This easy-to-follow guide will help you get started using ESPnet for Speech Recognition.
Tutorials
Learn how to build a JavaScript Audio Transcript application using Node.js and the AssemblyAI JavaScript SDK with this step-by-step beginner's guide.
Tutorials
With just a few lines of code, MediaPipe allows you to incorporate State-of-the-Art Machine Learning capabilities into your applications. Learn about MediaPipe and how to use its simple APIs in this beginner's guide.
Tutorials
Learn how to build a simple JavaScript Text-to-Speech application using JavaScript's Web Speech API in this step-by-step beginner's guide.
Tutorials
Learn how to create a simple React Text-to-Speech application with this step-by-step beginner's guide.
Tutorials
Learn how to build, train, and compare models with TorchStudio - the IDE built specifically for PyTorch.
Tutorials
Let's build a web app that receives the audio recording of a meeting and automatically generates meeting notes using AssemblyAI's Speech-to-Text API.
Tutorials
Learn how to build a React Speech Recognition app that transcribes your voice using the AssemblyAI API.
Tutorials
Learn how to transcribe audio files stored in an AWS S3 bucket with AssemblyAI in 3 simple steps.
Tutorials
A step-by-step Kaldi install tutorial so you can get up and running on your NLP projects as soon as possible.
Tutorials
We say the funniest things when no one is listening. But what if someone did, all the time? In this article, we will learn how to make an app that will listen to you all the time and Tweet the funniest, smartest or most relatable things you say out loud.