Ryan O'Connor

Senior Developer Educator

Modern Generative AI for images

Modern Generative AI models for images are powering a range of creative applications and changing the way we work. This guide gives an overview of everything you need to know about these models and how they work.

Introduction to Generative AI

Generative AI has made tremendous strides recently, from models like Stable Diffusion to ChatGPT. Get up to speed on the latest advancements with this easy-to-follow introduction to Generative AI.

Everything you need to know about Generative AI

Generative AI has taken the world by storm in the last several months, but what actually is Generative AI, and how does it work? Learn everything you need to know about Generative AI in this easy-to-follow series.

How physics advanced Generative AI

Many cutting-edge Generative AI models are inspired by concepts from physics. In this guide, we’ll take a high-level look at how physics is driving advancements in AI.

Emergent Abilities of Large Language Models

Emergence can be defined as the sudden appearance of novel behavior. Large Language Models apparently display emergence by suddenly gaining new abilities as they grow. Why does this happen, and what does this mean?

Build a free Stable Diffusion app with a GPU backend

In this tutorial, we'll build a completely free-to-use web app that allows you to generate images with Stable Diffusion (on GPU) in seconds.

2022 at AssemblyAI - A Year in Review

The end of 2022 is quickly approaching, and what a year it has been! As we get closer to 2023, we wanted to take a moment to look back and reflect on some of the highlights of the past year.

Winners and Honorable Mentions - AssemblyAI $50k Winter Hackathon

Last weekend, we hosted the AssemblyAI $50k AI Winter Hackathon - our first-ever hackathon. Check out the winning projects and learn more about the event below!

Releasing our new v9 transcription model - 11% better accuracy

Today, we’re excited to release our most accurate Speech Recognition model to date - version 9 (v9). This updated model delivers increased performance across many metrics on a wide range of audio types.

Stable Diffusion 1 vs 2 - What you need to know

Stable Diffusion 2 was released recently, sparking some debate about its performance relative to Stable Diffusion 1. Learn where the differences between the two models stem from and what they mean in practice in this simple guide.

New AI Models to summarize audio and video for any use case

We are introducing several new Summarization models, each tailored to a specific use case.

Stable Diffusion in Keras - A Simple Tutorial

Learn how to generate and inpaint images with Stable Diffusion in Keras, and how XLA can boost Stable Diffusion's inference speed, in this easy-to-follow guide.

An Introduction to Poisson Flow Generative Models

Poisson Flow Generative Models (PFGMs) are a new type of generative Deep Learning model, taking inspiration from physics much like Diffusion Models. Learn the theory behind PFGMs and how to generate images with them in this easy-to-follow guide.

How to Run OpenAI’s Whisper Speech Recognition Model

OpenAI's Whisper model can perform Speech Recognition in a wide range of languages. In this simple guide, we'll learn how to run Whisper and then look at a performance analysis.

Getting Started with Hugging Face's Gradio

Gradio allows you to easily create shareable apps using only Python. Learn how to build a dashboard for Audio Intelligence Analysis in this easy-to-follow tutorial.

How to Automatically Transcribe Zoom Calls in Real-Time

Learn how to automatically transcribe a Zoom meeting in real-time with this simple tutorial.

How to Run Stable Diffusion Locally to Generate Images

Stable Diffusion is a text-to-image model with recently released open-source weights. Learn how to generate an image of a scene given only a description of it in this simple tutorial.

MinImagen - Build Your Own Imagen Text-to-Image Model

Text-to-Image models have made great strides this year, from DALL-E 2 to the more recent Imagen model. In this tutorial, learn how to build a minimal Imagen implementation - MinImagen.

How Imagen Actually Works

Given a brief description of a scene, Imagen can generate photorealistic, high-resolution images of the scene. Learn everything you need to know about Imagen and how it works in this easy-to-follow guide.

Getting Started with ESPnet

ESPnet is the premier end-to-end, open-source speech processing toolkit. This easy-to-follow guide will help you get started using ESPnet for Speech Recognition.

Introduction to Diffusion Models for Machine Learning

The meteoric rise of Diffusion Models is one of the biggest developments in Machine Learning in the past several years. Learn everything you need to know about Diffusion Models in this easy-to-follow guide.

MediaPipe for Dummies

With just a few lines of code, MediaPipe allows you to incorporate State-of-the-Art Machine Learning capabilities into your applications. Learn about MediaPipe and how to use its simple APIs in this beginner's guide.

A Beginner's Guide to TorchStudio, The PyTorch IDE

Learn how to build, train, and compare models with TorchStudio - the IDE built specifically for PyTorch.

Transcribe Audio Files in an S3 Bucket with AssemblyAI

Learn how to transcribe audio files stored in an AWS S3 bucket with AssemblyAI in 3 simple steps.

Kaldi Install for Dummies

A step-by-step Kaldi install tutorial so you can get up and running on your NLP projects as soon as possible.