Deep Learning

Deep dives into AI, research, coding, and other topics.

Stable Diffusion 1 vs 2 - What you need to know

Stable Diffusion 2 was released recently, sparking some debate about its performance relative to Stable Diffusion 1. Learn where the differences between the two models stem from and what they mean in practice in this simple guide.

DeepMind's AlphaTensor Explained

AlphaTensor is a novel AI system that uses Reinforcement Learning to discover mathematical algorithms. Learn everything you need to know about AlphaTensor in this comprehensive introduction.

AI research review - Merging Models Modulo Permutation Symmetries

This week’s AI Research Review is Git Re-Basin: Merging Models Modulo Permutation Symmetries.

An Introduction to Poisson Flow Generative Models

Poisson Flow Generative Models (PFGMs) are a new type of generative Deep Learning model, taking inspiration from physics much like Diffusion Models. Learn the theory behind PFGMs and how to generate images with them in this easy-to-follow guide.

AI Research Review - Multistream CNN

This week’s AI Research Review is Multistream CNN for Robust Acoustic Modeling.

AI Research Review - Spelling and ASR

This week’s AI Research Review is Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems.

Deep Learning Paper Recap - Diffusion and Transformer Models

This week’s Deep Learning Paper Reviews are Diffusion-LM Improves Controllable Text Generation and Sparsifying Transformer Models with Trainable Representation Pooling.

How to Run Stable Diffusion Locally to Generate Images

Stable Diffusion is a text-to-image model with recently released, open-source weights. Learn how to generate an image of a scene given only a description of it in this simple tutorial.
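As a rough preview of what running it locally looks like, here is a minimal sketch using the Hugging Face diffusers library; the model ID, prompt, and GPU assumption are illustrative, and the full tutorial may take a different route:

```python
# Minimal sketch: text-to-image generation with Stable Diffusion via diffusers.
# Assumes a CUDA GPU and that torch + diffusers are installed; the model ID
# and prompt are illustrative placeholders.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
)
pipe = pipe.to("cuda")

image = pipe("a watercolor painting of a lighthouse at sunset").images[0]
image.save("lighthouse.png")
```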

MinImagen - Build Your Own Imagen Text-to-Image Model

Text-to-Image models have made great strides this year, from DALL-E 2 to the more recent Imagen model. In this tutorial, learn how to build a minimal Imagen implementation - MinImagen.

Deep Learning Paper Recap - Redundancy Reduction and Sparse MoEs

This week’s Deep Learning Paper Reviews are Barlow Twins: Self-Supervised Learning via Redundancy Reduction and Sparse MoEs Meet Efficient Ensembles.

Deep Learning Paper Recap - Transfer Learning

This week’s Deep Learning Paper Review is Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis.

Deep Learning Paper Recap - Automatic Speech Recognition

This week’s Deep Learning Paper Recaps are Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition and Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition.

Deep Learning Paper Recaps - Modality Matching and Masked Autoencoders

This week’s Deep Learning Paper Recaps are MAESTRO: Matched Speech Text Representations through Modality Matching and Masked Autoencoders that Listen.

Deep Learning Paper Recap - Language Models

This week’s Deep Learning Paper Recap is Prune Once For All: Sparse Pre-Trained Language Models.

How Imagen Actually Works

Given a brief description of a scene, Imagen can generate photorealistic, high-resolution images of the scene. Learn everything you need to know about Imagen and how it works in this easy-to-follow guide.

Deep Learning Paper Recap - Streaming ASR and Summarization

This week’s Deep Learning Paper Recaps are Bridging the gap between streaming and non-streaming ASR systems by distilling ensembles of CTC and RNN-T models and BRIO: Bringing Order to Abstractive Summarization.

Review – TOXIGEN & Knowledge Distillation Meets Open-Set Semi-Supervised Learning

This week’s Deep Learning Paper Reviews are TOXIGEN: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection and Knowledge Distillation Meets Open-Set Semi-Supervised Learning.

Review - Decision Transformer & SPIRAL

This week’s Deep Learning Paper Reviews are Decision Transformer: Reinforcement Learning via Sequence Modeling and SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training.

Introduction to Diffusion Models for Machine Learning

The meteoric rise of Diffusion Models is one of the biggest developments in Machine Learning in the past several years. Learn everything you need to know about Diffusion Models in this easy-to-follow guide.

Review - ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

This week’s Deep Learning Paper Review is ALBERT: A Lite BERT for Self-supervised Learning of Language Representations.

BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models

This paper falls into the category of parameter-efficient fine-tuning, where the goal is to train as few parameters as possible while achieving almost the same accuracy as fine-tuning the whole model.
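To make the idea concrete, here is a minimal sketch of bias-only fine-tuning in PyTorch with Hugging Face Transformers; the checkpoint, learning rate, and the choice to also train the classification head are illustrative assumptions, not the paper's exact recipe:

```python
# Minimal sketch of BitFit-style fine-tuning: freeze all weights and train
# only the bias terms (plus the task head) of a pre-trained Transformer.
# The checkpoint and hyperparameters below are illustrative.
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")

for name, param in model.named_parameters():
    # Only bias vectors and the classification head stay trainable.
    param.requires_grad = ("bias" in name) or ("classifier" in name)

trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable, lr=1e-3)
# ...standard training loop over the downstream task goes here.
```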

What is Gradient Clipping for Neural Networks?

In this video, we will learn about Gradient Clipping, a technique to tackle the exploding gradients problem in Neural Networks.
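For a concrete picture of the technique, here is a minimal PyTorch sketch; the toy model and the clipping threshold of 1.0 are illustrative:

```python
# Minimal sketch of gradient clipping: rescale all parameter gradients so
# their combined norm never exceeds a threshold (1.0 here, illustrative).
import torch
import torch.nn as nn

model = nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

x, y = torch.randn(32, 10), torch.randn(32, 1)
loss = nn.functional.mse_loss(model(x), y)
loss.backward()

# Clip before the optimizer step so the update uses the clipped gradients.
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()
```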

Why You Should (or Shouldn't) be Using Google's JAX in 2023

Should you be using JAX in 2023? Check out our recommendations on using JAX for Deep Learning and more!

Hyperparameters of Neural Networks

In this video, we take a high-level look at all the main hyperparameters of Neural Networks.

What is Layer Normalization?

In this video, we learn how Layer Normalization works, how it compares to Batch Normalization, and for what cases it works best.
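As a quick illustration of the difference, here is a minimal PyTorch sketch showing that Layer Normalization normalizes each sample over its own features, rather than over the batch as Batch Normalization does; the tensor shapes are illustrative:

```python
# Minimal sketch: Layer Normalization computed by hand vs. nn.LayerNorm.
# Each sample is normalized over its own feature dimension.
import torch
import torch.nn as nn

x = torch.randn(4, 16)  # batch of 4 samples, 16 features each

# Manual version: per-sample mean and variance over the last dimension.
mean = x.mean(dim=-1, keepdim=True)
var = x.var(dim=-1, keepdim=True, unbiased=False)
manual = (x - mean) / torch.sqrt(var + 1e-5)

# Built-in module (its learnable scale and shift start at 1 and 0,
# so the output matches the manual computation).
layer_norm = nn.LayerNorm(16)
print(torch.allclose(layer_norm(x), manual, atol=1e-5))  # True
```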