AssemblyAI Research | Building the world's leading Speech AI models

Universal-2-TF: Robust All-Neural Text Formatting for ASR

Universal-2-TF introduces a two-stage neural text formatting model for ASR that combines token classification and sequence-to-sequence approaches to efficiently handle punctuation, capitalization, and text normalization while achieving superior accuracy across diverse domains.

Research paper

Universal-1: Robust and accurate multilingual speech-to-text

We are excited to introduce Universal-1, our latest and most powerful speech recognition model. Trained on over 12.5 million hours of multilingual audio data, Universal-1 achieves best-in-class speech-to-text accuracy across four major languages: English, Spanish, French, and German.

Research paper

Introducing Conformer-2

We're introducing Conformer-2, our latest AI model for automatic speech recognition. Conformer-2 is trained on 1.1M hours of English audio data, extending Conformer-1 to provide improvements on proper nouns, alphanumerics, and robustness to noise.

Research paper

Light-weight probing of unsupervised representations for Reinforcement Learning

An evaluation protocol for unsupervised RL representations uses two linear probing tasks to predict rewards and expert actions, reducing computational cost and improving RL training efficiency.

Conformer-1: A robust speech recognition model trained on 650K hours of data

We're introducing Conformer-1, a state-of-the-art speech recognition model trained on 650K hours of audio data that achieves near human-level performance and robustness across a variety of data.

Research paper

Pioneering the next generation of Speech AI technology