2023 at AssemblyAI - A Year in Review
Here are some of the new products and features we've launched for customers in 2023:
- Conformer-1 and Conformer-2 AI Models Released: The year saw the launch of Conformer-2, our enhanced AI model for automatic speech recognition. Building on Conformer-1, this new model offers improved recognition of proper nouns and alphanumeric sequences, and better performance in noisy conditions, thanks to its training on over 1.1 million hours of English audio.
- LeMUR Framework Release: LeMUR, our new framework for applying Large Language Models (LLMs) to spoken data, was introduced. It facilitates summarizing, questioning, and text generation for applications dealing with voice data.
- Partnership with AWS Marketplace: This year, we established a partnership with the Amazon Web ServicesMarketplace. This collaboration is aimed at simplifying the use of AWS services with AssemblyAI’s offerings.
- Enhanced AI Models: Our AI models for PII Redaction, Entity Detection, and Punctuation and Casing received major updates, resulting in improved performance and accuracy for voice data processing.
- New No-Code Playground: A redesigned, user-friendly no-code playground was introduced, designed to simplify and accelerate AI application development.
- Series C Funding Milestone: Our Series C funding round raised $50 million, supporting our objective to continue developing advanced Speech AI models that enhance voice data processing capabilities.
LeMUR Cookbooks: Build LLM Audio Apps
LeMUR is the easiest way to code applications that apply LLMs to speech. In just a few lines of code you can search, summarize, ask questions, and generate text across your audio and video data. Check out the following popular LeMUR resources:
- Real-time Transcription with LeMUR.
- Processing Speaker Labels with LeMUR.
- LeMUR's new `input_text` parameter
- Extracting phone call insights with LeMUR.
Try LeMUR in our new playground.
Top Blog Posts In 2023
The Top Free Speech-to-Text APIs, AI Models, and Open Source Engines: Compare the best free Speech-to-Text APIs and AI models on the market today, including APIs that have a free tier. Take a look at several free open source Speech-to-Text engines and explore why you might choose an API versus an open source library, or vice versa. Read more>>
The Full Story of Large Language Models and RLHF: Large Language Models have been in the limelight since the release of ChatGPT, with new models being announced seemingly every week. This guide walks through the essential ideas of how these models came to be. Read more>>
Automatically summarize audio and video files at scale with AI: Learn how AI summarization helps developers and product teams build exciting features that automatically summarize audio and video data. Read more>>
Emergent Abilities of Large Language Models: Emergence can be defined as the sudden appearance of novel behavior. Large Language Models apparently display emergence by suddenly gaining new abilities as they grow. Why does this happen, and what does this mean? Read more>>
Most Popular Video Tutorials in 2023
Speech recognition in Python made easy | Python Tutorial: Learn how to get started with the AssemblyAI Python SDK for speech recognition. In just 5 minutes you'll learn how you can transcribe and analyze audio data.
Vector Databases simply explained! (Embeddings & Indexes): Learn what vector databases and vector embeddings are and how they work.
Analyze a Conversation with AI for Free on the Playground: Conversations have complicated structures that make it hard to analyze them. Learn to use AssemblyAI's API to get multiple audio intelligence insights immediately with your transcription.
🤯 OpenAI Assistants API Python (Full Tutorial): Learn how to build with the new OpenAI Assistants API in less than 20 minutes and how to integrate GPT-4 into your applications.
LangChain explained - The hottest new Python framework: LangChain explained in 3 minutes - LangChain is a Python framework for developing applications powered by language models.