Industry

Industry-related news and insights for enterprises building features and products with state-of-the-art AI models.

Top 7 Data Science Blogs for Data Scientists and Enthusiasts

If you’re looking for a new data science blog to follow, check out our team's top picks, as well as recommended readings!

Machine Learning Podcasts - The Ultimate Listening Guide

Our team of deep learning engineers works to keep up with the latest research, industry news, applications, and future outlook of machine learning, deep learning, and artificial intelligence in our quest to create an industry-best Speech-to-Text API.

Data Science Podcasts to Listen to Now

From accessible, real-world topics to deep dives into the most technical aspects of data science to interviews with industry experts, these nine data science podcasts are all worthwhile listens to add to your playlist.

What to Know About Speech-to-Text Privacy

If you’re an application developer, here are the questions about data security and privacy to explore before choosing the best Speech-to-Text API for your project.

Top 5 Machine Learning Blogs to Follow

Here are our top five machine learning blogs: Distill, Machine Learning Mastery, ML CMU, Neptune Blog, and Hacker News.

Do I Need A Custom Speech Recognition Model?

In the field of Automatic Speech Recognition, or ASR, custom models are actually considered an “old school” approach. This is because in the past, classical ASR models had plateaued, so the only option was to try to customize the models in order to increase accuracy.

What is Speaker Diarization and How Does it Work?

In the field of Automatic Speech Recognition (ASR), Speaker Diarization refers to (A) automatically detecting the number of speakers in an audio file, and (B) assigning each word to the correct speaker in that file.

Comparing Zoom Transcription Accuracy Across Speech-to-Text APIs

In this benchmark report, we compare transcription accuracy between AssemblyAI, Google Cloud Speech-to-Text, and AWS Transcribe on Zoom Meeting Recordings.

Speech-to-Text Accuracy on Podcasts, News Broadcasts, and Social Media

In this report, we look at 12 different audio/video files from various sources, and review how accurately AssemblyAI, AWS Transcribe, and Google Speech-to-Text are able to automatically transcribe each file.

Comparing Speech-to-Text APIs on Phone Call Transcription

In this report, we look at five different earnings calls from various companies, and review how accurately AssemblyAI, AWS Transcribe, and Google Speech-to-Text are able to automatically transcribe these recordings.