Palo Alto, California May 13, 2021 - Soniox Inc launched the Soniox AI Speech Recognition Platform, the world’s first self-learning artificial intelligence for automatic speech recognition. Soniox Speech AI leverages vast amounts of available unlabeled audio and text to teach itself how to recognize complex speech patterns. As a result, Soniox Speech AI can accurately recognize speech in real-world environments on most topics of human knowledge with up to 24% improved word-error-rate than today’s leading speech systems. Available now at https://soniox.com and as an iOS app, Soniox offers speech recognition services and products for enterprises, developers and consumers.
Soniox has invented a novel approach to training speech recognition models to overcome today’s speech recognition limitations. In unsupervised fashion, Soniox Speech AI learns from vast amounts of unlabeled audio and unlabeled text that is publicly available on the internet. It learns to recognize words by exploring different interpretations of spoken words in unlabeled audio and their usage in unlabeled written text. Soniox Speech AI can now uniquely recognize near error-free most of the words in English language without requiring direct human supervision.
Prior to today’s announcement, humans were needed to manually transcribe audio to create accurate labeled speech-to-text datasets, making speech recognition learning extremely time-consuming and expensive. Collecting labeled data for speech recognition was further challenged because of the extreme variety of the speech input and output space. Existing approaches made it practically infeasible to obtain sufficient amounts of paired audio-transcript data to cover the complex input and output space.
In contrast, Soniox Speech AI continuously learns and auto-improves as it gains access to more unlabeled audio and unlabeled text. With each iteration, Soniox Speech AI is slightly better and is able to correctly interpret and recognize more and more of the words in human knowledge.
“Audio is becoming the prevalent medium for rapid, immersive communication” said Klemen Simonic, Founder and CEO of Soniox. “With our self-learning AI platform, Soniox has built the industry’s strongest infrastructure and toolset to build advanced speech and audio understanding solutions. Our self-learning speech AI is the first example of how Soniox can solve hard problems differently. Expect more to come in the near future!”
To make speech recognition accessible and easy to use, Soniox has built both the Soniox web application and the Soniox mobile application (for iOS devices). Among other features, these applications enable users to instantly transcribe audio/video files or live streams, such as meetings and conversations. These products are available for free up to 5 hours of speech recognition per month.
Privacy and security is critical for speech recognition use. Soniox has developed an on-premises deployment of Soniox Speech AI, where the entire system is deployed within the enterprise's infrastructure. The on-premises deployment supports efficient and distributed processing of large volumes of audio in real-time and low-latency settings. Soniox has also developed on-mobile-device deployment of Soniox Speech AI for iOS devices. The entire computation takes place on the mobile device and the audio never leaves the device. It also eliminates the requirement for network connectivity while transcribing audio streams.
About Soniox Inc: Soniox was founded in April in 2020 in Redwood City, California. Soniox mission is to deeply understand audio and make it universally accessible and useful. Soniox developed the world’s first self-learning artificial intelligence for automatic speech recognition and is a leader in speech recognition technology. To learn more about the Soniox, visit https://soniox.com
Jennifer Grenz: jen [at] soniox.com