Google Significantly Improves its Mobile Speech Recognition

Google has made some significant improvements to the speech recognizer on its mobile phones. The new software outputs every single character in real time and is entirely contained on the mobile device, which means that the dictation system will work offline with zero latency.

Johan Schalkwyk, a Google Fellow with the company’s Speech Team, explained the new system in a recent AI Blog post. According to Schalkwyk, more conventional speech recognition systems convert speech to text using a sequence that involves three separate steps, beginning with an analysis of an audio sample to identify specific sounds. The software then uses those sounds to form words and a language model to complete the sentence.

The drawback is that those traditional systems require a complete input sequence in order to generate a transcription. Google’s team used Recurrent Neural Network transducer (RNN-T) technology to convert audio input to text output on a character-by-character basis, improving speed by outputting each individual letter instead of a longer word or phrase.

The new platform is also smaller than its predecessors, reducing the speech recognizer footprint from 2 GB to 80 MB. At the former size, speech recognizers are too unwieldly to store on a mobile device and therefore require a network connection in order to function. The new dictation system is small enough to embed on a standard smartphone and will be available to customers on or offline.

For now, the new speech recognizer will only be available in American English on Pixel phones, though Google hopes to launch the service for more languages and devices soon. The announcement is the latest RNN breakthrough for the company’s speech recognition team, which achieved human parity back in 2017.

Source: Google AI Blog

(Originally posted on Mobile ID World)

Related News

Sponsored Links

FaceTec’s patented, industry-leading 3D Face Authentication software anchors digital identity, creating a chain of trust from user onboarding to ongoing authentication on all modern smart devices and webcams. FaceTec’s 3D FaceMaps™ make trusted, remote identity verification finally possible. As the only technology backed by a persistent spoof bounty program and NIST/iBeta Certified Liveness Detection, FaceTec is the global standard for Liveness and 3D Face Matching with millions of users on six continents in financial services, border security, transportation, blockchain, e-voting, social networks, online dating and more. www.facetec.com

TECH5 is an international technology company founded by experts from the biometrics industry, which focuses on developing disruptive biometric and digital ID solutions through the application of AI and Machine Learning technologies.

TECH5 target markets include both Government and Private sectors with products powering Civil ID, Digital ID, as well as authentication solutions that deliver identity assurance for various use cases.

Learn more: www.tech5.ai

Mobile ID World is here to bring you the latest in mobile authentication solutions and application providers. Our company is dedicated to providing users with the best content and cutting edge information on technology, news, and mobile solutions for your mobile identity management needs.

HID powers the trusted identities of the world’s people, places and things. Our trusted identity solutions give people convenient and secure access to physical and digital places and connect things that can be identified, verified and tracked digitally. Millions of people use HID products to navigate their everyday lives, and billions of things are connected through HID technology. https://www.hidglobal.com/

As the world moves to a mobile-first economy, businesses need to modernize how they acquire, engage with, and enable consumers. Prove’s phone-centric identity tokenization and passive cryptographic authentication solutions reduce friction, enhance security and privacy across all digital channels, and accelerate revenues while reducing operating expenses and fraud losses. Over 1,000 enterprise customers use Prove’s platform to process 20 billion customer requests annually across industries including banking, lending, healthcare, gaming, crypto, e-commerce, marketplaces, and payments. https://www.prove.com/

Related News

Footer

Follow Us