Google Drastically Improves Its Speech Recognition

Google has drastically improved its speech recognition technology, the company’s Speech Team has announced. In a new blog post, the team members details how they improved the technology to make Google voice analysis both more accurate and faster.

Essentially, the team has refined the Deep Neural Networks (DNNs) that replaced the 30-year-old Gaussian Mixture Model (GMM) back in 2012. Now, the team has implemented specialized extensions of more accurate recurrent neural networks (RNNs) called sequence dsicriminative training techniques and Connectionist Temporal Classification (CTC).

What does it all mean? The short answer is that RNNs have a built-in feedback loop that lets them place each sound being analyzed into a context, so that rather than trying to identify any one sound (such as the first “m” in “museum”) in isolation from all the others, they can connect the dots right away to see how that sound fits into the larger word being spoken.

(The long answer can be found in the team’s blog post.)

With the new system in place, voice command and search systems in the Google app and on Android systems will benefit from improved accuracy and speed. That could prove particularly advantageous going forward as such software is incorporated into the growing Internet of Things, in which voice command technology could play a crucial role, and fundamentally change the infrastructure of computing in the process.

—

(Originally posted on Mobile ID World)

Sponsored Links

FaceTec’s patented, industry-leading 3D Face Authentication software anchors digital identity, creating a chain of trust from user onboarding to ongoing authentication on all modern smart devices and webcams. FaceTec’s 3D FaceMaps™ make trusted, remote identity verification finally possible. As the only technology backed by a persistent spoof bounty program and NIST/iBeta Certified Liveness Detection, FaceTec is the global standard for Liveness and 3D Face Matching with millions of users on six continents in financial services, border security, transportation, blockchain, e-voting, social networks, online dating and more. www.facetec.com

TECH5 is an international technology company founded by experts from the biometrics industry, which focuses on developing disruptive biometric and digital ID solutions through the application of AI and Machine Learning technologies.

TECH5 target markets include both Government and Private sectors with products powering Civil ID, Digital ID, as well as authentication solutions that deliver identity assurance for various use cases.

Learn more: www.tech5.ai

Mobile ID World is here to bring you the latest in mobile authentication solutions and application providers. Our company is dedicated to providing users with the best content and cutting edge information on technology, news, and mobile solutions for your mobile identity management needs.

HID powers the trusted identities of the world’s people, places and things. Our trusted identity solutions give people convenient and secure access to physical and digital places and connect things that can be identified, verified and tracked digitally. Millions of people use HID products to navigate their everyday lives, and billions of things are connected through HID technology. https://www.hidglobal.com/

As the world moves to a mobile-first economy, businesses need to modernize how they acquire, engage with, and enable consumers. Prove’s phone-centric identity tokenization and passive cryptographic authentication solutions reduce friction, enhance security and privacy across all digital channels, and accelerate revenues while reducing operating expenses and fraud losses. Over 1,000 enterprise customers use Prove’s platform to process 20 billion customer requests annually across industries including banking, lending, healthcare, gaming, crypto, e-commerce, marketplaces, and payments. https://www.prove.com/

Footer

Follow Us