Android Police

speech recognition

Readers like you help support Android Police. When you make a purchase using links on our site, we may earn an affiliate commission. Read More.

latest

Google maps illustration with a a blurred map of New York in the background and the Google Maps icon in the foreground

Google's Speech Services has been around since 2013, and it’s been an important part of the Android Operating System’s history. Many apps have utilized it to help with text-to-speech and speech-to-text, but the service has been used less and less by Google as it has begun to show its age. Google Maps, which used Speech Services for years, has now replaced it with Google Assistant to provide a smoother user experience.

google pixel 6 speech recognition featured image

At today's Pixel 6 and 6 Pro announcement, Google detailed that the Tensor chip is enabling quite some incredible camera voodoo. But Tensor is also improving other machine-learning related features, like speech recognition. The chip is supposed to drastically improve the natural language processing for speech-to-text.

Microsoft just bought the company that built Siri's voice recognition for Apple

Microsoft purchased Nuance, makers of Dragon speech-to-text tools, for $16 billion

4
By 

Cortana, Microsoft's digital voice assistant named after that hologram lady from HALO, didn't exactly set the world on fire. Most Windows users I've seen treat it like an annoyance to be avoided, instead of an integral part of the system, as Google's Assistant and Apple's Siri have become. But based on its latest corporate purchase, Microsoft might be taking another stab at voice-powered interaction.

We all know Google's speech transcription technology is really, really, really good. Not only is it the best in the industry, it's doing it without a data connection: Pixels have been transcribing audio on-device for some time now, and that's been owed to Google's extremely impressive transcription algorithms that utilize machine learning hardware on its smartphones. But accuracy isn't everything when it comes to transcription, even if it the single most important feature—speed matters too.

Google collects vast amounts of speech data across all of its products, and while it hasn't been too transparent about the practice, we as users profit from it for the most part. Speech recognition has consistently gotten better over the years, which has allowed impressive sci-fi tech like smart speakers to enter our homes. There's one department where Google needs to step up its game, though: multilingual speakers are having a hard time using more than one language on any Google product.

At I/O 2019, Google is going public with a bold AI-driven accessibility initiative it calls Project Euphonia. The goal is to transcribe the words spoken by people who have non-standard speech patterns — those who live with ALS, hearing impairment and other conditions — across messaging platforms to enable better communication between those people, their friends and family, and others.

Speech to text is something we take for granted a lot of the time. It's a complicated process, though — so much so that the heavy lifting is actually done remotely, and the end result is sent back to our devices. But Google has worked out a way to shrink the process to the point that it can be performed locally, and the fruits of that labor are coming to Gboard.

Speech recognition is one of the most powerful aspects of many Google products, particularly in the Google app where Voice Search relies on being able to understand what we're saying. The same is true of Gboard, which is capable of typing up entire messages based on what you dictate to it. We may take it for granted somewhat these days, but it truly is a marvel. Now, this feature can be enjoyed by many more around the globe as Google has added support for 30 further languages.

 

It's no secret that one of Google's strengths in recent years has been voice recognition. In my own experience, my Google Home picks up what I am trying to say almost every time, even in a low voice. Obviously the success rate varies by language and accent, but it is still pretty darn impressive.

As the years go by, I get lazier it seems. So the prospect of talking to my computing devices gets better and better as time passes (even if I have to yell across my house to my Google Home, which is amusing in its own right). However, always-on speech recognition comes at a price for battery-operated devices. Researchers at MIT claim to have to come up with a solution to this: a dedicated speech recognition chip that can reduce power consumption by 90-99% across real-world devices.

Google's voice search function is pretty impressive, especially if you compare it to some of the alternatives that we used to have. But it's by no means perfect - proper nouns and regional accents can sometimes throw it for a loop, necessitating manual text input. But if you'd rather give it another go while restricting your voice input to specific words, that's now an option. One of our readers found this particular tool - we're not sure how long it's been active in Google Search, but it's quite handy.

We know, we know - you're tired of hearing about Siri and its respective knockoffs. But, we assure you, this one is different. Very different. In fact, it's beyond anything we've ever seen before.

Update: For anyone who can't install this because of geographical limitations, here is the apk for manual installation -- make sure to drop your results in the comments. Enjoy! Thanks, Nick!