How Baidu's AI Lab plans to solve speech recognition - with lots of data

Deep Speech 2 uses deep learning to recognise words in English and Mandarin, reports Tech in Asia

AI, artificial intelligence, robot
Baidu's Xiaodu, an artificial intelligent robot, can respond to voice commands
Eva Xiao | Tech in Asia
Last Updated : Feb 23 2017 | 5:06 PM IST
Baidu wants to build a speech recognition engine that’s 99 percent accurate, a threshold that Andrew Ng, chief scientist at Baidu and founder of Google’s "Google Brain" deep learning project, believes will fundamentally change how humans interact with computers.

Baidu, which opened its Silicon Valley AI Lab in 2014, is hoping to carve out a space for itself as a leader in speech recognition. So far, it’s making impressive headway. The company’s latest speech recognition engine, dubbed Deep Speech 2, uses deep learning to recognise words spoken in English and Mandarin, at times outperforming humans in the latter, according to Baidu.

"We can train this giant neural network that eventually learns to recognise speech on its own as well as a human can, and not spend so much of our time thinking about how words are structured," says Adam. "Instead, [we] can just ask the computer system to learn those things on its own."

The short answer to Baidu’s plan to conquer speech recognition is data — lots of it. Adam says Deep Speech 2 was trained on tens of thousands of hours of audio recordings. Some of it comes from public data, while another portion is from crowdsourcing services, such as Mechanical Turk, Amazon’s marketplace for odd jobs that require human intelligence.


Deep Speech 2 is an example of supervised learning, a type of machine learning that uses labelled training data – such as transcribed audio – to teach a system new skills, like recognising handwritten numbers. Without labelled training data, however, the neural network wouldn’t be able to differentiate right from wrong. This is an excerpt from an article published on TechInAsia. You can read the full story here

One subscription. Two world-class reads.

Already subscribed? Log in

Subscribe to read the full story →
*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%
*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Subscribe

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

  • Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

  • News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

  • Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

  • Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

  • In-depth market analysis & insights with access to The Smart Investor

Archives

  • Repository of articles and publications dating back to 1997

Ad-free Reading

  • Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

  • Access Business Standard across devices — mobile, tablet, or PC, via web or app

Next Story