Business Standard

Microsoft develops first human-like speech recognition system

The milestone means that, for the first time, a computer can recognise the words in a conversation as well as a person would

Press Trust of India  |  Washington 

In a major breakthrough in speech recognition, researchers at Microsoft claim to have developed the first system that recognises the words in a conversation as well as humans do.

A team of researchers and engineers in Microsoft Artificial Intelligence and Research created a speech recognition system that makes the same or fewer errors than professional transcriptionists.

They reported a word error rate (WER) of 5.9 per cent, down from the 6.3 per cent WER the team reported just last month.

The 5.9 per cent error rate is about equal to that of people who were asked to transcribe the same conversation, and it is the lowest ever recorded against the industry-standard Switchboard task.

"We've reached human parity. This is a historic achievement," Xuedong Huang, the company's chief speech scientist, said in a blog post.

The milestone means that, for the first time, a computer can recognise the words in a conversation as well as a person would.

In doing so, the team beat a goal they set less than a year ago - and greatly exceeded everyone else's expectations as well.

The research milestone comes after decades of research in speech recognition, beginning in the early 1970s with DARPA, the US agency tasked with making technology breakthroughs.

Over the decades, most major companies and many research organisations joined in the pursuit.

"This accomplishment is the culmination of over twenty years of effort," said Geoffrey Zweig, who manages the Speech and Dialogue research group.

The milestone will have broad implications for consumer and business products that can be significantly augmented by speech recognition.

That includes consumer entertainment devices like the Xbox, accessibility tools such as instant speech-to-text transcription and personal digital assistants such as Cortana.

"This will make Cortana more powerful, making a truly intelligent assistant possible," said Harry Shum, who heads the Microsoft Artificial Intelligence and Research group.

The research milestone does not mean the computer recognised every word perfectly. In fact, humans do not do that, either.

Instead, it means that the error rate - or the rate at which the computer misheard a word like "have" for "is" or "a" for "the" - is the same as you would expect from a person hearing the same conversation.
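
To make the error-rate figure concrete, here is a minimal sketch of how a word error rate is conventionally computed: the number of word substitutions, deletions and insertions needed to turn the system's transcript into the reference transcript, divided by the number of words in the reference. The function name and the example sentences are invented for illustration, and this is not the scoring tool the researchers used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical transcripts: "have" is misheard as "is" and "a" as "the".
reference = "we have a plan for the week"
hypothesis = "we is the plan for the week"
print(f"WER: {word_error_rate(reference, hypothesis):.1%}")
```

In this made-up example two of the seven reference words are wrong, so the rate is two in seven. A rate of 5.9 per cent corresponds to roughly six such errors per hundred reference words.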

Zweig attributed the accomplishment to the systematic use of the latest neural network technology in all aspects of the system.

The push that got the researchers over the top was the use of neural language models in which words are represented as continuous vectors in space, and words like "fast" and "quick" are close together.

"This lets the models generalise very well from word to word," Zweig said.

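The idea of words as nearby vectors can be illustrated with a small sketch. The three-dimensional vectors below are made-up values chosen for illustration, not output from any trained model, and real systems use far more dimensions. Cosine similarity is one common way to measure how close two word vectors are.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 1.0 means same direction."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Toy vectors (assumed values, for illustration only).
toy_vectors = {
    "fast": [0.90, 0.80, 0.10],
    "quick": [0.85, 0.75, 0.20],
    "table": [0.10, 0.20, 0.95],
}

fast_quick = cosine_similarity(toy_vectors["fast"], toy_vectors["quick"])
fast_table = cosine_similarity(toy_vectors["fast"], toy_vectors["table"])

print(f"fast vs quick: {fast_quick:.3f}")
print(f"fast vs table: {fast_table:.3f}")
```

With these toy values, "fast" and "quick" score much closer to each other than "fast" and "table" do, which is the property that lets a language model treat what it has learned about one word as evidence about similar words.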