New lip-reading technology to catch inaudible audio

IANS London
Last Updated : Mar 25 2016 | 12:13 PM IST

Scientists from the University of East Anglia (UEA) have developed a new lip-reading technology that can help in solving crimes and provide communication assistance for people with hearing and speech impairments.

The visual speech recognition technology, created by Dr Helen L. Bear and Professor Richard Harvey, can be applied "any place where the audio isn't good enough to determine what people are saying."

Unique problems with determining speech arise when sound isn't available, such as on CCTV footage, or when the audio is inadequate and there are no clues to give the context of a conversation.

"We are still learning the science of visual speech and what it is people need to know to create a fool-proof recognition model for lip-reading, but this classification system improves upon previous lip-reading methods by using a novel training method for the classifiers," Dr Bear explained.

Potentially, a robust lip-reading system could be applied in a number of situations from criminal investigations to entertainment.

Lip-reading has been used to pinpoint words footballers have shouted in heated moments on the pitch, but is likely to be of most practical use in situations where there are high levels of noise, such as in cars or aircraft cockpits.

"Such a system could be adapted for use for a range of purposes like for people with hearing or speech impairments. Alternatively, a good lip-reading machine could be part of an audio-visual recognition system," Dr Bear added.

"Lip-reading is one of the most challenging problems in artificial intelligence, so it's great to make progress on one of the trickier aspects, which is how to train machines to recognise the appearance and shape of human lips," Harvey noted.

The findings were scheduled to be presented at the International Conference on Acoustics, Speech and Signal Processing (ICASSP) in Shanghai on Friday.

The paper was published in the journal Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing 2016.

First Published: Mar 25 2016 | 12:02 PM IST
