Given a picture of a car, for instance, the new system developed by scientists at Disney Research and ETH Zurich in Switzerland can automatically return the sound of a car engine.
A system that knows the sound of a car, a splintering dish, or a slamming door might be used in a number of applications, such as adding sound effects to films, or giving audio feedback to people with visual disabilities, said Jean-Charles Bazin, associate research scientist at Disney.
To solve this challenging task, the research team leveraged data from collections of videos.
"Videos with audio tracks provide us with a natural way to learn correlations between sounds and images," Bazin said.
"Video cameras equipped with microphones capture synchronised audio and visual information. In principle, every video frame is a possible training example," he said.
One of the key challenges is that videos often contain many sounds that have nothing to do with the visual content.
"Sounds associated with a video image can be highly ambiguous," said Markus Gross, vice president for Disney Research.
"By figuring out a way to filter out these extraneous sounds, our research team has taken a big step towards an array of new applications for computer vision," said Gross.
"If we have a video collection of cars, the videos that contain actual car engine sounds will have audio features that recur across multiple videos," Bazin said.
"On the other hand, the uncorrelated sounds that some videos might contain generally won't share any redundant features with other videos, and thus can be filtered out," he said.
Subsequent testing showed that when presented an image, the proposed system often was able to suggest a suitable sound.
A user study found that the system consistently returned better results than one trained with the unfiltered original video collection, researchers said.
Disclaimer: No Business Standard Journalist was involved in creation of this content
You’ve reached your limit of {{free_limit}} free articles this month.
Subscribe now for unlimited access.
Already subscribed? Log in
Subscribe to read the full story →
Smart Quarterly
₹900
3 Months
₹300/Month
Smart Essential
₹2,700
1 Year
₹225/Month
Super Saver
₹3,900
2 Years
₹162/Month
Renews automatically, cancel anytime
Here’s what’s included in our digital subscription plans
Exclusive premium stories online
Over 30 premium stories daily, handpicked by our editors


Complimentary Access to The New York Times
News, Games, Cooking, Audio, Wirecutter & The Athletic
Business Standard Epaper
Digital replica of our daily newspaper — with options to read, save, and share


Curated Newsletters
Insights on markets, finance, politics, tech, and more delivered to your inbox
Market Analysis & Investment Insights
In-depth market analysis & insights with access to The Smart Investor


Archives
Repository of articles and publications dating back to 1997
Ad-free Reading
Uninterrupted reading experience with no advertisements


Seamless Access Across All Devices
Access Business Standard across devices — mobile, tablet, or PC, via web or app
