Microsoft releases Indian language 'Speech Corpus' for researchers

IANS Bengaluru
Last Updated : Sep 06 2018 | 3:35 PM IST

To help researchers and academia build Indian language speech recognition for all applications where speech is used, Microsoft India on Thursday launched its Indian language "Speech Corpus", offering speech training and test data for Telugu, Tamil and Gujarati.

This is the largest publicly available Indian language speech dataset, and it includes audio and corresponding transcripts, Microsoft said in a statement.

The Indian language "Speech Corpus" is provided through the Microsoft Research Open Data initiative, a collection of free datasets from Microsoft Research intended to advance research in areas such as natural language processing, computer vision, and domain-specific sciences.

"Microsoft Indian Language Speech Corpus is an extension of our on-going efforts to reduce language barriers and empower Indians to harness the full potential of the Internet," said Sundar Srinivasan, General Manager, Artificial Intelligence and Research, Microsoft India.

"Using our technology expertise, we want to accelerate innovation in voice-based computing for India by supporting researchers and academia," Srinivasan said.

Microsoft's Indian Language Speech Corpus was tested at the Interspeech 2018 conference in Hyderabad this month.

In a Low Resource Speech Recognition Challenge, participants used data from the Microsoft Indian Language Speech Corpus to build Automatic Speech Recognition (ASR) systems.

They were able to create high-quality speech recognition models using this data, thus validating the efficacy of the Corpus, Microsoft said.

Microsoft has been working with Indian languages for over two decades since the launch of Project Bhasha in 1998, allowing users to input localised text easily and quickly using the Indian Language Input tool.

--IANS

gb/sed

Disclaimer: No Business Standard Journalist was involved in creation of this content

First Published: Sep 06 2018 | 3:30 PM IST
