Researchers at Facebook have developed a quicker and more accurate way of translating low-resources languages like Urdu and Burmese using Artificial Intelligence, said a media report.
The breakthrough, which will be presented at Empirical Methods in Natural Language Processing or EMNLP, could prove to be important for Facebook, as the social media giant uses automatic language translation to help its users around the world to read posts in their preferred language, the Forbes reported.
The existing machine translation systems can achieve near human-level performance on some languages but they require access to parallel corpus vast quantities of the same sentences in different languages in order to learn, it said.
The team from the Facebook AI Research (FAIR) division were able to train a machine translation (MT) system by feeding it large pieces of different text in different languages from publicly available websites like Wikipedia.
The key thing to note is that these pieces of text were independent of one another.
When you have different pieces of text in different languages they're referred to as monolingual corpora, it said.
"Building a parallel corpus is complicated because you need to find people fluent in two languages to create it. For instance, if you wanted to build a parallel corpus of Portuguese/Nepali, you would need to find people fluent in these two languages, which would be very difficult," Antoine Bordes, a research scientist and the head of FAIR's Paris research lab, was quoted as saying in the report.
He said: "On the other side, building monolingual corpora Portuguese/Nepali is very easy: you just need to download webpages from Portuguese and from Nepali websites, it doesn't matter if they are not parallel sentences or if they talk about different things".
Most language translation computer systems use both monolingual corpora and parallel corpus to learn.
"The novelty in our approach is that we can train MT systems from monolingual corpora only, we don't need any parallel corpus. Potentially, given a book written in an alien language, we could use our model to translate it into English," Bordes said.
Disclaimer: No Business Standard Journalist was involved in creation of this content
You’ve reached your limit of {{free_limit}} free articles this month.
Subscribe now for unlimited access.
Already subscribed? Log in
Subscribe to read the full story →
Smart Quarterly
₹900
3 Months
₹300/Month
Smart Essential
₹2,700
1 Year
₹225/Month
Super Saver
₹3,900
2 Years
₹162/Month
Renews automatically, cancel anytime
Here’s what’s included in our digital subscription plans
Exclusive premium stories online
Over 30 premium stories daily, handpicked by our editors


Complimentary Access to The New York Times
News, Games, Cooking, Audio, Wirecutter & The Athletic
Business Standard Epaper
Digital replica of our daily newspaper — with options to read, save, and share


Curated Newsletters
Insights on markets, finance, politics, tech, and more delivered to your inbox
Market Analysis & Investment Insights
In-depth market analysis & insights with access to The Smart Investor


Archives
Repository of articles and publications dating back to 1997
Ad-free Reading
Uninterrupted reading experience with no advertisements


Seamless Access Across All Devices
Access Business Standard across devices — mobile, tablet, or PC, via web or app
