This AI tool will spot spoilers for you

Image
Press Trust of India Los Angeles
Last Updated : Jul 09 2019 | 4:50 PM IST

Scientists have developed an artificial intelligence (AI) based system that can flag spoilers in online reviews of books and TV shows.

"Spoilers are everywhere on the internet, and are very common on social media. As internet users, we understand the pain of spoilers, and how they can ruin one's experience," said Ndapa Nakashole, a professor at the University of California San Diego in the US.

Some websites allow people to manually flag their posts with tags that serve as 'spoiler ahead' warning signs. However, this does not always happen.

Researchers wanted to develop an AI tool powered by neural networks to automatically detect spoilers. They named the tool SpoilerNet.

On a theoretical level, researchers want to better understand how people write spoilers and what kind of linguistic patterns and common knowledge mark a sentence as a spoiler.

The tool could be used to build a browser extension to shield people from spoilers, the researchers said.

To train and test SpoilerNet, the team went looking for large datasets of sentences containing spoilers. They found none.

They created their own by collecting more than 1.3 million book reviews annotated with spoiler tags by book reviewers.

The tags encompass sentences that include spoilers and hide them behind a "view spoiler" link in the text.

The reviews were collected from Goodreads, a social networking site that allows people to track what they read, and share thoughts and reviews with other readers.

"To our knowledge, this is the first dataset with spoiler annotations at this scale and at such a fine-grained granularity," said Mengting Wan, a PhD student in computer science at UC San Diego.

Researchers found that spoiler sentences tend to clump together in the latter part of reviews.

However, they also found that different users had different standards to tag spoilers, and neural networks needed to be carefully calibrated to take this into account.

In addition, the same word may have different semantic meanings in different contexts.

For example, 'green' is just a colour in one book review, but it can be the name of an important character and a signal for spoilers in another book.

Identifying and understanding these differences is challenging, Wan said.

Researchers trained SpoilerNet on 80 per cent of the reviews on Goodreads, running the text through several layers of neural networks. The system could detect spoilers with 89 to 92 per cent accuracy.

They also ran SpoilerNet on a dataset of more than 16,000 single-sentence reviews of about 880 TV shows. The accuracy of the tool to detect spoilers was 74 to 80 per cent.

Most of the errors came from the system getting distracted by words that are usually loaded and revelatory -- for example murder or killed.

Disclaimer: No Business Standard Journalist was involved in creation of this content

*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%
*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Subscribe

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

  • Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

  • News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

  • Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

  • Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

  • In-depth market analysis & insights with access to The Smart Investor

Archives

  • Repository of articles and publications dating back to 1997

Ad-free Reading

  • Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

  • Access Business Standard across devices — mobile, tablet, or PC, via web or app

More From This Section

First Published: Jul 09 2019 | 4:50 PM IST

Next Story