Researchers have developed a new tool that unveils which data in a web account, such as emails, searches, or viewed products, are being used to target which outputs, such as ads, recommended products, or prices.
Roxana Geambasu and Augustin Chaintreau, both assistant professors of computer science at Columbia Engineering, along with PhD student Mathias Lecuyer created the tool called XRay to understand how personal data is being used on web services like google, amazon, Facebook and youtube.
"Today we have a problem: the web is not transparent. We see XRay as an important first step in exposing how websites are using your personal data," said Geambasu.
While harnessing big data can certainly improve our daily lives, these beneficial uses have also generated a big data frenzy, with web services aggressively pursuing new ways to acquire and commercialise the information.
"It's critical, now more than ever, to reconcile our privacy needs with the exponential progress in leveraging this big data," said Chaintreau.
"If we leave it unchecked, big data's exciting potential could become a breeding ground for data abuses, privacy vulnerabilities, and unfair or deceptive business practices," Geambasu added.
For example, one can use the XRay prototype to study why a user might be shown a specific ad in Gmail. Geambasu and Chaintreau found, for example, that a Gmail user who sees ads about various forms of spiritualism might have received them because he or she sent an email message about depression.
"The theoretical results were encouraging, but seemed too good to be true. So we tested XRay in actual situations, learning from experiments we ran on Gmail, Amazon, and YouTube, and refining the design multiple times.
The current XRay system works with Gmail, Amazon, and YouTube. However, XRay's core functions are service-agnostic and easy to instantiate for new services, and they can track data within and across services.
The key idea in XRay is to use black-box correlation of data inputs and outputs to detect data use.
To assess XRay's practical value, the researchers created an XRay-based demo service that continuously collects and diagnoses Gmail ads related to a set of topics, including various diseases, pregnancy, race, sexual orientation, divorce, debt, etc.
You’ve reached your limit of {{free_limit}} free articles this month.
Subscribe now for unlimited access.
Already subscribed? Log in
Subscribe to read the full story →
Smart Quarterly
₹900
3 Months
₹300/Month
Smart Essential
₹2,700
1 Year
₹225/Month
Super Saver
₹3,900
2 Years
₹162/Month
Renews automatically, cancel anytime
Here’s what’s included in our digital subscription plans
Exclusive premium stories online
Over 30 premium stories daily, handpicked by our editors


Complimentary Access to The New York Times
News, Games, Cooking, Audio, Wirecutter & The Athletic
Business Standard Epaper
Digital replica of our daily newspaper — with options to read, save, and share


Curated Newsletters
Insights on markets, finance, politics, tech, and more delivered to your inbox
Market Analysis & Investment Insights
In-depth market analysis & insights with access to The Smart Investor


Archives
Repository of articles and publications dating back to 1997
Ad-free Reading
Uninterrupted reading experience with no advertisements


Seamless Access Across All Devices
Access Business Standard across devices — mobile, tablet, or PC, via web or app
