Credit-card transactions can give away your identity!

Press Trust of India Washington
Last Updated : Feb 02 2015 | 1:35 PM IST
MIT scientists, including one of Indian origin, have examined anonymous credit card data and found that most individuals can be identified using just the dates and locations of four purchases.
Researchers at Massachusetts Institute of Technology found that just four fairly vague pieces of information - the dates and locations of four purchases - are enough to identify 90 per cent of the people in a data set recording three months of credit-card transactions by 1.1 million users.
When the researchers also considered coarse-grained information about the prices of purchases, just three data points were enough to identify an even larger percentage of people in the data set.
That means that someone with copies of just three of your recent receipts - or one receipt, one Instagram photo of you having coffee with friends, and one tweet about the phone you just bought - would have a 94 per cent chance of extracting your credit card records from those of a million other people, researchers said.
This is true, the researchers said, even in cases where no one in the data set is identified by name, address, credit card number, or anything else that we typically think of as personal information.
"If we show it with a couple of data sets, then it's more likely to be true in general," said Yves-Alexandre de Montjoye, an MIT graduate student in media arts and sciences, and first author of the study.
De Montjoye worked on the study with his advisor, Alex 'Sandy' Pentland, the Toshiba Professor of Media Arts and Sciences; Vivek Singh, a former postdoc in Pentland's group who is now an assistant professor at Rutgers University; and Laura Radaelli, a postdoc at Tel Aviv University.
The data set the researchers analysed included the names and locations of the shops at which purchases took place, the days on which they took place, and the purchase amounts.
Purchases made with the same credit card were all tagged with the same random identification number.
For each identification number - each customer in the data set - the researchers selected purchases at random, then determined how many other customers' purchase histories contained the same data points.
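The procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the researchers' actual code: the function name `unicity`, the `(customer_id, shop, day)` tuple layout, and the choice to skip customers with fewer than `k` purchases are all hypothetical.

```python
import random
from collections import defaultdict


def unicity(transactions, k, seed=0):
    """Estimate the share of customers singled out by k random purchases.

    transactions: iterable of (customer_id, shop, day) tuples.
    This layout is an assumption made for illustration only.
    """
    rng = random.Random(seed)

    # Group each customer's purchases into a set of (shop, day) points.
    history = defaultdict(set)
    for customer_id, shop, day in transactions:
        history[customer_id].add((shop, day))

    # Only customers with at least k distinct points can be sampled.
    eligible = [c for c, points in history.items() if len(points) >= k]
    if not eligible:
        return 0.0

    unique = 0
    for customer in eligible:
        # Pick k of this customer's purchases at random.
        sample = set(rng.sample(sorted(history[customer]), k))
        # Count every history (the customer's own included) that
        # contains all k sampled points.
        matches = sum(1 for points in history.values() if sample <= points)
        if matches == 1:
            unique += 1

    return unique / len(eligible)
```

Calling `unicity(records, 4)` on a list of such tuples would return the fraction of customers whose four sampled purchases match no other customer's history, which is the kind of figure the study reports. Adding a coarse price bucket to each point would be a natural extension, again as an assumption rather than the study's exact method.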
In separate analyses, the researchers varied the number of data points per customer from two to five. Without price information, two data points were still sufficient to identify more than 40 per cent of the people in the data set.
At the other extreme, five points with price information were enough to identify almost everyone, researchers found.

First Published: Feb 02 2015 | 1:35 PM IST
