What is big data anyway? There are many definitions. The simplest one I came across says that big data stands for information that goes beyond what could be squeezed into a large spreadsheet. Big data cannot be filled into a spreadsheet simply because it is no longer numbers. It includes words, visuals, pictures, videos and more. Add to that the enormous amount of information we share on social media platforms, Google searches, blog posts, online reviews, personal photographs, videos etc and you have the big mass that is big data, I was told.
At a recent seminar hosted by IIM Calcutta Alumni Association Mumbai Chapter, Professor Ram gave us a new definition. She should know; she is Anheuser-Busch Endowed Professor of MIS, and Entrepreneurship & Innovation in the Eller College of Management at the University of Arizona. In her scheme of things, big data needs to be seen through just two simple lenses. One, big data has led to ‘datafication’ of what till now that was not in the realm of data. Take for example the smart watch you are wearing. By merely wearing it you are able to convert a lot of signals from your body that has always existed into data. You can count the steps you took yesterday, measure your pulse rate, and even take real time measurement of your blood pressure.
Two, big data is so defined because it comes with “time and geo stamp”. We are collecting a lot of data that is not just data but it comes with a clear marker about when it was collected and where it was collected. Take real time traffic measurement that is happening on Google maps. The data is getting collected real time with clear geographical tags (Google has some catching up to do on predicting travel times in Indian cities like Mumbai; for that it probably needs Big Big Data that goes into the realm of divinity).
The question is, how do you use all this big data. Professor Ram shared two interesting examples with us.
Professor Ram’s team had at its disposal the data from student identity cards (smart cards) which were used for entry into the mess halls, library, dorms, class room buildings etc. This data was being collected live, real time by the university. By mapping card transactions that occur very near in time and at the same location, researchers could make inferences about a student’s implicit friends group and social networks. They could also build a model around the regularity of the students activities, in an anonymous fashion. According to Ram the model that was build was able to predict at the end of the first 12 weeks the potential for a student to drop out to the extent of 85-90 per cent accuracy. By providing selective help, the University was able to hit a retention rate of 86.5 per cent, the highest in its history. While questions of privacy remain, here is a case where big data was used to help a very vulnerable cohort manage their lives better.
As you would have learnt by now, big data is not just about Big Brother watching you. When used sensibly, it can save lives and help build better future citizens. One human bit at a time.
The author is an independent brand strategist, author, and founder Brand-Building.com. Email: ambimgp@brand-building.com