Q&A with Christian Rudder, cofounder of OkCupid and author of Dataclysm As more of our social interaction happens on social media, how much can researchers learn about us from our online interactions? Well, they can only learn what we tell them, but in the age of Facebook and Google, thats become pretty much everything. To the extent that friendship, anger, sex, love, and whatever else happen online, we can investigate them. Your search history tells us what kind of jokes you like. Your Facebook network reveals not just your friendships, but in some cases the state of your marriage. Your preferences on OkCupid tell us what you find sexy, and your reaction to the strangers the site offers up tells us how you judge people. The articles you like tell us not just about your politics, but even predict your intelligence. You fold in data points like these for millions and millions of people, and you start to get a whole new picture of humankind. In Dataclysm youre taking this flood of information and putting it to an entirely new use: understanding human nature. So what have you found? I tried really hard to avoid the numerical dog and pony show. There are of course lots of interesting one-off factoids, but I mostly found what I (and probably you) have always known: that people are gentle, mean, stupid, lusty, lonely, kind, foolish, shrewd, shallow, and endlessly complex. Dataclysms central idea isnt necessarily what we can see using big data; its the fact of the vision itself. That we can get real data on even the most private moments in peoples lives is an astounding thing. Its like the second advent of reality television, but this time without the television part. Just the reality. Are you worried about any of this? I have mixed feelings about the implications. I myself almost never tweet, post, or share anything about my personal life. At the same time, Ive just spent three years writing about how interesting all this data is, and I cofounded OkCupid. My hope is that this ambivalence makes me a trustworthy guide through the thicket of technology and data. I admire the knowledge that social data can bring us; I also fear the consequences. You have a lot to say about race in the book, and you use data to shed light on the many ways it affects the way we interact with one another. What surprised you about your research in this area? Did you find anything unsurprising? The data on race was surprising only in its stubborn predictabilityfor all the glitzy technology, the results couldve been from the 1950s. I grew up in Little Rock and graduated from Central High, the first school in the South to be integrated: Eisenhower, the National Guard, mobs of white people screaming at nine black children, thats Central. The school embraces its history and is now over half black. Im no brave crusader, but race (and racism) were part of my education. So when, in researching the book, I unpacked three separate databases and found that in every one white people gave black people short-shrift, I wasnt shocked, you know? Asians and Latinos apply the same penalty to African Americans that white folks do, which says something about how even (relatively) recent additions to the American experience have acquired its biases. What makes this moment in timeand this set of datadifferent from the massive data surveys of the past, such as Pew, Gallup, or the Kinsey Institute? The data in my book is almost all passively observedtheres no questionnaire, no contrived experiment to simulate real life. This data is real life. Online you have friends, lovers, enemies, and intense moments of truth without a thought for whos watching, because ostensibly no one isexcept of course the computers recording it all. This is how digital data circumvents that old research obstacle: peoples inability to be honest when the truth makes them look bad. Digital datas ability to get at the private mind like this is unprecedented and very powerful.
Cited By
- Kilic O and Tsang A Catfished! Impacts of Strategic Misrepresentation in Online Dating Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, (1011-1019)
- Hutson J, Taft J, Barocas S and Levy K (2018). Debiasing Desire, Proceedings of the ACM on Human-Computer Interaction, 2:CSCW, (1-18), Online publication date: 1-Nov-2018.
- Tyson G, Perta V, Haddadi H and Seto M A first look at user activity on Tinder Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, (461-466)
- Elsden C, Nissen B, Garbett A, Chatting D, Kirk D and Vines J Metadating Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, (685-698)
- Masden C and Edwards W Understanding the Role of Community in Online Dating Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, (535-544)
Index Terms
- Dataclysm: Who We Are (When We Think No One's Looking)