Open Source Data: Big Data for All

Think about it: Wouldn’t you love to know everything your cellphone knows?

I mean, not just the stuff about the universe—like the distance between Des Moines and Billings or the weather in Ulaanbaatar, Mongolia—but also information about you. Like the time you wake up in the morning, your movements around town, when and where you tend to get stuck in traffic, how much exercise you are getting, what you are eating, your online clickstream, social networking activities, communications, contacts, calendar and more. If only you could tap into this information, analyze it and draw useful conclusions, you could no doubt improve your effectiveness and quality of life.

Over the past decade, the privacy framework has become preoccupied with organizational data management processes grouped under the title “accountability.” While improving corporate governance and mitigating data security risks (no doubt admirable goals), accountability measures generate little benefit to individuals. Indeed, by treating organizations as trusted stewards of personal information, accountability cuts individuals out of the decisionmaking process. You want privacy? Walmart or Pfizer will take care of it for you.

In a new article, Big Data for All: Privacy and User Control in the Age of Analytics, which will be published in the Northwestern Journal of Technology and Intellectual Property, Jules Polonetsky, CIPP/US, and I try to refocus the privacy framework on individual empowerment. We argue that going forward, organizations should provide individuals with practical, easy-to-use access to their information, so they can become productive participants in the data economy. In addition, organizations should be transparent about the decisional criteria underlying their data processing activities, allowing individuals to challenge, or at the very least understand, how decisions about them are made.

First, we propose a “sharing the wealth” strategy premised on organizations providing individuals with access to their data in usable format. This, in turn, will spawn the development of user-side applications that analyze individuals’ data to draw useful conclusions (“take the I-90”; “eat more proteins”; “call Sally”). We call this the “featurization” of Big Data, taming the Big Data “beast”, which is currently used strictly for server-side surveillance and harnessing it for domestic use. The technological groundwork has already been set with mash-ups and real-time APIs making it easier for organizations to combine information from different sources and services into a single user experience. Much like open source software or creative commons licenses, free access to personal data is grounded in both efficiency and fairness rationales.

Second, we suggest that organizations should be more forthcoming about the criteria used in their data-driven choices. Data analysis machinery is increasingly used to make choices that affect individuals’ lives. Will you get credit or insurance? Which medical treatments suit you best? Which ad will you be shown? The machine, the thinking goes, is more impartial, objective and efficient than any individual referee.

Yet the machine is also heartless.

Consider Bettina Wulff, the thirty-something-year-old wife of former German President Christian Wulff. When you Google her name, the search engine’s autocomplete function adds terms like “escort” and “prostitute.” Wulff, who has sued Google for defamation, vehemently denies these machine-made charges.

Yet who can argue against the machine? Who do you believe—Ms. Wulff or the Google algorithm?

The machine also lacks moral judgment.

Consider the “pregnancy score” allocated by Target to shoppers in order to preempt competitors with early detection of mothers-to-be. Jules and I argue that as “objective” as the machine may be, it must be tempered by a human conscious. Individuals should not be judged by machines operating based on opaque criteria that are hidden behind veils of trade secrecy. We are entitled to know, challenge and debate the merits of data-driven decisions. And while we recognize the practical difficulties of mandating disclosure without compromising organizations’ “secret sauce”, we trust that a distinction can be drawn between proprietary algorithms, which would remain secret, and decisional criteria, which should be disclosed.

The Big Data economy generates immense benefits to organizations, individuals and society at large. In order to preserve the value of data while protecting privacy rights, individuals should be granted a voice as well as a piece of the action.

Author

Omer Tene IAPP Member Contributor

1 Comment

If you want to comment on this post, you need to login.

Marty Abrams • Feb 27, 2013

The “Dublin paper” from the Global Accountability Project list five essential elements for accountability to be effective. The fourth element is individual participation, and while the fifth covers enforcement and redress. Professor Tene is correct, accountability puts the burden on data users – responsible and answerable organizations - to understand the risks they create for others, and mitigate those risks. He is also right that without aggressive enforcement some organizations will game an accountability based system by either not identifying the risks or inadequately policing the risks they identify. I would suggest that all privacy governance systems are gameable, and accountability, with quality enforcement is actually harder to game than a system based on procedural requirements. The bottom-line is that putting the burden on individuals to understand big data processes well enough to police the market is putting an intolerable burden on individuals.
Yes individuals should be able to access the information that pertains to them. Yes they should have processes explained to them in a manner they can understand. Yes they should be able to object to data uses they find repugnant. But they should not have to police a big data world.
It is not either organizational responsibility, or individual empowerment. It is both. Accountability requires both.
Just this week the Centre for Information Policy Leadership released “Big Data and Analytics: Seeking Foundations for Effect Privacy Guidance.” In this new paper from the Centre ethical analytics project suggests a two phased approach to big data governance is suggested. This approach sits on a foundation of accountability.

Inside the EU AI Act negotiations: A conversation with Laura Caroli

For many who are following along with the EU AI Act negotiations, the road to a final agreement took many twists and turns, some unexpected. For Laura Caroli, this long, complicated road has been a lived experience. As the lead technical negotiator and policy advisor to AI Act co-rapporteur and MEP...

Read More Save This

Contracting for AI: Finding balance

The EU Artificial Intelligence Act was carefully crafted with a risk-based approach to regulating AI — heavily regulating high-risk AI models, lightly regulating generative AI models with systemic risk and leaving low-risk models alone. Not all purchasers of AI functionality have adopted this risk-...

Read More Save This

Luminos.AI wants to take on AI management woes

Andrew Burt watched clients increasingly look to bring artificial intelligence into their companies. But they kept running into the same barrier: A struggle to manage the risks and align the AI's use with their company's activities. So he decided to take what the boutique law firm he co-founded, Lu...

Read More Save This

Unlocking global data privacy interoperability with CBPRs

In our digitally connected world, safeguarding personal data is essential. The question is, how can government and business collaborate to achieve this critical goal? At present, 137 countries have privacy laws and regulations in place — covering more than three-quarters of the world population — w...

Read More Save This

A comprehensive guide to creating a sustainable cookie program

Cookie management — whether that means obtaining consent or otherwise — is required by most privacy and data protection laws. In Europe and elsewhere around the world, consent is required for the use of nonessential cookies, think tracking or analytics cookies, and in many U.S. states, residents now...

Read More Save This

Privacy Perspectives | Open Source Data: Big Data for All Related reading: Inside the EU AI Act negotiations: A conversation with Laura Caroli

Open Source Data: Big Data for All

Author

Tags

1 Comment

Tags

Recent Comments

Author

Tags

1 Comment

Related Stories

Inside the EU AI Act negotiations: A conversation with Laura Caroli

Contracting for AI: Finding balance

Luminos.AI wants to take on AI management woes

Unlocking global data privacy interoperability with CBPRs

A comprehensive guide to creating a sustainable cookie program

Related Stories

Tags

Recent Comments