Deanonymizing data

Anonymity is a big deal these days. And it should be, because the proliferation of personal computing devices with multiple radio interfaces places individual privacy in question. Consider the problem of de-anonymizing the Netflix database released for the Netflix prize project. A recent paper from U. Texas showed that the records released as part of the database were not anonymous at all and given a little bit of side information, allowed easy identification of individual records in a simple way.

It is clear that some information must be removed from a database or a set of trajectories in order to prevent re-identification. But WHAT part of the information to remove is not so clear! I am pretty sure information theory must have something useful to say about this problem. In particular, the Information Bottleneck Method of Tishby et al might be useful place to look for answers. Pending job for the summer.

Published in:  on May 16, 2008 at 6:08 pm Comments (1)
Tags: ,

The URI to TrackBack this entry is: http://alicebobandeve.wordpress.com/2008/05/16/deanonymizing-data/trackback/

RSS feed for comments on this post.

One Comment Leave a comment.

  1. Very interesting — I learned of this in a different context of information inference.

    But the anonymity question raises interesting points. Have to learn more about the comments you made. Reg. IT it has to be modeled properly and dynamic system modeling in IT is still not easy to do.

    Lalitha


Leave a Comment