CSE 163
Privacy
��Hunter Schafer
Anonymous Data Isn’t
Mid 1990s, insurance group in Massachusetts published anonymous records of hospital visits with attributes like name, address, social security removed but left in demographic information.
Turns out this data release was not so anonymous!
Sweeney estimates 87% of the US is uniquely identified by knowing 1) date of birth, 2) sex, and 3) zip code.
2
k-anonymity
K-anonymity: A first definition of privacy by Sweeney that requires every query results in at least k people in the dataset.
Weakness: Fails under composition
3
Differential Privacy
A stronger notion of privacy that guarantees how much information you can learn about a person.
Consider two worlds, one where A participates in a study and one where they don’t. If results of the study are similar, we say it respects differential privacy.
4
Differential Privacy
Say an algorithm or analysis is 𝜀-differentially private if results with or without any single person in the dataset are “at most 𝜀” off.
Two methods for commonly achieving 𝜀-differential privacy
5
Jittering
Take result of analysis and add a small amount of random noise to result.
Specifically if you add noise that follows a Laplace distribution with parameter 𝜀, you can achieve 𝜀-differential privacy.
6
Randomized Response
What if we don’t trust the data collector with our data?
�Change the differential privacy mechanism to be done locally rather than centrally!
Differentially Private Polling Procedure:
�Key idea: Can learn aggregate trends without knowing true result of individual
7
Randomized Response Analysis
Key property: People tell the truth ¾ of the time and lie ¼ of the time. ½ of the time they are honest, and then half of the time they tell us a random answer that lines up with the truth.
To see why this work, suppose we know the answer is “Yes” for ⅓ of people. How many “Yes” responses would we expect in this procedure?
In general, work backwards to solve for underlying probability
8
Group Work:
Best Practices
When you first working with this group:
Tips:
9