Privacy Accounting and Quality Control in the Sage Differentially Private ML Platform
11/01
1
With presentation slides adapted from this
2
3
4
Protect Models: Differential Privacy
5
Protect Models: Differential Privacy
6
Protect Models: Differential Privacy
7
Practical challenges for growing database:��- Running out of budgets?�- Balancing privacy-utility?
8
Sage
Key Ideas
9
Sage Overview
Enforces global privacy budgets
Access control: assign blocks to training iterations
Training:
10
Key Ideas
11
Block Composition
12
Block Composition
13
Model 1
Model 2
Block Composition
14
Model 1
Model 2
Block Composition
15
Model 1
Model 2
Model 3
Block Composition
16
Model 1
Model 2
Model 3
X
X
Block Composition
Cap on max global privacy loss
| PrivacyLoss(stream) | ⩽ maxk | PrivacyLoss(Dk) |
17
Model 1
Model 2
Model 3
X
X
Block Composition
Cap on max global privacy loss
| PrivacyLoss(stream) | ⩽ maxk | PrivacyLoss(Dk) |
New blocks generated with zero privacy loss
18
Model 1
Model 2
Model 3
X
X
Key Ideas
19
Iterative Training
20
Iterative Training
21
Sage Access Control
Iterative Training - Validation
22
Sage Access Control
Evaluation
23
Benefits of block composition
24
Iterative training and DP-aware validation
25
Non DP | UC DP | Sage |
0.2% | 1.7% | 0.3% |
Failure rate at 1% prob. (η=0.01)
Continuous operation of ML pipeline
26
Discussion Questions
27