The future of AI infrastructure
Andrew Trask
Use Case 1: Global Trade
Use Case 2: Breast Cancer
Use Case 3: Money Laundering
Use Case 4: Algorithmic Accountability
Algorithmic Bias is a Sticky Issue
Algorithms often interact with private data
Algorithms increasingly drive decisions in society
Algorithms are often proprietary
Result: Algorithmic auditing is hard/rare
Harms of algorithms are often emergent from many algorithms at different orgs
Characteristics
How are we solving it?
Privacy Enhancing Technologies (PETs)
HOMOMORPHIC ENCRYPTION
K-ANONYMIZATION
SECURE ENCLAVES
FUNCTIONAL ENCRYPTION
ZERO-KNOWLEDGE PROOFS
SYNTHETIC DATA
DIFFERENTIAL PRIVACY
BLOCKCHAIN?
FEDERATED LEARNING
SECURE MULTI-PARTY COMPUTATION
What are PETs?
HOMOMORPHIC ENCRYPTION
K-ANONYMIZATION
SECURE ENCLAVES
FUNCTIONAL ENCRYPTION
ZERO-KNOWLEDGE PROOFS
SYNTHETIC DATA
DIFFERENTIAL PRIVACY
FEDERATED LEARNING
SECURE MULTI-PARTY COMPUTATION
Input Privacy
Output Privacy
What are PETs?
HOMOMORPHIC ENCRYPTION
K-ANONYMIZATION
SECURE ENCLAVES
FUNCTIONAL ENCRYPTION
ZERO-KNOWLEDGE PROOFS
SYNTHETIC DATA
DIFFERENTIAL PRIVACY
FEDERATED LEARNING
SECURE MULTI-PARTY COMPUTATION
Input Privacy
Output Privacy
ZERO-KNOWLEDGE PROOFS
CRYPTOGRAPHIC SIGNATURES
TRUST OVER IP INFRA
Input Verification
ACTIVE SECURITY
SECURE ENCLAVES
Output Verification
answer a question using data owned by someone else
PETS MAKE IT POSSIBLE TO:
In Another Country
In Another Org
In Another Dept.
answer a question using data owned by someone else
PETS MAKE IT POSSIBLE TO:
HOMOMORPHIC ENCRYPTION
K-ANONYMIZATION
SECURE ENCLAVES
FUNCTIONAL ENCRYPTION
ZERO-KNOWLEDGE PROOFS
SYNTHETIC DATA
DIFFERENTIAL PRIVACY
FEDERATED LEARNING
SECURE MULTI-PARTY COMPUTATION
This is the ability that matters!
These are just algorithms!
In Another Country
In Another Org
In Another Dept.
By analogy: everyone is working on car parts…
…but we don’t yet have a car.
What is the “car” of PETs?
KEY QUESTION:
The “car” of PETs
An Organisation’s
“Domain Server”
It’s like an Apache Web Server for private data
The “car” of PETs
An Organisation’s
“Domain Server”
Data Owner
The “car” of PETs
An Organisation’s
“Domain Server”
Data Owner
1
Loads private data into server
2
Creates an account for a data scientist
3
… goes and has a coffee…
…(or tea)
The “car” of PETs
An Organisation’s
“Domain Server”
Data Owner
1
Loads private data into server
2
Creates an account for a data scientist
3
… goes and has a coffee…
…(or tea)
Data Scientist
*********
+ Question Limitations
The “car” of PETs
An Organisation’s
“Domain Server”
Data Owner
1
Loads private data into server
2
Creates an account for a data scientist
3
… goes and has a coffee…
…(or tea)
bye!
*
The “car” of PETs
An Organisation’s
“Domain Server”
Data Scientist
*
The “car” of PETs
An Organisation’s
“Domain Server”
Data Scientist
*
1
Login to Domain Server
2
Get answers to allowed questions
3
Download Answers
The “car” of PETs
An Organisation’s
“Domain Server”
Data Scientist
*
1
Login to Domain Server
2
Create answers to allowed questions
3
Download Answers
Q
Q
Q
HOMOMORPHIC ENCRYPTION
K-ANONYMIZATION
SECURE ENCLAVES
FUNCTIONAL ENCRYPTION
ZERO-KNOWLEDGE PROOFS
SYNTHETIC DATA
DIFFERENTIAL PRIVACY
FEDERATED LEARNING
SECURE MULTI-PARTY COMPUTATION
This is the PETs part.
A
A
A
The “car” of PETs
An Organisation’s
“Domain Server”
Data Scientist
*
1
Login to Domain Server
2
Get answers to allowed questions
3
Download Answers
A
A
A
Notice what’s missing!
Bottom Line: answering questions using an org’s DS will be as easy as going to the organization’s public website
What are the “roads” of PETs?
KEY QUESTION:
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Domain Nodes
Network Nodes
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Science Project
Science Project
Science Project
Science Project
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Science Project
Science Project
Science Project
Science Project
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Science Project
Science Project
Science Project
Science Project
Remote Data Science via PETs
🇯🇵
🇨🇦
🇺🇸
🏴
🇫🇷
Massive
Federated Data�Networks
🇯🇵
🇨🇦
🇺🇸
🏴
🇫🇷
Data Scientist
🇯🇵
🇨🇦
🇺🇸
🏴
🇫🇷
Data Scientist
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Science Project
Science Project
Science Project
Science Project
Remote Data Science via PETs
How is OpenMined making concrete progress on this technical vision?
KEY QUESTION:
Let’s look closer at the tech!
NEXT TOPIC:
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Tool 1: Remote Execution
Pros:
Cons:
Top Contributors
Tool 1: Remote Execution
Tool 2: Search and Example Data
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 2: Search and Example Data
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 2: Search and Example Data
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 2: Search and Example Data
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 2: Search and Example Data
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 2: Search and Example Data
Pros:
Cons:
Top Contributors
Tool 2: Search and Example Data
Tool 3: Differential Privacy
Bob: 1
Bill: 0
Sue: 0
John: 1
Joe: 1
Pat: 0
Amy: 1
Alice: 0
Tool 3: Differential Privacy
Canonical DB
Tool 3: Differential Privacy
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 3: Differential Privacy
FEATURE IN DEVELOPMENT
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 3: Differential Privacy
FEATURE IN DEVELOPMENT
https://github.com/OpenMined/design-assets/blob/master/logos/PyGrid/horizontal-primary-trans.png
Tool 3: Differential Privacy
FEATURE IN DEVELOPMENT
Tool 3: Differential Privacy
FEATURE IN DEVELOPMENT
Top Contributors
Tool 4: Secure Multi-Party Computation
SHARE OWNERSHIP OF A NUMBER
Tool 4: Secure Multi-Party Computation
Tool 4: Secure Multi-Party Computation
2
3
5
Tool 4: Secure Multi-Party Computation
2
3
5
Tool 4: Secure Multi-Party Computation
5
2
3
Tool 4: Secure Multi-Party Computation
5
2
3
Tool 4: Secure Multi-Party Computation
5
2
3
Tool 4: Secure Multi-Party Computation
5
2
3
Tool 4: Secure Multi-Party Computation
5
2
3
X
2
4
X
2
6
Models and datasets are just
large collections of numbers
which we can encrypt
Tool 4: Secure Multi-Party Computation
5
2
3
X
2
4
X
2
6
10
Models and datasets are just
large collections of numbers
which we can encrypt
Tool 4: Secure Multi-Party Computation
Data Scientist
Network Server
Domain Server
Domain Server
Domain Server
Domain Server
Science Project
Science Project
Science Project
Science Project
Science Project
Remote Data Science via PETs
🇯🇵
🇨🇦
🇺🇸
🏴
🇫🇷
Data Scientist
Thank you!