On the True Risks of
Poisoning Attacks on Federated Learning
Amir Houmansadr
[Figure: an FL system, with a central server and many clients]
The poisoning adversary owns/controls multiple compromised clients, and leverages them to corrupt the global model by sending malicious model updates.
Lack of trust is FL’s secret sauce, but also FL’s Achilles’ heel: the subject of this talk!
Outline
Types of FL Poisoning: Goal
Targeted Attacks
Misclassify a small set of inputs; these inputs need not have any specific properties.
Correctly classify all the other inputs.
Backdoor Attacks – Semantic Backdoors
Misclassify a small set of inputs; these inputs should have specific properties naturally present inside them.
Correctly classify all the other inputs.
Backdoor Attacks – Artificial Backdoors
Misclassify any input when modified in a certain fashion; these inputs carry a specific backdoor trigger that is manually added to them.
Correctly classify all the other inputs.
Untargeted Attacks
Degrade classification performance on arbitrary inputs, without modifying the inputs.
Three Classes of FL Poisoning
Which one should we care about more for production FL?
Untargeted Poisoning Impacts More Participants
Untargeted: affects the entire sample space, and hence all FL clients!
Targeted/backdoor: affects only a tiny fraction of the sample space, and hence does not hurt most FL clients.
Outsized impact factor (OIF) = (# of successful backdoors) / (# of adversary’s data points)
Previous work: OIF of 0.002 to 0.02 (attacks are ineffective for larger OIFs)
Untargeted Poisoning Is More Difficult to Detect
From the server’s point of view, the attack reduces overall model accuracy by only small, non-suspicious percentages, and so remains undetected.
Still, it can impact all samples (and hence all FL clients) to large extents, through misclassifications and confidence reduction on individual samples.
[Figure: model accuracy and model confidence, benign vs. poisoned]
Untargeted Poisoning Is More Challenging to Carry Out
Targeted/backdoor attacks need to manipulate only a few test inputs that are already vulnerable or are made vulnerable (e.g., by adding a patch).
Untargeted attacks need to manipulate all the test inputs, which are not altered and hence not vulnerable in any way.
[Figure: model confidence across test inputs for the two attack classes]
Types of FL Poisoning: Mechanism
Adversary’s Capabilities
Model poisoning: requires a break-in to access the model-training parts of the device
Data poisoning: no break-ins required
Systematization of FL Poisoning
Adversary’s Knowledge of the Global Model
Whitebox access
Nobox access
Mode of Poisoning Attack
Online attack
Offline attack
Outline
Defending Against Poisoning in FL
Key idea of defenses: the server aggregates updates from clients
- to attenuate malicious updates
- with minimal impact on model utility
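The aggregation idea can be sketched with a simple robust AGR. The coordinate-wise median below is one common example (my choice for illustration, not the specific defense from this talk): it attenuates a boosted malicious update while barely moving the aggregate of the benign ones.

```python
import numpy as np

def median_agr(updates):
    # updates: list of (num_params,) client updates.
    # Coordinate-wise median is a robust alternative to the plain mean.
    return np.median(np.stack(updates), axis=0)

benign = [np.array([1.0, -0.5]), np.array([1.1, -0.4]), np.array([0.9, -0.6])]
malicious = np.array([50.0, 50.0])   # boosted update from a compromised client
# The median stays close to the benign updates despite the outlier.
print(median_agr(benign + [malicious]))
```

With a plain mean, the single malicious client would shift the aggregate by more than 12 in every coordinate; the median keeps it near the benign cluster.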
The Goal of Poisoning
High-level Intuition of Poisoning Attacks
Find malicious updates, in the space of possible updates, which maximize the distance between the benign and malicious aggregates.
Attacks are tailored to a given AGR: the constraints of the AGR decide the space of benign updates.
[Figure: benign updates, the malicious update, and their aggregates in the space of updates for a specific AGR]
Our (Untargeted) Model Poisoning
Exploit: access to the global model
Intuition 1: Increase the loss on the benign training data via stochastic gradient ascent (instead of descent)
Intuition 2: Scale the update to circumvent detection by the given AGR
V. Shejwalkar, A. Houmansadr. Manipulating the Byzantine: Optimizing Model Poisoning Attacks and Defenses for Federated Learning, NDSS 2021
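A minimal sketch of the two intuitions, under simplifying assumptions (a logistic-regression client and a norm-bounding AGR; the paper's actual attack optimizes against each AGR specifically):

```python
import numpy as np

def benign_gradient(w, X, y):
    # Gradient of the logistic loss w.r.t. weights w.
    p = 1.0 / (1.0 + np.exp(-X @ w))
    return X.T @ (p - y) / len(y)

def malicious_update(w_global, X, y, lr=0.1, norm_bound=0.01):
    # Intuition 1: step along +gradient (ascent) to *increase* the benign loss.
    update = lr * benign_gradient(w_global, X, y)
    # Intuition 2: scale the update down so a norm-bounding AGR accepts it.
    n = np.linalg.norm(update)
    if n > norm_bound:
        update *= norm_bound / n
    return update

rng = np.random.default_rng(0)
X = rng.normal(size=(32, 5)); y = (X[:, 0] > 0).astype(float)
upd = malicious_update(np.zeros(5), X, y)
print(np.linalg.norm(upd))   # within the assumed norm bound of 0.01
```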
Our (Untargeted) Data Poisoning
Exploit: the server has no visibility into the data on client devices
Intuition 1: Updates computed using more mislabeled data have higher losses and larger norms
Mislabeling strategies: static label flip (SLF) and dynamic label flip (DLF)
Intuition 2: Adjust the size of the mislabeled data to circumvent the given AGR
V. Shejwalkar, A. Houmansadr, P. Kairouz, and D. Ramage. Back to the Drawing Board: A Critical Evaluation of Poisoning Attacks on Federated Learning, https://arxiv.org/pdf/2108.10241.pdf
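The two mislabeling strategies can be sketched as follows; the names SLF/DLF come from the talk, but the concrete flipping rules below are illustrative assumptions rather than the paper's exact definitions:

```python
import numpy as np

def static_label_flip(y, num_classes):
    # SLF: flip each label to a fixed target class, independent of the model
    # (here, the "mirror" class num_classes - 1 - y).
    return (num_classes - 1) - y

def dynamic_label_flip(logits):
    # DLF: flip each label to the class the current global model scores
    # lowest, so training on it increases the loss as much as possible.
    return np.argmin(logits, axis=1)

y = np.array([0, 1, 2, 9])
print(static_label_flip(y, 10))                      # [9 8 7 0]
logits = np.array([[0.1, 0.7, 0.2],
                   [0.5, 0.3, 0.2]])
print(dynamic_label_flip(logits))                    # [0 2]
```

Note that DLF needs (at least approximate) model predictions, while SLF needs nothing beyond the data itself, matching the talk's nobox data-poisoning setting.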
A Peek at the Performance
25% reduction in accuracy!
Performance depends on various factors, e.g., the ratio of malicious clients, the AGR technique, the FL algorithm, the number of rounds, etc.
Outline
High-level Intuition of Poisoning Attacks (recap)
Attacks find malicious updates, in the space of updates acceptable to a given AGR, which maximize the distance between the benign and malicious aggregates.
Promising defense strategy: shrink the space of acceptable updates (with minimal impact on model accuracy) to reduce the adversary’s choices, e.g., reduce the dimension of updates.
Not all space reduction is equal!
A shrunken space that excludes many benign updates hurts utility.
A shrunken space that still includes many malicious updates is not robust.
Federated Supermask Learning (FSL)
H. Mozaffari, V. Shejwalkar, and A. Houmansadr, FSL: Federated Supermask Learning, https://arxiv.org/abs/2110.04350
Key Technique: Supermasks
Lottery Ticket Hypothesis [Frankle et al. ICLR 2019]: a randomly initialized dense network contains subnetworks that, trained in isolation, can match the accuracy of the full network.
Supermask [Zhou et al. NeurIPS 2019]: a binary mask that, applied to a network’s random weights without training those weights at all, already yields good accuracy.
Server Communications
Client Computations (each round)
Client Updates (each round)
Server Aggregation (each round)
Overview of FSL
Instead of weight training, we ask the clients to rank the edges of a random neural network.
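The ranking idea above can be sketched roughly as follows. This is a simplified assumption of how rank aggregation could work (a rank-sum vote followed by top-k selection), not the exact FSL algorithm; see the paper for the real protocol:

```python
import numpy as np

def aggregate_rankings(rankings, k):
    # rankings: (num_clients, num_edges) array, where a higher rank means a
    # client considers that edge of the shared random network more important.
    scores = rankings.sum(axis=0)            # vote by summing ranks
    top = np.argsort(scores)[-k:]            # indices of the k best edges
    mask = np.zeros(rankings.shape[1], dtype=int)
    mask[top] = 1                            # global supermask: keep top-k edges
    return mask

rankings = np.array([[3, 1, 2, 0],
                     [3, 2, 1, 0],
                     [2, 3, 1, 0]])
print(aggregate_rankings(rankings, k=2))     # [1 1 0 0]: edges 0 and 1 survive
```

Because clients exchange rankings of a fixed random network instead of real-valued weights, the space an adversary can perturb is far smaller, which is the robustness argument of the following slides.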
FSL’s Communication Costs
FSL’s Robustness
Superior robustness thanks to a smaller update space.
Check the paper for bounds on robustness.
Takeaways From FSL
Outline
Types of FL Poisoning
Which type is the most relevant to production FL?
What is Production FL Anyway?
Cross-device FL: very large numbers of (e.g., mobile) devices, with only a small fraction of clients participating in each round
Cross-silo FL: a small number of organizations (silos), typically all participating in each round
The Gap Between Theory and Practice
Unrealistic percentages of compromised clients: existing works use unrealistic ranges while evaluating their attacks (and defenses).
Not all combinations of threat models are practical!
Practicality of Threat Models
Cross-silo FL + Model poisoning
Cross-silo FL + Data poisoning
Nobox Online Data poisoning
Nobox Offline Data poisoning
Experimental Setup
Attack Impact: Reduction in accuracy due to the attack, compared to the FL setting without any compromised clients
Existing Attacks Are Not Quite Impactful!
Across the evaluated settings, the attacks have no impact: even simple, low-cost robust AGRs are enough to protect production FL against untargeted poisoning.
Simple Countermeasures May Be Enough!
Enforcing a limit on the size of the dataset contributed by each client can act as a highly effective (yet simple) defense against data poisoning: no impact even with 10% compromised clients.
Evaluating Non-Robust FL
At practical percentages of compromised clients, for both model and data poisoning, cross-device FL with the (naive) Average AGR converges with high accuracy, i.e., it is highly robust to poisoning attacks.
Robustness Over Time
The robustness of AGRs persists even when compromised clients consistently poison cross-device FL for a large number of rounds.
Evaluating Robust FL
Norm-bounding is more robust
Understanding the robustness of AGRs in production FL requires a thorough empirical assessment of AGRs, on top of theoretical analysis.
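For reference, norm-bounding is a very simple AGR: clip every client update to a fixed L2 norm, then average. A minimal sketch (the bound value is an assumption for illustration):

```python
import numpy as np

def norm_bounding_agr(updates, bound=0.5):
    # Clip each client update to L2 norm <= bound, then average.
    clipped = []
    for u in updates:
        n = np.linalg.norm(u)
        clipped.append(u * min(1.0, bound / n) if n > 0 else u)
    return np.mean(clipped, axis=0)

benign = [np.array([0.1, 0.2]), np.array([0.2, 0.1])]
huge_malicious = np.array([100.0, -100.0])   # boosted malicious update
agg = norm_bounding_agr(benign + [huge_malicious], bound=0.5)
# By convexity, the aggregate's norm can never exceed the bound,
# so a boosted update cannot dominate the average.
print(np.linalg.norm(agg) <= 0.5)            # True
```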
Evaluating Cross-silo FL
Model poisoning is not practical in cross-silo FL.
State-of-the-art data poisoning attacks have no impact, even with 10% compromised clients and the non-robust Average AGR.
Summary
Related Papers