1 of 23

Federated Learning Benchmark for Domain Generalization

athul-s.raj@polymtl.ca

marc-antoine.provost@umontreal.ca

Quebec

Artificial

Intelligence

Institute

2 of 23

Agenda

Introduction

Federated Learning
Properties of FL
OOD in FL

Data heterogeneity in FL setting

Objective
Related work

DomainBed
Existing OOD gen algos in FL-GMA, FED-IRM

Federated datasets
FeDomainBed
Results
Discussion
Questions

2

3 of 23

Agenda

Introduction
Objective
Related work
Background
Results
Discussion

3

4 of 23

Machine Learning as we know it

Computer

Model

Data

4

5 of 23

Centralized Machine learning

Data

5

6 of 23

Is there a problem?

6

7 of 23

It’s a lot worse..

7

8 of 23

Can we solve this?

Perhaps, share the model..?

And not data!

8

9 of 23

Federated Learning!

Model

Repeat!

9

10 of 23

10

11 of 23

FedAVG

Average the weights

Server model

Client models

11

12 of 23

Homogenous data distribution a.k.a. IID

Average the weights

FedAVG

Generalization problem

Personalization problem

12

13 of 23

non-IID data distribution

Average the weights

FedAVG

Generalization problem

Personalization problem

Non-IID setting takes longer

13

14 of 23

Objective

To understand the performance of OOD gen algorithms in a non-i.i.d. federated learning setting when the algorithms are used for client model training.

14

15 of 23

DomainBed

DomainBed is a PyTorch suite containing benchmark datasets and algorithms for domain generalization, as introduced in In Search of Lost Domain Generalization.
Github: https://github.com/facebookresearch/DomainBed

15

16 of 23

Existing OODGen in FL

FedGMA

OOD Gen on server aggregation of models

CausalFed-IRM

Split federated learning setting - also, OOD gen at the server

16

17 of 23

Federated Datasets

Fed-CMNIST

Label distribution skew - random selection of labels(at max 3)/classes per client
Feature distribution skew - random colors(at max 3) on each client dataset, and test dataset

17

18 of 23

Federated Datasets

18

19 of 23

Federated Datasets

19

20 of 23

FeDomainBed

Train clients using OOD gen algorithms from DomainBed

Algos: ERM, IRM, VREx and a few more
FedAVG to aggregate client models to a global model

Evaluate global model on test dataset OOD to all client train datasets

20

21 of 23

Results

ERM(=FedAVG) performs better than all the OOD gen algos at client side

21

22 of 23

Inference & Discussion

Federated learning generalization objective is to learn mechanisms invariant across clients.
OOD Gen algorithms when applied at client, learns invariant mechanism within clients. But, these mechanisms may differ across clients.
So it may not be a good idea to perform OOD gen on client side.

22

23 of 23

Questions?

Quebec

Artificial

Intelligence

Institute