Equivariant Contrastive Learning
Rumen Dangovski, Li Jing, Charlotte Loh, Seungwook Han, Akash Srivastava, Brian Cheung, Pulkit Agrawal, Marin Soljačić
https://github.com/rdangovs/essl
http://super-ms.mit.edu/essl.html
Outline
Disclaimer: work focuses on joint embedding. Only preliminary results on MAE.
Exact equivariance matters in science
Nat. Comm. 2022
Invariance to Transformations Learns Good Features
Chen & He. CVPR 2021
Generalizing Invariance to Equivariance Brings New SSL
blur,
flip,
rotation,
…
insensitive: invariance,
sensitive: equivariance
proof-of-concept CIFAR-10 study motivates complementarity of invariance & equivariance
Concept
Our Simple Equivariant SSL Objective
E-SSL: Equivariant Self-Supervised Learning
We predict g to encourage sensitivity to g
We could use any SSL loss
We control the level of sensitivity
Example pseudocode for sensitive rotation
Even Simple Generalization Improves SOTA Methods
E-SSL’s Bag of Tricks
E-SSL
E-SSL is Robust to Restricted Augmentation and Labels
E-SSL in NLP: Sentence Embeddings SOTA
invariance to Replaced Token Transformation reduces feature quality
https://github.com/voidism/DiffCSE
arXiv:2204.10298 NAACL-HLT 2022
Next Steps of E-SSL for Unbiased Datasets
If they appear in the dataset then E-SSL does not work directly
Small E-SSL modification:
predict relative orientation from concatenated features instead
Pre-train on unbiased CIFAR-10, downstream rotation prediction: 67.1% -> 71.2%
Importance of E-SSL to Physics, and Physics to E-SSL
https://github.com/rdangovs/essl
Code for new photonics datasets for SSL,
Minimal code for strong CIFAR10 and ImageNet SSL baselines,
Pretrained E-SSL models on ImageNet
Further steps
E-SSL for Exact Equivariance
On CIFAR-10 not different from original E-SSL
Let’s model it
Exact equivariance matters in science
Nat. Comm. 2022
Method in science
Effective method
Effective against equivariant architectures
Outlook
Thank you!
https://github.com/rdangovs/essl
http://super-ms.mit.edu/essl.html
Appendix