Foundation Models
Hamidreza Moaddeli
1
What is Foundation Model?
2
What is Foundation Model?
3
What is Foundation Model?
4
Slide Credits : “Foundational Robustness of Foundation Models” , NeurIPS 2022 Tutorial
5
Slide Credits : “Foundational Robustness of Foundation Models” , NeurIPS 2022 Tutorial
What is Foundation Model?
6
Emergence & Homogenization
7
Emergence and homogenisation
8
Slide Credits : “"Foundation Models" , Samuel Albanie, Online Course 2022
Foundation models - NLP developments
9
Slide Credits : “"Foundation Models" , Samuel Albanie, Online Course 2022
Foundation models - homogenisation
10
Slide Credits : “"Foundation Models" , Samuel Albanie, Online Course 2022
Ecosystem
11
Slide Credits : “"Foundation Models" , Samuel Albanie, Online Course 2022
Resource accessibility
12
Slide Credits : “"Foundation Models" , Samuel Albanie, Online Course 2022
Technology
13
Slide Credits : “"Foundation Models" , Samuel Albanie, Online Course 2022
Technology
14
Slide Credits : “"Foundation Models" , Samuel Albanie, Online Course 2022
History of Deep Learning
15
Slide Credits : "CS 886: Recent Advances on Foundation Models", Wenhu Chen, University of Waterloo, Winter 2024
Pros and Cons of Specialized DL
16
Slide Credits : "CS 886: Recent Advances on Foundation Models", Wenhu Chen, University of Waterloo, Winter 2024
Transfer Learning
17
Slide Credits : "CS 886: Recent Advances on Foundation Models", Wenhu Chen, University of Waterloo, Winter 2024
Pros and Cons of Transfer DL
18
Slide Credits : "CS 886: Recent Advances on Foundation Models", Wenhu Chen, University of Waterloo, Winter 2024
Transfer Learning in Word Vector
19
Transfer Learning in Word Vector
20
Transfer Learning in Vision
21
Transfer learning & Scale
On a technical level, foundation models are enabled by transfer learning and scale. Transfer learning is what makes foundation models possible, but scale is what makes them powerful.
Scale required three ingredients:
22
Slide Credits :"UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
Parameters of Foundation DL
23
Slide Credits : "CS 886: Recent Advances on Foundation Models", Wenhu Chen, University of Waterloo, Winter 2024
24
Image Source : "What are Foundation Models in AI?", https://www.youtube.com/watch?v=dV0X1QyLL8M
25
Image Source : "What are Foundation Models in AI?", https://www.youtube.com/watch?v=dV0X1QyLL8M
26
Image Source : "What are Foundation Models in AI?", https://www.youtube.com/watch?v=dV0X1QyLL8M
Self-Supervised Learning
Solving the problem of expensive annotations: self-supervision.
27
Slide Credits : "UVA Deep Learning Course", Yuki Asano , Fall 2022
General procedure of self-supervised learning
28
Slide Credits : "UVA Deep Learning Course", Yuki Asano , Fall 2022
Early methods: Context prediction
29
Slide Credits : "UVA Deep Learning Course", Yuki Asano , Fall 2022
Early methods: Context prediction
30
Modern Noise-contrastive self-supervised learning
31
Slide Credits : "UVA Deep Learning Course", Yuki Asano , Fall 2022
Masked Image Modelling (recent development)
32
Slide Credits : "UVA Deep Learning Course", Yuki Asano , Fall 2022
Transformers
33
Language Transformers (Encoder-only)
Well-suited for tasks requiring an understanding of the full sequence,
34
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
Bidirectional Encoder Representations from Transformers (BERT)
BERT uses two self-supervised objectives:
The pre-trained BERT model can be fine-tuned by adding a classifier layer for many language understanding tasks
35
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
BERT pre-training and fine-tuning
Fine-tuning is straightforward, simply plug in the task-specific inputs and outputs into BERT and finetune all the parameters end-to-end.
36
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
BERT
37
BERT Results
38
Language Transformers (Decoder-only)
39
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
GPT-2
40
GPT-3
41
In-context Learning
42
Few-shot Learning (GPT-3)
43
GPT-3 Results
Promising results in the zero- and one-shot settings, and in the few- shot setting sometimes competitive with fine-tuned state-of-the-art
44
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
GPT-3 Results
GPT-3 can be applied to any downstream task without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model.
45
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
Language Transformers (Encoder-Decoder)
46
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
T5: Text-to-Text Transfer Transformer
47
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
Emergent Ability
48
Slide Credits : "CS 886: Recent Advances on Foundation Models", Wenhu Chen, University of Waterloo, Winter 2024
Emergent Abilities of Large Language Models
49
Slide Credits : "CS25: Transformers United V4", Stanford University, Spring 2024
Few-Shot Prompting
50
Slide Credits : "CS25: Transformers United V4", Stanford University, Spring 2024
Potential Explanations of Emergence
51
Slide Credits : "CS25: Transformers United V4", Stanford University, Spring 2024
52
53
54
Vision transformer
55
ViT’s outperform ResNets at scale
56
Slide Credits : "UvA Foundation Models Course" , Cees Snoek, Yuki Asano , Spring 2024
Contrastive Language-Image Pre-training (CLIP)
57
CLIP
58
Slide Credits : "CS 8803 VLM Vision-Language Foundation Models", Zsolt Kira, Georgia Tech, Fall 2024
CLIP
59
Slide Credits : "CS 8803 VLM Vision-Language Foundation Models", Zsolt Kira, Georgia Tech, Fall 2024
CLIP ( Distribution Drift , Few-Shot )
60
Pathology Language and Image Pre-Training (PLIP)
The model is a fine-tuned version of the original CLIP model.
61
Slide Credits : "BIODS 271: Foundation Models for Healthcare" ,Stanford University, Winter 2024
Creating OpenPath: >200K high-quality Twitter image-text pairs
Largest public dataset of pathology image + discussions.
62
Slide Credits : "BIODS 271: Foundation Models for Healthcare" ,Stanford University, Winter 2024
PLIP applications
63
Slide Credits : "BIODS 271: Foundation Models for Healthcare" ,Stanford University, Winter 2024
PLIP can serve as a powerful search engine for medicine
64
Slide Credits : "BIODS 271: Foundation Models for Healthcare" ,Stanford University, Winter 2024
Natural language-to-code system based on GPT-3 (Codex)
65
Slide Credits : "COS 597G: Understanding Large Language Models" , Danqi Chen, Princeton University, Fall 2022
Codex (Examples)
66
Slide Credits : "COS 597G: Understanding Large Language Models" , Danqi Chen, Princeton University, Fall 2022
Segment Anything Model (SAM):
The first foundation model for promptable segmentation
67
Try the demo: https://segment-anything.com/demo
Slide Credits : "COMP 590/776: Computer Vision in 3D World", Roni Senguptam UNC, Spring 2023
SAM
SAM is built with three interconnected components: A task, an model, and a data engine.
68
SAM considers two sets of prompts: sparse (clicks, boxes, text) and dense (masks).
Slide Credits : "COMP 590/776: Computer Vision in 3D World", Roni Senguptam UNC, Spring 2023
SAM
69
Slide Credits : "COMP 590/776: Computer Vision in 3D World", Roni Senguptam UNC, Spring 2023
SAM (Zero-Shot Single Point Valid Mask Evaluation)
70
Slide Credits : "COMP 590/776: Computer Vision in 3D World", Roni Senguptam UNC, Spring 2023
Med-Gemini : Multimodal medical models built on Gemini
71
Slide Credits : "Emergence of Foundation Models: Opportunities to Rethink Medical AI", Shekoofeh Azizi, CVPR 2024
72
Slide Credits : "Emergence of Foundation Models: Opportunities to Rethink Medical AI", Shekoofeh Azizi, CVPR 2024
73
Slide Credits : "Emergence of Foundation Models: Opportunities to Rethink Medical AI", Shekoofeh Azizi, CVPR 2024
74
Slide Credits : "Emergence of Foundation Models: Opportunities to Rethink Medical AI", Shekoofeh Azizi, CVPR 2024
75
Slide Credits : "Emergence of Foundation Models: Opportunities to Rethink Medical AI", Shekoofeh Azizi, CVPR 2024
76
Image Credits : Ronnie, King of Zurich
77
Image Credits : Ronnie, King of Zurich