Juan C. Caicedo Ph.D.
HTC 2024
Madison, WI, July 8 2024
Advancing Biology through �High throughput computing
Modeling cellular morphology using machine learning
Imaging reveals cellular phenotypic variation
https://micro.magnet.fsu.edu/cells/fluorescencemitosis/index.html
Eitaki, M., et al. Vincristine enhances amoeboid-like motility via GEF-H1 / RhoA / ROCK / Myosin light chain signaling in MKN45 cells. BMC Cancer 12, 469 (2012).
Compound treatments
Cell-cycle stages
Screen for specific phenotypes using images
Clinical trials underway for Alisertib in adults with AMKL.
Wen Q, et al. (2012). Cell 150(3):575-89
DNA stain with outlines identifying the nuclei
DMSO
SU6656
Treatment for AMKL (leukemia)
1. Raw images
2. Segmented images
3. Single-cell feature matrices
4. Population profiles of treatments
Image-based cell profiling
5. Downstream statistical analysis
Are treatments
significantly different / effective?
Caicedo, et al. 2017 Nature Methods
Imaging data informs biomedical research
Chemical structures (CS)
Yeast
Fungal
Cell-based
Biochemical
Bacterial
Parasite
Worm
MO
CS
GE
11
14
8
4
3
0
Morphology (MO)
Gene expression (GE)
Gene Expression
Morphology
Same allele across platforms
EGFR_p.S645C
KRAS_p.G12V
CCA space
1.0
0.4
-0.2
EGFR Wild Type
Control
EGFR Mutant
Functional Genomics Research
Drug Discovery
Cell segmentation
Nucleus segmentation problem
Segmentation
Otsu’s thresholding method
Problem: where are the micronuclei?
Nucleus
Micronucleus
Example input
Manual annotation
A transformer-based segmentation model
Linear projection of patches
Output Tokens
1
2
3
4
5
6
7
8
9
0
Vision Transformer Network
Self-attention layers
Input
Tokenized image
Sequence of patches
Segmentation head
32x32
64x64
128x128
256x256
Probability map
32x32
256x256
Experimental setup
18 manually annotated images
Image size: 2,960x2,960 pixels
Each image has ~80 objects of interest
Randomly sample crops of 256x256 pixels
Leave-one-out cross validation
Explore hyper-parameters
Baseline model
Precision: 0.81465 Recall: 0.42912
Experiments
Precision: 0.81465 Recall: 0.42912
Precision: 0.68104 Recall: 0.32303
Precision: 0.42653 Recall: 0.64992
A: Frozen backbone
Experiments
Precision: 0.81465 Recall: 0.42912
Precision: 0.8352 Recall: 0.3367
Precision: 0.6884 Recall: 0.56416
A: Frozen backbone
B: Fine tuning transformer
Experiments
Precision: 0.81465 Recall: 0.42912
Precision: 0.6922 Recall: 0.6525
Precision: 0.7469 Recall: 0.591
A: Frozen backbone
B: Fine tuning transformer
C: Output resolution 128
Experiments
Precision: 0.81465 Recall: 0.42912
Precision: 0.7024 Recall: 0.71113
A: Frozen backbone
B: Fine tuning transformer
C: Output resolution 128
D: Training with more data
Experiments
Precision: 0.81465 Recall: 0.42912
Precision: 0.77526 Recall: 0.78932
A: Frozen backbone
B: Fine tuning transformer
C: Output resolution 128
D: Training with more data
E: Adding LayerNorm
Final result
Precision: 0.81465 Recall: 0.42912
Precision: 0.8157 Recall: 0.8352
A: Frozen backbone
B: Fine tuning transformer
C: Output resolution 128
D: Training with more data
E: Adding LayerNorm
F: Screen images
Cell phenotyping
Can we create a generalist bioimage analysis model?
Adapted from (a) https://bmcmolcellbiol.biomedcentral.com/articles/10.1186/1471-2121-9-42/ (b) https://www.kaggle.com/c/human-protein-atlas-image-classification/ � (c) https://www.broadinstitute.org/ (d) https://www.nature.com/articles/s41587-019-0207-y/
RGB natural images
Cell Painting(c)
Highly multiplexed immunofluorescence(d)
Human protein atlas(b)
High-content screening(a)
3 channels
4 channels
5 channels
10+ channels
ImageNet pretrained models are designed for 3 channel, RGB images
Learning from images at scale
Millions of images
High-throughput computing
Image-based discoveries
Data sources
30 studies ~5M images
10 TB
75 TB
120 TB
70 studies ~20M images
12 laboratories ~40M images
Powered by
Self-supervised representation learning
Linear projection of patches
Output
1
2
3
4
5
6
7
8
9
0
Vision Transformer Network
Self-attention layers
Input cell
Tokenized image
Sequence of patches
Goal: discover biologically relevant cellular features without human supervision
Teacher Network
Student Network
Features
Features
Global views
Local views
Self-supervision
Single-cell
image
Self-distillation with no labels
Doron, et al. "Unbiased single-cell morphology with self-supervised vision transformers." bioRxiv (2023).
A single model to recapitulate all studies
Compound treatments
Cell-cycle stages
Protein localization
Replicate findings from about 100 publicly available imaging studies
Use the model to power biological discoveries at Morgridge and UW
Summary
Imaging
Imaging encodes rich phenotypic information
Representation
Machine learning identifies useful biological variation
Generalization
From specialized methods to foundation model
Applications
Unbiased features have many biological applications
25
© 2023 Caicedo Lab.
Broad Institute
University of Helsinki
Columbia University
Meta AI
Morgridge Institute
CHTC Team
Biomedical Imaging
Boston University
Research Computing
Brian Bockelman�
Justin Hiemstra
�Andrew Maier
Gratitude
Caicedo Lab
Nikita Moshkov
John Peters
Other past and future members
Tyler Thompson
Zayn
Kayali
Yifan
Ren
Aditya Pillai