Cross-species project: pilot work
Jon Manning, BBSRC-NSF update meeting
May 2021
Background
Two approaches
Marker overlap
More complex approach: SAMap
https://doi.org/10.1101/2020.09.28.317784
https://github.com/atarashansky/
Evaluation approach
M
H
1. Marker overlap method
Difficulty: standard SCXA marker sets are mismatched by context
Markers are only markers relative to something else
Importance of context for marker comparison: e.g. what’s the signature for a pancreatic B cell?
E-MTAB-5061: Human pancreas
Markers = “What genes make a B cell look different to average expression across the rest of the pancreas?”
OR
“What genes define the un-pancreasiness of the B cell?”
Pancreas
B cell
Acinar
A cell
D cell
E-ENAD-15: whole mouse
Markers = “What genes make a B cell look different to average expression across the rest of the mouse?”
OR
“What genes define the un-mousiness of the B cell?”
Mouse
Pancreas
B cell
Acinar
A cell
D cell
Blood
Kidney
Heart
Kidney
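The dependence of markers on background can be illustrated with a toy calculation. This is only a sketch: the expression values, the gene names (g_beta, g_endocrine, g_alpha, g_kidney), the `markers` helper and the mean-difference threshold are all invented for illustration, and this is not the marker detection method used by SCXA.

```python
import numpy as np

def markers(expr, labels, target, genes, min_diff=3.0):
    """Genes whose mean expression in `target` cells exceeds the mean
    over all other cells in the same context by more than min_diff."""
    labels = np.asarray(labels)
    in_target = labels == target
    diff = expr[in_target].mean(axis=0) - expr[~in_target].mean(axis=0)
    return {g for g, d in zip(genes, diff) if d > min_diff}

# Hypothetical genes: one B-cell-specific, one shared by endocrine cells,
# one A-cell-specific, one kidney-specific.
genes = ["g_beta", "g_endocrine", "g_alpha", "g_kidney"]
labels = ["B cell", "A cell", "D cell", "Acinar", "Blood", "Kidney", "Heart"]

# Toy log-expression matrix (one row per cell, one column per gene).
expr = np.array([
    [9.0, 6.0, 0.0, 0.0],  # B cell
    [0.0, 6.0, 9.0, 0.0],  # A cell
    [0.0, 6.0, 0.0, 0.0],  # D cell
    [0.0, 0.0, 0.0, 0.0],  # Acinar
    [0.0, 0.0, 0.0, 0.0],  # Blood
    [0.0, 0.0, 0.0, 9.0],  # Kidney
    [0.0, 0.0, 0.0, 0.0],  # Heart
])

# Pancreas context: background is the other pancreatic cells only.
pancreas = slice(0, 4)
pancreas_markers = markers(expr[pancreas], labels[pancreas], "B cell", genes)

# Whole-mouse context: background is every other cell in the animal.
mouse_markers = markers(expr, labels, "B cell", genes)

print(sorted(pancreas_markers))
print(sorted(mouse_markers))
```

In this toy case the gene shared by all endocrine cells only counts as a B cell marker against the whole-mouse background, where pancreatic cells are a minority.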
What happens if we try?
Markers for human B cells from the pancreas context overlap with marker genes for a variety of pancreas cell types from the whole-mouse context.
Markers for pancreatic cell types in the mouse dataset are less specific to those cell types within the pancreas, since the background contained a relatively low number of pancreatic cells.
The same applies to other hierarchical differences, e.g. pancreas vs its sub-parts (endocrine/exocrine pancreas).
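A minimal sketch of what a marker overlap comparison might look like. The slides do not state which overlap statistic is used, so the Jaccard index here is an assumption, as are the `jaccard` and `rank_matches` helpers and the placeholder gene names. Gene identifiers are assumed to be already mapped to shared orthologue names across species.

```python
def jaccard(a, b):
    """Jaccard index of two marker sets (0.0 when both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def rank_matches(query_markers, reference):
    """Rank reference cell types by marker overlap with the query set."""
    scores = {ct: jaccard(query_markers, m) for ct, m in reference.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Hypothetical marker sets, expressed in a shared orthologue namespace.
human_b_cell = {"g1", "g2", "g3", "g4"}
mouse_cell_types = {
    "B cell": {"g1", "g2", "g3", "g5"},
    "A cell": {"g2", "g6", "g7"},
    "Acinar": {"g8", "g9"},
}

ranked = rank_matches(human_b_cell, mouse_cell_types)
for cell_type, score in ranked:
    print(f"{cell_type}: {score:.2f}")
```

When the two marker sets come from mismatched contexts, several reference cell types can score similarly, which is the ambiguity described above.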
Common markers with B cells wrt mouse
Common markers with B cells wrt mouse pancreas
Easy, I’ll just split experiments by organism part and re-calculate markers before comparison
Not quite: granularity can differ between experiments, so we need to merge more specific terms
E-ENAD-15 granularity
E-MTAB-5061 granularity
So we need to merge these categories for comparison
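One way the merging step could work, sketched under assumptions: each label is walked up a parent mapping until it reaches a term both experiments share. The `PARENT` mapping, the label strings and the `harmonise` helper are hypothetical and are not taken from the Snakemake workflow or from the actual annotations of either experiment.

```python
# Hypothetical child -> parent relationships between cell type labels.
PARENT = {
    "type B pancreatic cell": "pancreatic endocrine cell",
    "pancreatic A cell": "pancreatic endocrine cell",
    "pancreatic D cell": "pancreatic endocrine cell",
    "pancreatic acinar cell": "pancreatic exocrine cell",
}

def harmonise(labels, shared_terms, parent=PARENT):
    """Replace each label by its nearest ancestor found in shared_terms.
    Labels with no shared ancestor end up at their topmost known term."""
    out = []
    for label in labels:
        term = label
        while term not in shared_terms and term in parent:
            term = parent[term]
        out.append(term)
    return out

# Fine-grained labels from one experiment, and the terms both experiments use.
fine_labels = ["type B pancreatic cell", "pancreatic A cell", "pancreatic acinar cell"]
shared = {"pancreatic endocrine cell", "pancreatic acinar cell"}

merged = harmonise(fine_labels, shared)
print(merged)
```

Markers would then be re-calculated on the merged labels so that both experiments are compared at the same granularity.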
Solution
Snakemake implementation: https://github.com/ebi-gene-expression-group/cross-species-cellgroup-comparison
Basic validation
Results
Pilot results: between human and mouse, SAMap performs similarly to marker overlap
Experiment 1 (mouse) | Experiment 2 (human) | Common organism part | Intersecting cell types | Predicted intersecting (top rank, markers) | Predicted intersecting (top rank, SAMap) |
E-ENAD-15 | E-GEOD-125970 | colon | 2 | 2 | 2 |
E-ENAD-15 | E-GEOD-83139 | pancreas | 5 | 5 | 4 |
E-ENAD-15 | E-HCAD-10 | kidney | 4 | 4 | 4 |
E-ENAD-15 | E-HCAD-1 | lung | 4 | 3 | 3 |
E-ENAD-15 | E-HCAD-1 | spleen | 3 | 1 | 1 |
E-ENAD-15 | E-MTAB-5061 | pancreas | 7 | 7 | 7 |
E-ENAD-15 | E-MTAB-8410 | ascending colon | 2 | 2 | 2 |
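The counts in the table can be totalled to compare the two methods overall. This sketch assumes the two "predicted intersecting" columns count shared cell types whose true counterpart was ranked top by each method. The variable names are illustrative.

```python
# (human experiment, organism part, intersecting, top rank markers, top rank SAMap)
rows = [
    ("E-GEOD-125970", "colon", 2, 2, 2),
    ("E-GEOD-83139", "pancreas", 5, 5, 4),
    ("E-HCAD-10", "kidney", 4, 4, 4),
    ("E-HCAD-1", "lung", 4, 3, 3),
    ("E-HCAD-1", "spleen", 3, 1, 1),
    ("E-MTAB-5061", "pancreas", 7, 7, 7),
    ("E-MTAB-8410", "ascending colon", 2, 2, 2),
]

# Column-wise totals across all comparisons.
total = sum(r[2] for r in rows)
by_markers = sum(r[3] for r in rows)
by_samap = sum(r[4] for r in rows)

print(f"marker overlap: {by_markers}/{total}")  # marker overlap: 24/27
print(f"SAMap: {by_samap}/{total}")             # SAMap: 23/27
```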
Example: pancreas
Example: kidney
Interim conclusions
Next steps
F
M
H
Acknowledgements