Functional analysis
Martina Summer-Kutmon
martina.kutmon@maastrichtuniversity.nl
Maastricht Centre for Systems Biology (MaCSBio)
NUTRIOME Workshop 1
30 May 2024
ORCID: 0000-0002-7699-8191
Part 1: Molecular processes and pathways
Current knowledge level
Introduction
Introduction to enrichment analysis
Quantify
Isolated data points
Introduction to enrichment analysis
Comparative statistics
Genes of interest (DESeq2)
Introduction to enrichment analysis
Enrichment analysis
Pre-defined gene sets → functional groups
Apoptosis
Catalytic activity
GATA3 targets
Introduction to enrichment analysis
Enrichment analysis
Pathway analysis
Pathway = gene set with information about relationships
Introduction to enrichment analysis
Systems organization
Network analysis
Why enrichment analysis?
“Enrichment” of gene sets
How does it work?
Gene expression
(microarray / RNA-seq)
Gene sets
Pathways, GO, gene sets
Enrichment analysis method
Over-representation analysis
Functional class scoring
Gene set significance
Gene set collections
Group genes based on some shared characteristic, e.g.
Molecular Signatures Database (MSigDB): https://www.gsea-msigdb.org/gsea/msigdb
Subramanian (2005) PNAS
doi: 10.1073/pnas.0506580102
Gene sets - level of detail
Example: Hedgehog signaling pathway
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
Gene Ontology
The Gene Ontology Consortium (2023) Genetics
doi: 10.1093/genetics/iyad031
Gene ontology vocabularies
Gene Ontology - coverage
The Gene Ontology Consortium (2023) Genetics
doi: 10.1093/genetics/iyad031
Gene Ontology - structure
https://geneontology.org/docs/ontology-documentation/
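The Gene Ontology is organised as a graph in which a term can have more than one parent, and an annotation to a specific term also counts for its more general ancestors. A minimal Python sketch of that idea, assuming a toy hierarchy with placeholder term and gene names (`term_A` to `term_D`, `gene1`, `gene2` are hypothetical, not real GO content):

```python
def ancestors(term, parents):
    """Collect all ancestors of a term in a graph given as child -> list of parents."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        for parent in parents.get(current, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen

def propagate(annotations, parents):
    """Turn gene -> directly annotated terms into term -> genes, including ancestor terms."""
    term_to_genes = {}
    for gene, terms in annotations.items():
        for term in terms:
            for t in {term} | ancestors(term, parents):
                term_to_genes.setdefault(t, set()).add(gene)
    return term_to_genes

# Toy hierarchy (hypothetical): term_D has two parents, both are children of term_A
toy_parents = {
    "term_D": ["term_B", "term_C"],
    "term_B": ["term_A"],
    "term_C": ["term_A"],
}
toy_annotations = {"gene1": ["term_D"], "gene2": ["term_B"]}

gene_sets = propagate(toy_annotations, toy_parents)
print(sorted(ancestors("term_D", toy_parents)))
print({term: sorted(genes) for term, genes in sorted(gene_sets.items())})
```

General terms near the root therefore collect many genes, while specific terms stay small, which is one reason the level of detail of a gene set matters for enrichment results.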
Gene Ontology - annotations
Pathway databases
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
Biological pathways
Pathway diagrams are found everywhere!
Biological pathways
https://www.genome.gov/about-genomics/fact-sheets/Biological-Pathways-Fact-Sheet
Utility to biologists as conceptual models is obvious
If modeled properly - immensely useful for computational analysis and interpretation of large-scale experimental data
Biological pathways
PDGFR-beta pathway with transcriptomic/phosphoproteomic data
Static image
Zhang (2016) Cell
Pathway Databases
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
WikiPathways
Too much data!
Difficult to keep knowledge up-to-date, accessible and integrated
Taking advantage of direct participation by a greater portion of the community (crowdsourcing)
Image: https://www.vizioninteractive.com/blog/data-overload-when-it-all-becomes-too-much/
WikiPathways
Content:
www.wikipathways.org
Agrawal (2024) NAR
doi: 10.1093/nar/gkad960
Community portals
Martens (2021) NAR
doi: 10.1093/nar/gkaa1024
https://academy.wikipathways.org/
Pathway databases - coverage
MSigDB: Human MSigDB v2023.2.Hs
19,846 protein coding genes (Ensembl GRCh38.p14)
Genes in at least one pathway of the three databases → 12,960 genes (65%)
File format
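The slide does not name the format. Assuming it refers to GMT, the tab-separated format in which MSigDB distributes gene sets, each line holds a set name, a description field, and then the gene identifiers. A minimal Python sketch of reading such a file and computing coverage against a background gene list; the function names, set names, and toy data are illustrative assumptions:

```python
import io

def parse_gmt(handle):
    """Read GMT lines: name <TAB> description <TAB> gene1 <TAB> gene2 ..."""
    gene_sets = {}
    for line in handle:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        name, _description, *genes = fields
        gene_sets[name] = {g for g in genes if g}
    return gene_sets

def coverage(gene_sets, background):
    """Fraction of background genes that occur in at least one gene set."""
    covered = set().union(*gene_sets.values()) & set(background)
    return len(covered) / len(background)

# Toy GMT content with made-up set names (hypothetical, not real MSigDB entries)
toy_gmt = io.StringIO(
    "SET_A\thttp://example.org/a\tTP53\tBAX\tCASP3\n"
    "SET_B\thttp://example.org/b\tGATA3\tTP53\n"
)
sets = parse_gmt(toy_gmt)
background = ["TP53", "BAX", "CASP3", "GATA3", "GLI1", "PTCH1", "SHH", "SMO"]

print(sorted(sets["SET_A"]))
print(coverage(sets, background))
```

The same calculation underlies the coverage figure on the previous slide: 12,960 of 19,846 protein coding genes is about 65%.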
Pathway enrichment
Over Representation Analysis (ORA)
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
Over Representation Analysis (ORA)
Pathway X
Enrichment score (e.g. Z-score)
Over Representation Analysis (ORA)
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
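ORA asks whether the genes of interest fall into Pathway X more often than expected by chance. A minimal Python sketch, assuming a hypergeometric test for the p-value and a Z-score that standardises the observed overlap with the hypergeometric mean and variance. The symbols N, R, n, r and the toy numbers are assumptions for illustration and do not come from the slides:

```python
from math import comb, sqrt

def ora_p_value(N, R, n, r):
    """Hypergeometric upper tail P(X >= r).

    N = measured genes (background), R = genes of interest,
    n = measured genes in the pathway, r = genes of interest in the pathway.
    """
    upper = min(n, R)
    favourable = sum(comb(n, k) * comb(N - n, R - k) for k in range(r, upper + 1))
    return favourable / comb(N, R)

def ora_z_score(N, R, n, r):
    """Observed overlap minus expected overlap, divided by the hypergeometric standard deviation."""
    expected = n * R / N
    sd = sqrt(n * (R / N) * (1 - R / N) * (1 - (n - 1) / (N - 1)))
    return (r - expected) / sd

# Toy numbers: 20 measured genes, 5 of interest, pathway of 4 genes, 3 of them of interest
p = ora_p_value(N=20, R=5, n=4, r=3)
z = ora_z_score(N=20, R=5, n=4, r=3)
print(p, z)
```

In practice many gene sets are tested at once, so the resulting p-values are normally adjusted for multiple testing before interpretation.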
Functional Class Scoring (FCS)
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
Functional Class Scoring (FCS)
Subramanian (2005) PNAS
doi: 10.1073/pnas.0506580102
Functional Class Scoring (FCS)
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
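FCS methods use the scores of all measured genes instead of a cutoff-based list. A minimal Python sketch of a GSEA-style running-sum enrichment score in the spirit of Subramanian (2005): walk down the ranked gene list, step up at genes in the set (weighted by their score) and step down otherwise, then report the largest deviation from zero. The function name, the weight parameter, and the toy data are assumptions, and the permutation step used to assess significance is left out:

```python
def enrichment_score(ranked_genes, scores, gene_set, weight=1.0):
    """Running-sum enrichment score; ranked_genes must be sorted by decreasing score."""
    in_set = [gene in gene_set for gene in ranked_genes]
    n_hits = sum(in_set)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must overlap the ranked list partially")
    hit_weights = [abs(s) ** weight if hit else 0.0 for s, hit in zip(scores, in_set)]
    total = sum(hit_weights)
    if total == 0:
        raise ValueError("all genes in the set have a score of zero")
    running, best = 0.0, 0.0
    for w, hit in zip(hit_weights, in_set):
        # Step up proportionally to the gene's score on a hit, step down uniformly on a miss
        running += w / total if hit else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best

# Toy ranked list (hypothetical gene names and scores)
genes = ["G1", "G2", "G3", "G4", "G5", "G6"]
scores = [3.0, 2.0, 1.0, -1.0, -2.0, -3.0]

es_top = enrichment_score(genes, scores, {"G1", "G2"})
es_bottom = enrichment_score(genes, scores, {"G5", "G6"})
print(es_top, es_bottom)
```

A set concentrated at the top of the ranking gives a positive score, and a set concentrated at the bottom gives a negative one.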
Pathway Topology Based (PTB)
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
Pathway Topology Based (PTB) - SPIA
Tarca (2009) Bioinformatics
doi: 10.1093/bioinformatics/btn577
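SPIA, as described in Tarca (2009), combines two lines of evidence per pathway: an over-representation p-value (P_NDE) and a perturbation p-value derived from the pathway topology (P_PERT). The two are merged into a global p-value P_G = c - c ln(c), with c = P_NDE x P_PERT. A minimal Python sketch of that combination step only; the function name and example values are assumptions, and computing P_PERT itself requires the pathway topology and is not shown:

```python
from math import log

def spia_combined_p(p_nde, p_pert):
    """Combine two independent p-values: P_G = c - c * ln(c), where c = p_nde * p_pert."""
    c = p_nde * p_pert
    if c <= 0.0:
        return 0.0  # limit of c - c*ln(c) as c approaches 0
    return c - c * log(c)

# Hypothetical example values for one pathway
p_global = spia_combined_p(p_nde=0.01, p_pert=0.05)
print(p_global)
```

The combined value is always at least as large as the product of the two inputs, so a pathway needs support from both sources to rank highly.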
Pathway Topology Based (PTB)
García-Campos (2015) Front. Physiol.
doi: 10.3389/fphys.2015.00383
Interpretation
Tools
Interpretation and visualization of results
Analysis results
Analysis results
https://yulab-smu.top/biomedical-knowledge-mining-book/enrichplot.html
Gene-concept networks
https://yulab-smu.top/biomedical-knowledge-mining-book/enrichplot.html
Gene-concept networks
Niarakis (2023) Front. Immunol.
doi: 10.3389/fimmu.2023.1282859
Enrichment maps
https://yulab-smu.top/biomedical-knowledge-mining-book/enrichplot.html
Tree plots
https://yulab-smu.top/biomedical-knowledge-mining-book/enrichplot.html
Data visualization in Cytoscape
Multiple comparisons
Miller (2019) Front. Genet.
doi: 10.3389/fgene.2019.00059
Time-series data
Tisoncik (2012) Microbiol Mol Biol Rev.
doi: 10.1128/MMBR.05015-11
Questions?
Martina Summer-Kutmon
martina.kutmon@maastrichtuniversity.nl
Maastricht Centre for Systems Biology (MaCSBio)