Prediction of response to melanoma immunotherapy using gut metagenomic data
Authors
M. Filippov,
T. Ziubko
Supervisor
Eugeniy Olekhnovich
FRCC PCM, Moscow, Russia
Immunotherapy has revolutionized the field of oncology. It has been demonstrated highly effective against various types of cancer, but a considerable number of patients were shown to be resistant.
Identification of individuals susceptible or resistant to cancer immunotherapy is a crucial point in response enhancement and patient survival.
The intestinal microbiota has been shown to significantly affect the immunotherapy of malignant tumours, which can be considered as a basis for the prediction of clinical response status.
Here we developed Machine learning-based predictive models utilizing gut metagenomic data to identify responded patients under melanoma immunotherapy. Microbial signatures as well as taxonomic assignments associated with it were also identified.
Introduction
Goal:
confirm the hypothesis of melanoma immunotherapy response status
can be predicted from gut microbiomes using machine learning models
Objectives:
Goals and objectives
Methods and approaches
Collecting raw data
MetaPhlAn (MPA)
KEGG Orthology (KO)
metagenome-assembled genomes (MAGs)
ConQuR package (v2.0)
https://wdl2459.github.io/ConQuR/ConQuR.Vignette.html
R-4.3.0
680 metagenomic stool samples from 7 studies published in 2017-2022
Frankel et al. 2017
Gopalakrishnan et al. 2018
Matson et al. 2018
Davar et al. 2021
Baruch et al. 2021
Spencer et al. 2021
Lee et al. 2022
Arranged in tables
with feature frequencies
Batch effect correction
RFC trained with sklearn package on python 3.9
Model validation on corrected and uncorrected data
Random forest classifier (RFC)
Extraction of the importances
Analysis of the most important features toward literature data
Identification of feature signatures associated with positive immunotherapy response
Annotation and analysis
Results: Batch effect correction
ConQuR
In attempt to get the most accurate feature importances, we corrected batch effect with ConQuR tool.
Using quantile regression it diminishes the dispersion explained by the batch and keeps the dispersion explained by the target variable (R/NR).
Results: model validation
Results: importance extraction
We extracted the most important feature from each of MetaPhlAn, KEGG Orthology and MAGs datasets.
Results: interpretation
KO annotation
29
bacterial species
from 17 genuses
MPH/MAGs genuses crossing tree
Bacteroides
Alistipes
Blautia
Parabacteroides
Roseburia
Akkermansia
Barnesiella
Bifidobacterium
Dorea
Escherichia
Faecalibacterium
Flavonifractor
Fusicatenibacter
Gemmiger
Odoribacter
Ruminococcus
Ruthenibacterium
Results: interpretation
Results: interpretation
Bacteroides: caccae, cellulosilyticus, ovatus, stercoris, thetaiotaomicron, uniformis, xylanisolvens | 7 |
Alistipes: finegoldii, indistinctus, putredinis, shahii | 4 |
Blautia obeum, wexlerae | 2 |
Parabacteroides: distasonis, merdae | 2 |
Roseburia intestinalis, inulinivorans | 2 |
Akkermansia muciniphila | 1 |
Barnesiella intestinihominis | 1 |
Bifidobacterium longum | 1 |
Dorea longicatena | 1 |
Escherichia coli | 1 |
Faecalibacterium prausnitzii | 1 |
Flavonifractor plautii | 1 |
Fusicatenibacter saccharivorans | 1 |
Gemmiger formicilis | 1 |
Odoribacter splanchnicus | 1 |
Ruminococcus bromii | 1 |
Ruthenibacterium lactatiformans | 1 |
Lachnospiraceae
Bacteroidaceae
Ruminococcaceae
Rikenellaceae
Bifidobacteriaceae
Tannerellaceae
Akkermansiaceae
Enterobacteriaceae
Odoribacteraceae
Prevotellaceae
Results: interpretation
MPH/MAGs genuses crossing tree
belong to 39 & 37 genuses (57 unique in total) with:
19 common ones
(from 10 families)
Conclusion: machine learning approaches can reveal gut microbiome-immunotherapy interactions and predict response status in melanoma therapy with good predictive value, which is ough to improve cancer patient outcomes
Conclusion
Future goals: