Unveiling the role of epigenomic features on RNA splicing throughout neural development using machine learning.
4DN Hackathon: Team 7
DNA transcription and RNA splicing were thought to be two independent processes
The DNA/chromatin state can affect RNA splicing
Nucleic Acids Res. 2022 Nov 11; 50(20): 11563–11579.
Can we predict RNA splicing events using genomic and epigenomic features?
Excitatory and interneuron were selected for training
PLAC-seq (H3K4me3)
RNA-seq
PSI score was used to quantify RNA splicing events
Ref (Methods in Molecular Biology ((MIMB,volume 2117)))
Splicing changes between excitatory and interneuron
Retained (PSI >=0.1):
Non retained (PSI < 0.02):
Retained (PSI >=0.1):
Non retained (PSI < 0.02):
Interneuron
Excitatory neuron
Splicing prediction setup
Input modality:
Cell type specific epigenetic signatures predict splicing levels
Model : XGBoost (regression)
Diverse machine learning models can be employed to predict splicing levels
Random Forest
Regressor
Ridge
XGB-BCE
Gradient models outperform other models in predicting splicing
Sequence only and multi-modal SpliceNet
Model Training & Metrics
Sequence-only SpliceNet (EN cell type):
Multiple-modal SpliceNet (EN and IN cell types):
In silico saturation mutagenesis using SpliceNet
Junction center ±150bp at both 5’ and 3’ end, categorized by High-intron-retaining and Low-intron-retaining groups. Aggregated 300 samples from validation set.
5’
3’
HR
LR
HR
LR
Intron retention is associated with lower DNA methylation levels near splice junctions and within retained introns.
Justin J. -L. Wong. Nature Communications volume 8, 15134 (2017)
Multi-modal SpliceNet captures cell-type-specific events
Conclusions and next steps
Next steps
Thank you!
Team 7:
Trainee Organizers