Identification of RNA Pseudouridine Sites using Deep Learning Approaches
Supervised By
Dr. Md. Al Mamun
Professor
Dept. of Computer Science & Engineering
Rajshahi University of Engineering & Technology
Presented By
Abu Zahid Bin Aziz
Roll: 1503047
Dept. of Computer Science & Engineering
Rajshahi University of Engineering & Technology
Outline
2
Introduction
3
14/02/2021 Pseudouridine Identification by Deep Learning 3
4
Motivation
14/02/2021 Pseudouridine Identification by Deep Learning 4
5
14/02/2021 Pseudouridine Identification by Deep Learning 5
Objectives
6
14/02/2021 Pseudouridine Identification by Deep Learning 6
Literature Review
7
14/02/2021 Pseudouridine Identification by Deep Learning 7
Workflow
8
Species | Benchmark Samples | Independent samples | Length of Nucleotides |
H. Sapiens(HS) | 990 | 200 | 21 |
S. Cerevisiae(SC) | 628 | 200 | 31 |
M. Musculus(MM) | 944 | None | 21 |
14/02/2021 Pseudouridine Identification by Deep Learning 8
Dataset Collection
9
14/02/2021 Pseudouridine Identification by Deep Learning 9
Data Preprocessing
10
14/02/2021 Pseudouridine Identification by Deep Learning 10
Data Preprocessing(Contd.)
11
14/02/2021 Pseudouridine Identification by Deep Learning 11
CNN Architecture(General)
12
14/02/2021 Pseudouridine Identification by Deep Learning 12
CNN Architecture(Multi-stage)
13
Hyperparameters | Ranges of Values | Datasets | |
HS_990, MM_944 | SC_628 | ||
Batch Size | [10,20,30,40] | 10 | 10 |
No. of Epochs | [10,50,100,200] | 100 | 100 |
No. of Channels | [5,7,9,10,11] | 11 | 9 |
Filter Height | [3,5,7,9] | 9 | 7 |
Learning Rate | [0.001,0.0003,0.0005,0.00057,0.0001] | 0.0005 | 0.0001 |
Dropout Probability | [0.4,0.45,0.5,0.55,0.6] | 0.6 | 0.4 |
14/02/2021 Pseudouridine Identification by Deep Learning 13
Hyperparameter Tuning
14
14/02/2021 Pseudouridine Identification by Deep Learning 14
Hyperparameter Tuning(Contd.)
15
Predictors | Independent Datasets | |||||||
S_200 | H_200 | |||||||
AC(%) | SN(%) | SP(%) | MCC | AC(%) | SN(%) | SP(%) | MCC | |
iRNA-Pseu[2] | 60.00 | 63.00 | 57.00 | 0.20 | 61.50 | 58.00 | 65.00 | 0.23 |
PseUI[3] | 68.50 | 65.00 | 72.00 | 0.37 | 65.50 | 63.00 | 68.00 | 0.31 |
iPseu-CNN[4] | 73.50 | 68.76 | 77.42 | 0.47 | 69.00 | 77.72 | 60.81 | 0.40 |
iPseu-Layer[6] | 72.50 | 68.00 | 77.00 | 0.45 | 71.00 | 63.00 | 79.00 | 0.43 |
Ours(General) | 75.00 | 67.00 | 83.00 | 0.50 | 72.5 | 80.00 | 65.00 | 0.44 |
Ours (Merged-seq) | 76.50 | 80.00 | 73.00 | 0.53 | 74.00 | 73.00 | 75.00 | 0.48 |
14/02/2021 Pseudouridine Identification by Deep Learning 15
Results
*AC= ACCURACY, SN=SENSITIVITY, SP=SPECIFICITY, MCC= MATTHEWS CORRELATION COEFFICIENT
16
14/02/2021 Pseudouridine Identification by Deep Learning 16
Results(Contd.)
17
14/02/2021 Pseudouridine Identification by Deep Learning 17
Results(Contd.)
18
14/02/2021 Pseudouridine Identification by Deep Learning 18
Webserver Implementation(Step-1)
19
14/02/2021 Pseudouridine Identification by Deep Learning 19
Webserver Implementation(Step-2)
20
14/02/2021 Pseudouridine Identification by Deep Learning 20
Future Scopes
21
14/02/2021 Pseudouridine Identification by Deep Learning 21
Publication
[1] Y.-H. Li, G. Zhang, and Q. Cui, “Ppus: a web server to predict pusspecific pseudouridine sites,” Bioinformatics, vol. 31, no. 20, pp. 3362–3364, 2015.
[2] W. Chen, H. Tang, J. Ye, H. Lin, and K.-C. Chou, “iRNA-Pseu: Identifying rna pseudouridine sites,” Molecular Therapy-Nucleic Acids, vol. 5, p.e332, 2016.
[3] J. He, T. Fang, Z. Zhang, B. Huang, X. Zhu, and Y. Xiong, “PseUI: pseudouridine sites identification based on rna sequence information,” BMC bioinformatics, vol. 19, no. 1, p. 306, 2018.
[4] M. Tahir, H. Tayara, and K. T. Chong, “iPseU-CNN: Identifying RNA pseudouridine sites using convolutional neural networks,” Molecular Therapy-Nucleic Acids, vol. 16, pp. 463–470, 2019.
[5] Liu K, Chen W, Lin H. XG-PseU: an eXtreme Gradient Boosting based method for identifying pseudouridine sites. Molecular Genetics and Genomics. 2020;295(1):13–21.
[6] Mu Y, Zhang R, Wang L, Liu X. iPseU-Layer: Identifying RNA Pseudouridine Sites Using Layered Ensemble Model. Interdisciplinary Sciences: Computational Life Sciences. 2020; p. 1–11.
14/02/2021 Pseudouridine Identification by Deep Learning 22
References
Thank You