1 of 26

A DATA DRIVEN UNDERSTANDING OF COVID-19 DYNAMICS AND IDENTIFICATION OF THE POSITIVE COVID-19 CASES USING GENETIC ALGORITHM

Soorayadas Sudhakaran (Roll no.: B180494PE)

Farshad O P (Roll no.: B180415PE)

Rohan Raj (Roll no.: B180420PE)

Deepak P (Roll no.: B170198PE)

Sanal P (Roll no.: B180803PE)

2 of 26

CONTENTS

  • Introduction
  • Literature Review
  • Methodology
  • Results
  • Conclusions
  • Future scope
  • References

3 of 26

INTRODUCTION

  • Coronaviruses are a family of viruses that can cause respiratory illness in humans.

  • The new strain of coronavirus, COVID-19, was first reported in Wuhan, China in December 2019. The virus has since spread to all continents

  • This highly infectious disease transmit from person to person through respiratory droplets produced by infected person

  • The measures thus taken include social distancing, restrictions on domestic as well as international travel, cancelling social events, shutting down of public as well as commercial activities etc.

4 of 26

INTRODUCTION (Contd…)

  • Various technological measures and data analytics and data scientist are working to come out with this novel solution

  • Here we study the extension of contact tracing app wherein the cluster of users are identified who all are come in contact with the Covid-19 infected person but also after taking the list, their symptoms are matched with the infected person

  • The new algorithm has been developed GABFCov-19 which is based on genetic algorithm.

  • This algorithm will generate high probability positive cases. So, that it can reduce the community spread.

5 of 26

LITERATURE REVIEW

Sl.no

Author

Title

Contents

1

L. Li, Z. Yang, Z. Dang, C. Meng, J. Huang, H. Meng, D. Wang, G. Chen, J. Zhang, H. Peng

Propagation analysis and prediction of the COVID-19

Infect. Dis. Model. 5 (2020) 282–292

  • Predicted the factors affecting spread of COVID-19.
  • Used Guassian distribution , obtained curve almost similar to actual data.

2

Rizk-Allah, R.M. And Hassanien

Preliminary identification of potential vaccine targets for the COVID-19

coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies.(2020)

  • A forecasting model to analyze and forecast the Confirmed cases of COVID-19
  • Interior search algorithm based learning to improvise the Feed forward Neural network.

6 of 26

LITERATURE REVIEW (Contd…)

Sl.no

Author

Title

Contents

3

Al-qaness, M.A., Ewees, A.A., Fan, H. And Abd El Aziz, M.

Optimization method for forecasting confirmed cases of covid-19 in china.(2020)

  • Used an ANFIS model which enhanced by applying Flower pollination and Salp swarm algorithm.
  • System shows better performance in MAPE, RMSE and computing time.

4

Ghazaly, N.M., Abdel-Fattah, M.A. and Abd El-Aziz, A.A

Novel Coronavirus Forecasting Model using Nonlinear Autoregressive

Artificial Neural Network.(2020)

  • Prediction of cases with the help of AI and deep learning.
  • Forecasted corona virus cases around 9 countries by times series using non Linear regressive network.

7 of 26

METHODOLOGY

Contact tracing app: The Contact tracing app gives us the cluster of the entity which are came in contact with infected person within 14 days.

Figure 1: working algorithm of Contact tracing Proximity App

8 of 26

METHODOLOGY (Contd…)

On the basis of the cluster found by using contact tracing app, the symptoms parameter has been matched by using genetic algorithm

Figure 2:Working Methodology of GABFCov-19

9 of 26

METHODOLOGY (Contd…)

  • Algorithm assigns the fitness of the entity depending upon its occurrences in the symptoms that affect the covid-19.

  • The fitness has been computed according to the matching parameters. The entity whose more parameters matched, the more will be the fitted entity.

  • High fitness value represents the high probability of infected Covid-19 patient.

10 of 26

  • Suppose that binary chromosomes of the positive case is 1 1 1 0 1 1
  • Entity found in the cluster has been represented by the chromosomes as 1 1 0 0 0 1
  • By matching parameters in both chromosomes Fitness value = 4

METHODOLOGY (Contd…)

11 of 26

Steps of GABFCov-19 Algorithm

  • Input: Cluster’s entities
  • Output: The number of expected high probability positive cases

STEP 1

  • Initialize the parameters

STEP 2

  • Randomly generate the m chromosomes

12 of 26

STEP 3

  • Calculate weights to perform the Weighted Fuzzy C Means Clustering

STEP 4

  • Perform the genetic process of selection, mutation and cross over

STEP 5

  • The process continue until the condition meet the genetic termination

Steps of GABFCov-19 Algorithm

13 of 26

STEP 6

  • Compute the new generation fitness of the cluster and compare with each individual to find the highest fitness of individual.

STEP 7

  • Fitness = Matching parameters

Steps of GABFCov-19 Algorithm

14 of 26

RESULT

Figure 3a:Cluster 1 based on location

Figure 3b: Cluster 2 based on location

15 of 26

RESULT (Contd…)

  • The clusters earlier have been passed as input to the GABFCov-19 algorithm.

  • They are converted in to binary chromosomes.

  • The binary chromosome generated has been run over the parameters mentioned in step 3 of the proposed algorithm to find the high probability positive cases.

16 of 26

RESULT (Contd…)

• Age

• Sensitivity

• Past case history

• Fever

• Cold & Cough

• Breathing Problem

• Tiredness

• Chest pain or pressure

• Loss of speech or movement

• Headache

• Loss of taste or smell

• A rash on skin, or discolouration of fingers or toes

• Aches and pains

• Sore throat

• Diarrhoea

TOTAL PARAMETERS: 15

17 of 26

Parameter tested on 12 individuals

[[0. 0. 0. 0. 0. 1. 1. 0. 0. 1. 1. 1. 1. 1. 1.]

[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]

[0. 0. 0. 0. 1. 0. 0. 0. 1. 1. 1. 1. 0. 0. 0.]

[1. 1. 0. 1. 0. 1. 0. 0. 0. 1. 1. 0. 0. 0. 0.]

[1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1.]

[1. 0. 0. 1. 1. 1. 1. 0. 1. 1. 1. 0. 1. 0. 1.]

[1. 0. 1. 1. 0. 0. 1. 0. 1. 0. 0. 0. 1. 0. 0.]

[0. 1. 0. 0. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1.]

[0. 1. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 1. 1. 0.]

[0. 0. 0. 0. 0. 1. 0. 0. 0. 0. 0. 0. 0. 1. 0.]

[0. 1. 1. 1. 1. 1. 0. 1. 1. 0. 0. 1. 1. 1. 1.]

[1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 0. 0. 1. 1. 1.]]

RESULT (Contd…)

18 of 26

RESULT (Contd…)

Chromosome of the positive case generated is taken from the cluster

[0. 0. 0. 0. 1. 0. 0. 1. 1. 1. 1. 0. 1. 1. 0.]

Figure 4: Chromosomes of the cluster

19 of 26

Reference solution:

[0. 0. 0. 0. 1. 0. 0. 1. 1. 1. 1. 0. 1. 1. 0.]

Starting population:

[[0. 0. 0. 0. 0. 1. 1. 0. 0. 1. 1. 1. 1. 1. 1.]

[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]

[0. 0. 0. 0. 1. 0. 0. 0. 1. 1. 1. 1. 0. 0. 0.]

[1. 1. 0. 1. 0. 1. 0. 0. 0. 1. 1. 0. 0. 0. 0.]

[1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1.]

[1. 0. 0. 1. 1. 1. 1. 0. 1. 1. 1. 0. 1. 0. 1.]

[1. 0. 1. 1. 0. 0. 1. 0. 1. 0. 0. 0. 1. 0. 0.]

[0. 1. 0. 0. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1.]

[0. 1. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 1. 1. 0.]

[0. 0. 0. 0. 0. 1. 0. 0. 0. 0. 0. 0. 0. 1. 0.]

[0. 1. 1. 1. 1. 1. 0. 1. 1. 0. 0. 1. 1. 1. 1.]

[1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 0. 0. 1. 1. 1.]]

AFTER APPLYING FITNESS FUNCTION

Fitness Scores: [ 8 8 11 6 7 8 6 10 9 8 7 7]

RESULT (Contd…)

20 of 26

RESULT (Contd…)

Parent chromosomes 1: [1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 1.]

Parent chromosomes 2: [0. 0. 1. 0. 0. 0. 1. 0. 1. 1. 0. 0. 0. 0. 1.]

Child chromosome 1: [1. 1. 1. 1. 1. 1. 1. 1. 1. 1. 0. 0. 0. 0. 1.]

Child chromosome 2: [0. 0. 1. 0. 0. 0. 1. 0. 1. 1. 1. 1. 1. 1. 1.]

Population before mutation:

[[0. 1. 0. 0. 0. 0. 1. 0. 0. 1. 1. 0. 0. 0. 0.]

[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 1. 1.1.]]

21 of 26

RESULT (Contd…)

Population after mutation:

[[0. 1. 1. 0. 0. 0. 1. 1. 0. 1. 0. 0. 0. 0. 1.]

[0. 0. 1. 0. 0. 0. 0. 0. 0. 0. 0. 0. 1. 1. .]]

Result after 200 generations:

Starting best score, percent target: 86.7

End best score, percent target: 100.0

Figure 5: Comparison of GABFCov-19 with Existing Algorithm

22 of 26

CONCLUSION

  • A new algorithm named GABFCov-19 algorithm where the inputs are taken from the clusters generated by proximity app.

  • Match the symptoms of a Covid-19 positive patient with the cluster

  • Forecast the risk associated with the individuals

23 of 26

CONCLUSION (Contd…)

  • Results have been tested with 15 parameters

  • The process was repeated iteratively and we get the accuracy of predictive positive cases to be 86.7%

24 of 26

FUTURE SCOPE OF THE STUDY

  • Algorithm has good accuracy

  • High testing time is a disadvantage

  • Using machine learning and artificial intelligence

25 of 26

REFERENCES

  • Rizk-Allah, R.M. And Hassanien, A.E., 2020. COVID-19 forecasting based on an improved interior search algorithm and multi-layer feed forward neural network. arXiv preprint arXiv:2004.05960.

  • Ghazaly, N.M., Abdel-Fattah, M.A. And Abd El-Aziz, A.A., Novel Coronavirus Forecasting Model using Nonlinear Autoregressive Artificial Neural Network.

  • Al-qaness, M.A., Ewees, A.A., Fan, H. And Abd El Aziz, M., 2020. Optimization method for forecasting confirmed cases of covid-19 in china. Journal of Clinical Medicine, 9(3), p.674.

26 of 26

THANK YOU