Data Based Model for Predicting COVID-19 Incidence Using Data Mining

Authors

  • Yadav R Computer Science and Engineering, M.I.T.S. Gwalior, India
  • Kumar Manjhwar A Computer Science and Engineering, M.I.T.S. Gwalior, India

DOI:

https://doi.org/10.26438/ijcse/v10i5.6573

Keywords:

Covid19, Machine Learning, , EDA (Exploratory data analysis), Linear Regression,, Gaussian Naïve Baye, Decision Tree,, Ensemble learning,, Random forest,, Gradient boosting

Abstract

Since, covid19 is affecting many countries in the world therefore, to take the necessary steps in order to control the outbreak or incidence of covid19, which is possible if we know the outbreak or incidence of covid19 which is possible with machine learning. Therefore, in this study we are analyzing the covid19 data of India and performing EDA (exploratory data analysis) and proposing various machine learning algorithm in order to predict the outbreak of covid19. We are using various machine learning algorithms like Linear regression, Gaussian naïve bayes, Decision tree and ensemble learning like random forest, gradient boosting and then finding the best algorithm by comparing their accuracy score. With the help of best algorithm, the outbreak of covid19 to manage the health crisis in each country can be controlled by taking the essential steps.

References

[1] Cucinotta D, Vanelli M. WHO Declares COVID-19 a Pandemic. Acta Biomed. 91:157–160, 2020.

[2] Zhao Y, Shou M, Wang Z. Prediction of the Number of Patients Infected with COVID-19 Based on Rolling Grey Verhulst Models. Int J Environ Res Public Health. 17(12):4582, 2020.

[3] Leeuwenberg AM, Schuit E. Prediction models for COVID-19 clinical decision making. The Lancet Digital Health. 2:e496–e497, 2020.

[4] M. I. Jordan1 and T. M. Mitchell, Machine learning: Trends, perspectives, and prospects (sciencemag.org) 2019.

[5] Godio A, Pace F, Vergnano A. SEIR Modeling of the Italian Epidemic of SARS-CoV-2 Using Computational Swarm Intelligence. Int J Environ Res Public Health. 17(10):3535, 2020.

[6] Sharma, V.K., Nigam, U.: Modelling and forecasting of COVID-19 growth curve in India. Trans. Indian Nat. Acad. Eng. 5, 697–710, 2020.

[7] Saeed, S., Humayun, M.: Quantitative analysis of COVID-19 patients: a preliminary statistical result of deep learning artificial intelligence framework. In: Book Series: ICT Solutions for Improving Smart Communities in Asia, IGI Gobal, pp. 218–242, 2021.

[8] Ghosal, S., Sengupta, S., Majumder, M., Sinha, B.: Linear regression analysis to predict the number of deaths in India due to SARS-CoV-2 at 6 weeks from day 0 (100 cases - March 14th 2020). Diabetes Metab. Syndr. Clin. Res. Rev. 14(January), 311–315, 2020.

[9] Rushing J, Ramachandran R, Nair U, Graves S, Welch R, Lin H. ADaM: a data mining toolkit for scientists and engineers. Computers & Geosciences 31(5):607-618, Jun. 2005.

[10] Dangare CS, Apte SS. Improved Study of Heart Disease Prediction System using Data Mining Classification Techniques. IJCA 30;47(10):44-48, Jun 2012.

[11] R.S. Yadav, Data analysis of COVID-2019 epidemic using machine learning methods: a case study of India. Int. J. Inf. Technol., pp. 1-10, 2020.

[12] S.F. Ardabili, A. Mosavi, P. Ghamisi, F. Ferdinand, A.R. Varkonyi-Koczy, U. Reuter, et al., COVID-19 outbreak prediction with machine learning, medRxiv (2020), 2020.04.17.20070094

[13] P. Bedi, S. Dhiman, P. Gole, N. Gupta, V. Jindal, Prediction of COVID-19 trend in India and its four worst-affected states using modified SEIRD and LSTM models, SN Comput. Sci., 2(3), p. 224, 2021.

[14] T. Chakraborty, A. Bhattacharyya, M. Pattnaik, Theta autoregressive neural network model for COVID-19 outbreak predictions, medRxiv (2020), 2020.10.01.20205021

[15] A. Chatterjee, M.W. Gerdes, S.G. Martinez, Statistical explorations and univariate timeseries analysis on COVID-19 datasets to understand the trend of disease spreading and death, Sensors, 20(11), 2020.

[16] N.P. Dharani, P. Bojja, P. Raja Kumari, Evaluation of performance of an LR and SVR models to predict COVID-19 pandemic, Mater. Today Proc. 2021.

[17] S.W. Doe, T.R. Seekins, D. Fitzpatrick, D. Blanchard, S.Y. Sekeh, Adaptive County Level COVID-19 Forecast Models: Analysis and Improvement (2020), arXiv preprint arXiv:200612617

[18] S. Ganiny, O. Nisar, Mathematical modeling and a month ahead forecast of the coronavirus disease 2019 (COVID-19) pandemic: an Indian scenario, medRxiv (2020), 2020.09.10.20192195

[19] K.K.A. Ghany, H.M. Zawbaa, H.M. Sabri, COVID-19 prediction using LSTM algorithm: GCC case study, Inform. Med. Unlocked, 23, p. 100566, 2021.

[20] N.M. Ghazaly, M.A. Abdel-Fattah, A.A. Abd El-Aziz, Novel coronavirus forecasting model using nonlinear autoregressive artificial neural network, Int. J. Adv. Sci. Technol., 29, 5 Special Issue, pp. 1831-1849, 2020.

[21] Q. Guo, Z. He, Prediction of the confirmed cases and deaths of global COVID-19 using artificial intelligence, Environ. Sci. Pollut. Res. Int., 28 (9), pp. 11672-11682, 2021.

[22] K.T. Hasan, M.M. Rahman, M.M. Ahmmed, A.A. Chowdhury, M.K. Islam, 4P model for dynamic prediction of COVID-19: a statistical and machine learning approach, Cognit. Comput., pp. 1-14, 2021.

[23] B.B. Hazarika, D. Gupta Modelling and forecasting of COVID-19 spread using wavelet-coupled random vector functional link networks, Appl. Soft Comput., 96, p. 106626, 2020.

[24] M. Hawas, Generated time-series prediction data of COVID-19`s daily infections in Brazil by using recurrent neural networks, Data Brief, p.106175, 2020.

[25] B. Heni, COVID-19, Bacille Calmette-Guerin (BCG) and tuberculosis: cases and recovery previsions with deep learning sequence prediction, Ingénierie Des. Systèmes Inf., 25 (2), pp. 165-172, 2020.

[26] A.-E.E. Hridoy, M. Naim, N.U. Emon, I.H. Tipo, S. Alam, A. Al Mamun, et al., Forecasting COVID-19 dynamics and endpoint in Bangladesh: a data-driven approach, medRxiv (2020), 2020.06.26.20140905

Downloads

Published

2022-05-31
CITATION
DOI: 10.26438/ijcse/v10i5.6573
Published: 2022-05-31

How to Cite

[1]
R. Yadav and A. Kumar Manjhwar, “Data Based Model for Predicting COVID-19 Incidence Using Data Mining”, Int. J. Comp. Sci. Eng., vol. 10, no. 5, pp. 65–73, May 2022.

Issue

Section

Research Article