Comparative Performance Analysis of Datamining and Machine Learning Techniques for Diabetes Prediction
DOI:
https://doi.org/10.26438/ijcse/v11i7.17Keywords:
KNN, Support Vector Machine, decision Tree, Naive Bayes and Artificial Neural NetworkAbstract
Diabetes is caused by the high blood sugar. Body’s main source of energy is glucose. Our body can produce glucose, but glucose also comes from the various foods we eat. One of the hormone called Insulin is generated by the pancreas to help glucose to move into the cells and to be used for energy later. If anyone is diabetic then body doesn’t make sufficient, or any insulin, or doesn’t usage insulin appropriately. Glucose then remains in the blood and not able to move to cells. Diabetes involves the risk of damage to the eyes, kidneys, nerves, and heart. Early prediction of diabetes can lower the risk of developing diabetes health problems. This paper uses five different techniques from data mining and machine learnings- KNN, Support Vector Machine, decision Tree, Naive Bayes and Artificial Neural Network for the prediction of diabetes. Comparative study based on the performance of these algorithms has been presented. The measures used for the performance analysis of all the five algorithms are Accuracy, Precision, Recall, f1-score and Support. For the experiment purpose the dataset is taken from Mendeley data[1] . It has records of 1000 patients. Result shows that decision tree achieved the best accuracy as compared to the other data mining and machine learning techniques.
References
[1] Rashid, Ahlam, “Diabetes Dataset”, Mendeley Data, V1 2020, doi:10.17632/wj9rwkp9c2.1
[2] World Health Organization, 2016. WHO/NMH/NVI/16.3.
[3] Debadri Dutta, Debpriyo Paul, Parthajeet Ghosh, "Analyzing Feature Importance’s for Diabetes Prediction using Machine Learning". Conference: 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) IEEE, pp.942-928, 2018.
[4] Tao Zheng , Wei Xie , Liling Xu , Xiaoying He , Ya Zhang , Mingrong You , Gong Yang , You Chen "A machine learning-based framework to identify type 2 diabetes through electronic health records." International journal of medical informatics 97. pp.120- 127, 2017.
[5] Nahla hosny Barakat, Andrew P Bradley and Mohammed N Barakat, "Intelligible support vector machines for diagnosis of diabetes mellitus”. Information Technology in Biomedicine", IEEE Transactions. pp.1114-1120, 14 July, 2010.
[6] V. Veena Vijayan and C. Anjali, ”Prediction and diagnosis of diabetes mellitus A machine learning approach”. IEEE Recent Advances in Intelligent Computational Systems (RAICS), pp.122–127, 2015. doi:10.1109/RAICS.2015.7488400.
[7] Deeraj Shetty, Kishor Rit, Sohail Shaikh, and Nikita Patil, "Diabetes Disease Prediction Using Data Mining". International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS), 2017.
[8] B.P. Shantakumar, and Y.S. Kumaraswamy, “Intelligent and effective heart attack prediction system using data mining and artificial neural networks”. European Journal of Scientific Research. Vol.31, pp.642–656. 2009.
[9] Karim M. Orabi, Yasser M. Kamal and Thanaa M. Rabah,”Early Predictive System for Diabetes Mellitus Disease”. In Industrial Conference on Data Mining, Springer, pp.420–427, 2016.
[10] Tarik A. Rashid, Saman M. Abdullah, and Reezhna M. Abdullah, “An Intelligent Approach for Diabetes Classification, Prediction and Description”. In Advances in Intelligent Systems and Computing, Vol.424, pp.323–335, 2016.
[11] Zarita Zainuddin, Pauline Ong, and Cemal Ardil, “ A neural network approach in predicting the blood glucose level for diabetic patients”. International Journal of Computational Intelligence, Vol.5, Issue.1, pp.72-79, 2009.
[12] Sadhana Tiwari , Awadhesh Kumar , Aasha Singh, “A Machine Learning Based Diabetes Prediction Using Stacking and Stacking With Hyperparameter Tuning”, International Journal of Computer Sciences and Engineering, Vol.10, Issue.6, June 2022.
[13] Pradeep Kumar G., R. Vadivel, “Python Based Diabetes Prediction Using Ensemble Machine Learning Techniques Using LR Algorithm and Hybrid Method”, International Journal of Computer Sciences and Engineering, Vol.10, Issue.5, May 2022.
[14] K. Gandhimathi1 , N. Umadevi, “ Prediction of Type 2 Diabetics besed on Clustering Algorithm:”, International Journal of Computer Sciences and Engineering, Vol.8, Issue.11, November 2020.
[15] B. Vinothkumar1* , M. Ramaswami, “A Novel Prediction of Diabetes by Automatic Insulin Therapy Using Machine Learning Algorithm”, International Journal of Computer Sciences and Engineering, Vol.8, Issue.3, Mar 2020.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
