Predicting Heart Attack Using NBC, k-NN and ID3
Keywords:
Classification, ID3, Data mining, Supervised Learning, Naive Bayesian, k-Nearest NeighborAbstract
We are living in a world full of data. Every day people encounter large amounts of data. Main problem here is dealing with this huge data. Data mining techniques can be used to handle such huge data. Health care environment collects vast amounts of data, but the unfortunate thing is that it is not efficient in extracting useful information from this wealthy data. Data mining techniques can be applied to extract valuable knowledge from the health care environment. In this paper, three supervised learning classification algorithms have been implemented to predict heart attack risk from heart disease database. The classification algorithms used are Naive Bayesian Classification (NBC), k-Nearest Neighbor (k-NN) Classification and ID3 Classification. As a pre-processing step Discretization of continuous variables is adopted. The heart disease data set is trained with these classifiers. A GUI is designed so that the user can input patient�s record. The system is then able to predict whether or not the user has a risk of heart attack. The performance of these three algorithms is determined by computing accuracy. From the experiments, it is found that ID3 Classification outperforms other two classifiers with the accuracy of 91.72%.
References
Sivagowry .S, Dr. Durairaj. M, Persia.A, “An Empirical Study on applying Data Mining Techniques for the Analysis and Prediction of Heart Disease”, Int, Conference on Information Communication and Embedded System (ICICES), ISBN: 978-1-4673-5786-9, Page No (265-270), Feb 21-22, 2013
Jiawei Han, Micheline Kamber, and Jian Pei, “Data Mining Concepts and Techniques”, Morgan Kaufmann Publishers, Third (3rd) Edition, ISBN: 1-55860-901-6, 2012
Jyoti Soni, Uzma Ansari, Dipesh Sharma, Sunita Soni, “Intelligent and Effective Heart Disease Prediction System using Weighted Associative Classifiers”, Int. Journal on Computer Science and Engineering (IJCSE), Volume-03, Issue-06, Page No (2385-2392), 2011
Asha Rajkumar, Mrs. G.Sophia Reena, “Diagnosis Of Heart Disease Using Datamining Algorithm”, Global Journal of Computer Science and Technology, Vol ume-10, Issue--10, Page No (38-43), 2010
Indian Express: http://archive.indianexpress.com/news/india-set-to-be--heart-disease-capital-of-world--say-doctors/1009607/
UCI Machine Learning Repository [Online]. Available: http://archive.ics.uci.edu/ml/datasets/Heart+Disease
K.Srinivas, B.Kavihta Rani, Dr. A.Govrdhan, “Applications of Data Mining Techniques in Healthcare and Prediction of Heart Attacks”, International Journal on Computer Science and Engineering (IJCSE), Volume-02, Issue-02, Page No (250-255), 2010
Shamsher Bahadur Patel, Pramod Kumar Yadav, Dr. D. P.Shukla, “Predict the Diagnosis of Heart Disease Patients Using Classification Mining Techniques”, IOSR Journal of Agriculture and Veterinary Science (IOSR-JAVS), Volume -04 Issue-02, Page No (61-64), 2013
Mai Shouman, Tim Turner, Rob Stocker, “Applying k-Nearest Neighbour in Diagnosing Heart Disease Patients”, International Journal of Information and Education Technology, Volume-02 Issue-03, Page No (220-223), 2012
Yanwei Xing, Jie Wang, Zhihong Zhao, Yonghong Gao, “Combination data mining methods with new medical data to predicting outcome of Coronary Heart Disease”, International Conference on Convergence Information Technology, ISBN: 0-7695-3038-9, Page No (868 – 872), Nov 21-23, 2007
Mary Slocum, “Decision Making Using ID3 Algorithm”, Rivier Academic Journal, Volume-08, Number-02, Page No (1-12), 2012
Hnin Wint Khaing, “Data Mining based Fragmentation and Prediction of Medical Data”, Int, Conference on Computer Researh and Development(ICCRD), ISBN: 978-1-61284-839-6, Page No (480-485), March 11-13, 2011
EntropyBasedBinning: http://www.saedsayad.com/supervised_binning.htm
Pang-Ning Tan, Vipin Kumar, Michael Steinbach, “Introduction to Data Mining”, Addison-Wesley, 2006
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
