A Revised and efficient K-means Clustering Algorithm

Authors

Jat P Computer Science and engineering, College Of Technology and Engineering, Maharana Pratap University of Agriculuture and Technology, Udaipur, India
Jain K Computer Science and engineering, College Of Technology and Engineering, Maharana Pratap University of Agriculuture and Technology, Udaipur, India

DOI:

https://doi.org/10.26438/ijcse/v6i12.118124

Keywords:

Clustering, Centroids

Abstract

In digital era large volumes of data are generated by enterprises. Mining on this large volume of data provides valuable insights into user behaviors and helps to improve the business. Various Machine learning algorithms are proposed for data mining. Clustering is an important data mining algorithm for grouping the records and analyzing the data. K-means is a most used Clustering algorithm, but the time taken to cluster large volume of records is high. To reduce the clustering time many approaches are proposed in literature. In this work an improved K-means clustering is proposed which is able to reduce the clustering time.

References

[1] Wang Shunye “An Improved K-means Clustering Algorithm Based on Dissimilarity” 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC)Dec 20-22, 2013, Shenyang, China IEEE.

[2] Navjot Kaur, Jaspreet Kaur Sahiwal, Navneet Kaur “EFFICIENT KMEANSCLUSTERING ALGORITHM USING RANKING METHOD IN DATA MINING” ISSN: 2278 – 1323 International Journal of Advanced Research in Computer Engineering & Technology Volume 1, Issue 3, May2012.

[3] Md. Sohrab Mahmud, Md. Mostafizer Rahman, and Md.Nasim Akhtar ―”Improvement of K-means Clustering algorithm with better initial centroids based on weighted average” 2012 7th International Conference on Electrical and Computer Engineering 20-22 December, 2012, Dhaka, Bangladesh, 2012 IEEE.

[4] Juntao Wang & Xiaolong Su “ An improved K-Means clustering algorithm” 2011 IEEE.

[5] Mohamed Abubaker, Wesam Ashour, "Efficient Data Clustering Algorithms: Improvements over K-means", International Journal of Intelligent Systems and Applications, vol. 5, issue 3, pages 37-49, 2013.

[6] Mohammed EI Agha, Wesam M. Ashour, " Efficient and Fast Initializtion Algorithm for K-means Clustering", LJ. Intelligent Systems and Applications, vol. 4, issue 1, pages 21-31, 2012

[7] Stephen J. Redmon, Conor Heneghan, " A method for initializing the K-means clustering algorithm using kd-trees", Journal Pattern Recognition Letters, vol. 28, issue 8, pages 965-973, 2007.

[8] Ling-bo Han, Qiang Wang, Zhengfeng Jiang etc..Improved k-means initial clustering center selection algorithm. Computer Engineering and Applications. 2010, 46(17):150–152.

[9] Wang, H., Qi, J., Zheng, W., & Wang, M. “Balance K-means algorithm. In Computational Intelligence and Software Engineering,” Cise 2009 International Conference on, pp. 1-3, IEEE

[10] Idrizi F., Rustemi, A., & Dalipi F., (2017, June), Anew modified sorting algorithm: A comparison with state of the art. In embedded computing (MECO) .20176th Mediterranean Conference on (pp 1-6)IEEE.

[11] Esteves, R. M., Hacker, T., & Rong, C. “Competitive k-means, a new accurate and distributed k-means algorithm for large datasets” In Cloud Computing Technology and Science (cloudcom), 2013 IEEE 5th International Conference on ,Vol. 1, pp. 17-24.

[12]MerzCand Murphy P, UCI Repository of MachineLearningDatabases,Available:ftp://ftp.ics.uci.edu/pub/machine-learning-databases

Downloads

PDF ⁰

Published

2018-12-31

CITATION

DOI: 10.26438/ijcse/v6i12.118124

Published: 2018-12-31

How to Cite

[1]

P. Jat and K. Jain, “A Revised and efficient K-means Clustering Algorithm”, Int. J. Comp. Sci. Eng., vol. 6, no. 12, pp. 118–124, Dec. 2018.

Download Citation

Issue

Vol. 6 No. 12 (2018): IJCSE December Editor

Section

Research Article

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.

A Revised and efficient K-means Clustering Algorithm

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Journal Information

UGC Gazette Regulation

Join Editorial Board

Information

Current Issue

Keywords