Determination of Optimal Number of Clusters in Cure Using Representative Points
DOI:
https://doi.org/10.26438/ijcse/v6i2.313320Keywords:
Algorithm, Clustering, CURE, MeasureAbstract
In most of the clustering algorithms, the number of clusters has to be supplied in as an input. In CURE clustering algorithm also, the same problem exists. In this paper, we try to find the optimal cluster number in the CURE clustering algorithm by calculating an optimality measure corresponding to each cluster number produced by CURE clustering algorithm after it enters a range ,based on the intra cluster measure and the inter cluster measure of the clusters. The clustering along with the optimality check continues as long the optimality measure is increasing and the cluster number doesn’t fall below the minimum boundary of our range.
References
Thomas H. Cormen, Charles E. Leiserson, and Ronald L. Rivest. Introduction to Algorithms, The MIT Press, Massachusetts, 1990.
Hanan Samet. The Design and Analysis of Spatial Data Structures. Addison-Wesley Publishing Company Inc., New York, 1990.
R.O. Duda, P.E. Hart, D.G. Stork, Pattern Classification, second ed., Wiley-Interscience, New York, 2001.
J.Han, M. Kamber, Data Mining, Morgan Kaufmann Publishers, 2000.
M. Grotschel, L. Lovasz, A. Schrijver, Geometric Algorithms and Combinatorial Optimization, second ed., Algorithm and Combinatorics, vol. 2, Springer-Verlag, Berlin, Heidelberg, 1994.
A.K. Jain, M.N. Murty, P.J. Flynn, Data clustering: A review, ACM Computing Surveys 31 (1999) 264–323.
V.V. Vazirani, Approximation Algorithms, Springer, 2001.
Sudipto Guha, Rajeev Rastogi, and Kyuseok Shim (1998) CURE: An Efficient Clustering Algorithm for Large Databases
Siddheswar Ray and Rose H. Turi Determination of Number of Clusters in K-Means Clustering and Application in Colour Image Segmentation.
Ng R., Han J. (1994) Efficient and effective clustering method for spatial data mining. In Proc. Conf. on Very Large Data Bases, pp 144-155.
Ester M., Kriegel H., Sander J., and Xu X. (1996) A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proc. 3rd Int. Conf. on Knowledge Discovery and Data Mining, AAAI Press.
Hinneburg A., Keim D. (1998) An Efficient Approach to Clustering in Large Multimedia Databases with Noise. Proc AAAI.
Seikholeslami G., Chatterjee S., and Zhang A. WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases. Proceedings of the 24th VLDB Conference, 1998
Guha, S., R. Rastogi & K. Shim. CURE: An Efficient Clustering Algorithm for Large Databases. In Proc. Of ACM SIGMOD Intl. Conf. on Management of Data, pp. 73-82, 1998.
Karypis, G., E. Han & V. Kumar. Chameleon: A hierarchical clustering algorithm using dynamic modeling. IEEE Computer, 32(8) pp. 68-75, 1999.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
