An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data
DOI:
https://doi.org/10.26438/ijcse/v8i2.1217Keywords:
Data mining,, Big data cluster analysi, Fuzzy, LinkageAbstract
In data mining, clustering algorithm is a powerful meta-learning tool to precisely examine the huge volume of data created by recent applications. In particular, their major objective is to group data into clusters such that data points are grouped in the similar cluster when they are “similar” according to specific metrics. Several clustering algorithms have been developed to deal with very large number of features or with a very high number of dimensions, but they are often not practical when the data is large in both aspects. To address these issues, this paper work, developed an Enhanced Fuzzy based Linkage Clustering Algorithm (EFCA), which combines FCM and cluster assignment strategy to solve the optimization problem during high dimensional data processing. The proposed EFCA approach it can work with large volumes of high dimensional dataset for discovering the outliers. The experimental results shown that the proposed EFCA performance to improve 21.9% especial in terms of Partition Accuracy (PA), Dunn Index (DI) improves 28 %, and Computational time improves 16.4% compared with other existing clusiVAT and FensiVAT algorithms.
References
1] U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy. Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, 1996.
[2] Hoppner, F.; Klawnn, F.; Kruse, R.; Runkler, T.;. Fuzzy Cluster Analysis: “Methods for classification data analysis and image recognition” John Wiley & Sons Inc. New York NY., 2000.
[3] H. Gunadi “Comparing nearest neighbor algorithms in highdimensional space ” 2011.
[4] T. C. Havens and J. C. Bezdek “An efficient formulation of the improved visual assessment of cluster tendency (iVAT) algorithm ” IEEE Trans. Knowl. Data Eng. vol. 24, no. 5, pp. 813–822, May 2012.
[5] D. Kumar, M. Palaniswami, S. Rajasegarar, C. Leckie, J. C. Bezdek and T. C. Havens “clusiVAT: A mixed visual/numerical clustering algorithm for big data ” in Proc. IEEE Int. Conf. Big Data, pp. 112–117, 2013.
[6] A. Fahad, N. Alshatri, Z. Tari, A. Alamri, I. Khalil, A. Y. Zomaya, S. Foufou and A. Bouras “A survey of clustering algorithms for big data: Taxonomy and empirical analysis ” IEEE Trans. Emerging Topics Comput., vol. 2, no. 3, pp. 267–279, Sep. 2014.
[7] M. Popescu, J. Keller, J. Bezdek and A. Zare “Random projections fuzzy c-means (RPFCM) for big data clustering ” in Proc. IEEE Int. Conf. Fuzzy Syst., pp. 1–6, 2015.
[8] D. Kumar, J. C. Bezdek, M. Palaniswami, S. Rajasegarar, C. Leckie and T. C. Havens “A hybrid approach to clustering in big data ” IEEE Trans. Cybern. vol. 46, no. 10, pp. 2372–2385, Oct. 2016.
[9] J. C. Bezdek, Primer on Cluster Analysis: Four Basic Methods that (Usually) Work, vol. 1. Sarasota, FL, USA: First Edition Design Publishing, 2017.
[10] P. Rathore, J. C. Bezdek, S. M. Erfani, S. Rajasegarar, and M. Palaniswami “Ensemble fuzzy clustering using cumulative aggregation on random projections ” IEEE Trans. Fuzzy Syst. vol. 26, no. 3, pp. 1510–1524, Jun. 2018.
[11] P. Rathore, A. S. Rao, S. Rajasegarar, E. Vanz, J. Gubbi, and M. Palaniswami “Real-time urban microclimate analysis using internet of things ” IEEE Internet Things J. vol. 5, no. 2, pp. 500–511, Apr. 2018.
[12] Punit Rathore, Dheeraj Kumar, James C. Bezdek, Sutharshan Rajasegarar, and Marimuthu Palaniswami, "A Rapid Hybrid Clustering Algorithm for Large Volumes of High Dimensional Data", IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 31, NO. 4, APRIL 2019.
[13] J. C. Dunn “A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters ” J. Cybern. vol. 3, no. 3, pp. 32–57, 1973.
[14] P. Rathore, Z. Ghafoori, J. C. Bezdek, M. Palaniswami, and C. Leckie “Approximating Dunn’s cluster validity indices for partitions of big data ” IEEE Trans. Cybern
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
