An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data

Authors

  • Kiruthika R Dept. of Computer Science, Sri Ramakrishna College of Arts and Science, Coimbatore, India
  • Vijayakumar v Dept. of Computer Science, Sri Ramakrishna College of Arts and Science, Coimbatore, India

DOI:

https://doi.org/10.26438/ijcse/v8i2.1217

Keywords:

Data mining,, Big data cluster analysi, Fuzzy, Linkage

Abstract

In data mining, clustering algorithm is a powerful meta-learning tool to precisely examine the huge volume of data created by recent applications. In particular, their major objective is to group data into clusters such that data points are grouped in the similar cluster when they are “similar” according to specific metrics. Several clustering algorithms have been developed to deal with very large number of features or with a very high number of dimensions, but they are often not practical when the data is large in both aspects. To address these issues, this paper work, developed an Enhanced Fuzzy based Linkage Clustering Algorithm (EFCA), which combines FCM and cluster assignment strategy to solve the optimization problem during high dimensional data processing. The proposed EFCA approach it can work with large volumes of high dimensional dataset for discovering the outliers. The experimental results shown that the proposed EFCA performance to improve 21.9% especial in terms of Partition Accuracy (PA), Dunn Index (DI) improves 28 %, and Computational time improves 16.4% compared with other existing clusiVAT and FensiVAT algorithms.

References

1] U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy. Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, 1996.

[2] Hoppner, F.; Klawnn, F.; Kruse, R.; Runkler, T.;. Fuzzy Cluster Analysis: “Methods for classification data analysis and image recognition” John Wiley & Sons Inc. New York NY., 2000.

[3] H. Gunadi “Comparing nearest neighbor algorithms in highdimensional space ” 2011.

[4] T. C. Havens and J. C. Bezdek “An efficient formulation of the improved visual assessment of cluster tendency (iVAT) algorithm ” IEEE Trans. Knowl. Data Eng. vol. 24, no. 5, pp. 813–822, May 2012.

[5] D. Kumar, M. Palaniswami, S. Rajasegarar, C. Leckie, J. C. Bezdek and T. C. Havens “clusiVAT: A mixed visual/numerical clustering algorithm for big data ” in Proc. IEEE Int. Conf. Big Data, pp. 112–117, 2013.

[6] A. Fahad, N. Alshatri, Z. Tari, A. Alamri, I. Khalil, A. Y. Zomaya, S. Foufou and A. Bouras “A survey of clustering algorithms for big data: Taxonomy and empirical analysis ” IEEE Trans. Emerging Topics Comput., vol. 2, no. 3, pp. 267–279, Sep. 2014.

[7] M. Popescu, J. Keller, J. Bezdek and A. Zare “Random projections fuzzy c-means (RPFCM) for big data clustering ” in Proc. IEEE Int. Conf. Fuzzy Syst., pp. 1–6, 2015.

[8] D. Kumar, J. C. Bezdek, M. Palaniswami, S. Rajasegarar, C. Leckie and T. C. Havens “A hybrid approach to clustering in big data ” IEEE Trans. Cybern. vol. 46, no. 10, pp. 2372–2385, Oct. 2016.

[9] J. C. Bezdek, Primer on Cluster Analysis: Four Basic Methods that (Usually) Work, vol. 1. Sarasota, FL, USA: First Edition Design Publishing, 2017.

[10] P. Rathore, J. C. Bezdek, S. M. Erfani, S. Rajasegarar, and M. Palaniswami “Ensemble fuzzy clustering using cumulative aggregation on random projections ” IEEE Trans. Fuzzy Syst. vol. 26, no. 3, pp. 1510–1524, Jun. 2018.

[11] P. Rathore, A. S. Rao, S. Rajasegarar, E. Vanz, J. Gubbi, and M. Palaniswami “Real-time urban microclimate analysis using internet of things ” IEEE Internet Things J. vol. 5, no. 2, pp. 500–511, Apr. 2018.

[12] Punit Rathore, Dheeraj Kumar, James C. Bezdek, Sutharshan Rajasegarar, and Marimuthu Palaniswami, "A Rapid Hybrid Clustering Algorithm for Large Volumes of High Dimensional Data", IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 31, NO. 4, APRIL 2019.

[13] J. C. Dunn “A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters ” J. Cybern. vol. 3, no. 3, pp. 32–57, 1973.

[14] P. Rathore, Z. Ghafoori, J. C. Bezdek, M. Palaniswami, and C. Leckie “Approximating Dunn’s cluster validity indices for partitions of big data ” IEEE Trans. Cybern

Downloads

Published

2020-02-28
CITATION
DOI: 10.26438/ijcse/v8i2.1217
Published: 2020-02-28

How to Cite

[1]
R. Kiruthika and V. Vijayakumar, “An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data”, Int. J. Comp. Sci. Eng., vol. 8, no. 2, pp. 12–17, Feb. 2020.

Issue

Section

Research Article