A Survival Study on Data Structure Based Clustering Techniques for Multidimensional Data Stream Analysis

Authors

  • K Chitra School of Computer Studies, Rathnavel Subramaniam College of Arts and Science, Coimbatore, India
  • D Maheswari School of Computer Studies, Rathnavel Subramaniam College of Arts and Science, Coimbatore, India

DOI:

https://doi.org/10.26438/ijcse/v5i12.101108

Keywords:

Data stream, Multidimensional data, Data mining, Data structure, Subspace clustering

Abstract

Data mining plays an effective role in the field of computer science to analysis the data objects. The data mining process is used to mine the knowledge from huge database. Then, the extracted information is modified into an understandable data structure for the future analysis. The data structure in a computer is an essential approach to categorize and manage the data which is utilized for efficient usage. The data stream is referred as a structured sequence of instances; the data stream mining discovers the knowledge structures from continuous and fast data records. The clustering is the process of creating the group by collecting the data of similar patterns and also describes the meaningful structure of data. The additional process of traditional clustering termed as Subspace Clustering which is utilized for detecting the clusters in various subspaces within dataset. Then, the subspace clustering algorithms are introduced to discover the cluster in multiple overlapping subspaces by searching the relevant dimensions. Many research works are developed for managing the high dimensional data with the objective of providing better improvement on minimizing the performance of dimensionality and enhancing the clustering accuracy. However, the existing works failed to reduce the space complexity. Therefore, the research work focuses on reducing the dimensionality with improved clustering accuracy by executing the clustering and subspace clustering for data stream with data structure techniques.

References

Salman Ahmed Shaikh and Hiroyuki Kitagawa, “Top-k Outlier Detection from Uncertain Data”, International Journal of Automation and Computing, Volume 11, Issue 2, Pages 128-142, April 2014.

Juan Zamora, Marcelo Mendoza and Hector Allende, “Hashing-based clustering in high dimensional data”, Expert Systems with Applications, Elsevier, Volume 62, Pages 202-211, November 2016,

Weiling Cai, “A dimension reduction algorithm preserving both global and local clustering structure”, Knowledge-Based Systems, Volume 118, Pages 191–203, February 2017

Ewa Nowakowska, Jacek Koronacki and Stan Lipovetsky, “Dimensionality reduction for data of unknown cluster structure”, Information Sciences, Elsevier, Volume 330, Pages 74-87, February 2016,

Yuhua Qian, Feijiang Li, Jiye Liang, Bing Liu, and Chuangyin Dang, “Space Structure and Clustering of Categorical Data”, IEEE Transactions on Neural Networks and Learning Systems, Volume 27, Issue 10, Pages 2047-2059, October 2016

Can Wang, Xiangjun Dong, Fei Zhou, Longbing Cao and Chi-Hung Chi, “Coupled Attribute Similarity Learning on Categorical Data”, IEEE Transactions on Neural Networks and Learning Systems, Volume 26, Issue 4, Pages 781-797, April 2015

Yifan Fu, Junbin Gao, David Tien, Zhouchen Lin, and Xia Hong, “Tensor LRR and Sparse Coding-Based Subspace Clustering”, IEEE Transactions on Neural Networks and Learning Systems, Volume 27, Issue 10, Pages 2120 – 2133, October 2016

Xi Peng, Zhiding Yu, Zhang Yi, and Huajin Tang, “Constructing the L2-Graph for Robust Subspace Learning and Subspace Clustering”, IEEE Transactions on Cybernetics Volume 47, Issue 4, Pages 1053 – 1066, April 2017

Madjid Khalilian, Norwati Mustapha and Nasir Sulaiman, “Data stream clustering by divide and conquer approach based on vector model”, Journal of Big Data, Springer, Volume 3, Issue 1, Pages 1-21, 2016

Niwan Wattanakitrungroj, Saranya Maneeroj and Chidchanok Lursinsap “Versatile Hyper-Elliptic Clustering Approach for Streaming Data Based on One-Pass-Thrown-Away Learning”, Journal of Classification, Springer, Volume 34, Issue 1, Pages 108–147, April 2017

Dominik Olszewski, “Asymmetric k-Means Clustering of the Asymmetric Self-Organizing Map”, Neural Processing Letters, Springer, Volume 43, Pages 231–253, 2016

Ghatage Trupti B, Patil Deepali E, Takmare Sachin B and Patil Sushama A “Joint Feature Learning and Clustering Techniques for Clustering High Dimensional Data: A Review”, International Journal of Computer Sciences and Engineering, Volume 4, Issue 3, Pages 54 – 58, 2016

Downloads

Published

2025-11-12
CITATION
DOI: 10.26438/ijcse/v5i12.101108
Published: 2025-11-12

How to Cite

[1]
K. Chitra and D. Maheswari, “A Survival Study on Data Structure Based Clustering Techniques for Multidimensional Data Stream Analysis”, Int. J. Comp. Sci. Eng., vol. 5, no. 12, pp. 101–109, Nov. 2025.

Issue

Section

Research Article