Quality Cluster Generation Using Random Projections

Authors

  • PA Gat Department of Computer Science, DKTE’s College, Ichalkaranji, India
  • KS Kadam Department of Computer Science, DKTE’s College, Ichalkaranji, India

DOI:

https://doi.org/10.26438/ijcse/v7i6.933936

Keywords:

Cluster Analysis, Random Projections, Neighbouring

Abstract

Clustering is the grouping of a particular set of objects based on their characteristics, aggregating them according to their similarities. Regarding data mining, this methodology partitions the data implementing a specific join algorithm, most suitable for the desired information analysis. Clusters are obtained by using density based clustering and DBSCAN clustering. DBSCAN cluster is a fast clustering technique, large complexity and requires large parameters. To overcome of these problems uses the OPTICS density based algorithm. The algorithm requires the simply a single parameter, namely the least amount of points in a cluster which is required as input in density based technique. Using random projection improving the cluster quality and run time.

References

[1] Ester M, Krigel H-P, Sander J, Xu X(1996)”A density based algorithm for discovering clusters in large spatial databases either noise.” In proceeding of the ACM conference knowledge discovery and data mining (KDD),pp226-231.

[2] Ankerst M, Breunig MM, Kriegel H-P, Sander J (1999) “Optics: ordering points to identify the clustering structure.” In: Proceedings of the ACM international conference on management of data (SIGMOD),pp. 49–60.

[3] Alexander Hinneburg, Daniel A. Keim (1998),"An Efficient Approach to Clustering in Large Multimedia Databases with Noise [Online] Available: http://www.aaai.org.

[4] Hinneburg A, Gabriel H-H (2007) Denclue 2.0: fast clustering based on kernel density estimation. In Advances in intelligent data analysis (IDA), pp 70–80.

[5] Imran Khan, Joshua Zhexue Huang (2012),” Ensemble Clustering of High Dimensional Data With random Projection.” In: Proceeding of the international conference on information and knowledge management.

[6] Qi Xianting, Wang Pan,“A density-based clustering algorithm for high-dimensional data with feature selection”, 2016,IEEE.

[7] Schneider J, Vlachos M (2013) “Fast parameter less density-based clustering via random projections.” In: Proceedings of the international conference on information and knowledge management (CIKM), pp 861–866.

[8] Johannes Schneider, Michail Valchos(2017) “Scalable density based clustering with quality guarantees using random projections.” Published in Journal: Data Mining and Knowledge Discovery Volume 31 Issue 4, July 2017 pages 972-1005.

Downloads

Published

2019-06-30
CITATION
DOI: 10.26438/ijcse/v7i6.933936
Published: 2019-06-30

How to Cite

[1]
G. PA and K. Kadam, “Quality Cluster Generation Using Random Projections”, Int. J. Comp. Sci. Eng., vol. 7, no. 6, pp. 933–936, Jun. 2019.

Issue

Section

Research Article