Quality Cluster Generation Using Random Projections
DOI:
https://doi.org/10.26438/ijcse/v7i6.933936Keywords:
Cluster Analysis, Random Projections, NeighbouringAbstract
Clustering is the grouping of a particular set of objects based on their characteristics, aggregating them according to their similarities. Regarding data mining, this methodology partitions the data implementing a specific join algorithm, most suitable for the desired information analysis. Clusters are obtained by using density based clustering and DBSCAN clustering. DBSCAN cluster is a fast clustering technique, large complexity and requires large parameters. To overcome of these problems uses the OPTICS density based algorithm. The algorithm requires the simply a single parameter, namely the least amount of points in a cluster which is required as input in density based technique. Using random projection improving the cluster quality and run time.
References
[1] Ester M, Krigel H-P, Sander J, Xu X(1996)”A density based algorithm for discovering clusters in large spatial databases either noise.” In proceeding of the ACM conference knowledge discovery and data mining (KDD),pp226-231.
[2] Ankerst M, Breunig MM, Kriegel H-P, Sander J (1999) “Optics: ordering points to identify the clustering structure.” In: Proceedings of the ACM international conference on management of data (SIGMOD),pp. 49–60.
[3] Alexander Hinneburg, Daniel A. Keim (1998),"An Efficient Approach to Clustering in Large Multimedia Databases with Noise [Online] Available: http://www.aaai.org.
[4] Hinneburg A, Gabriel H-H (2007) Denclue 2.0: fast clustering based on kernel density estimation. In Advances in intelligent data analysis (IDA), pp 70–80.
[5] Imran Khan, Joshua Zhexue Huang (2012),” Ensemble Clustering of High Dimensional Data With random Projection.” In: Proceeding of the international conference on information and knowledge management.
[6] Qi Xianting, Wang Pan,“A density-based clustering algorithm for high-dimensional data with feature selection”, 2016,IEEE.
[7] Schneider J, Vlachos M (2013) “Fast parameter less density-based clustering via random projections.” In: Proceedings of the international conference on information and knowledge management (CIKM), pp 861–866.
[8] Johannes Schneider, Michail Valchos(2017) “Scalable density based clustering with quality guarantees using random projections.” Published in Journal: Data Mining and Knowledge Discovery Volume 31 Issue 4, July 2017 pages 972-1005.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
