Clustering of Web Access Patterns for Segmenting Web Users Using a Fuzzy Based Cluster Estimation Method

Authors

  • Rajimol A Department of Computer Applications, Marian College Kuttikkanam, Kerala, India

DOI:

https://doi.org/10.26438/ijcse/v7i2.216222

Keywords:

Fuzzy Clustering, Web access pattern, Fuzzy Logic, k-means clustering, c-means clustering

Abstract

This paper presents a method for segmenting the web users based on their web access patterns. History of web pages visited by users includes informations like access sequences of web users and number of visits of web pages and reveals interest of users in particular pages. The web users’ access patterns can be segmented to group the users with similar interests. In this work, a simple, count based technique is used for preprocessing web access data so as to convert it into a database with fixed number of attributes. A novel approach based on fuzzy clustering principles for unsupervised clustering is extended to identify the number of web user groups based on their access patterns. This method starts by assuming that all the data points are initial clusters. Pairs of similar clusters are then merged based on fuzzy membership values. This paper also compare the cluster count obtained with this approach with the cluster count obtained with Cohonen’s unsupervised clustering algorithm. The tools available with IBM SPSS Modeler 14.1 are used to benchmark the quality of cluster estimation.

References

[1] W.H. Inmon, “The Data Warehouse and Data Mining”, ACM Commn , 1996, 39:.49-50.

[2] Berkhin P. (2006) A Survey of Clustering Data Mining Techniques. In: Kogan J., Nicholas C., Teboulle M. (eds) Grouping Multidimensional Data. Springer, Berlin, Heidelberg

[3] J Han, M Kamber, Data Mining Concepts and Techniques, Elsevier, 2003.

[4] R. Cooley, B. Mobasher, J. Srivastava: Data Preparation for Mining World Wide Web Browsing Patterns, Journal of Knowledge and Information Systems, 1999, 1(1).

[5] G. Raju, A. Singh, Th. Shanta Kumar, Binu Thomas, Integration of Fuzzy Logic in Data Mining: A comparative Case Study, Proc. of International Conf. on Mathematics and Computer Science, Loyola College, Chennai, 2008, pp.128-136,

[6] I.V. Cadez, C. Meek, Visualization of Navigation Patterns on a Web Site Using Model Based Clustering, citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.163.1042.

[7] Ajith Abraham, Business Intelligence from Web Usage Mining, Journal of Information & Knowledge Management, 2003, 2(4).

[8] D. Cosic , S. Loncaric, New Methods for Cluster Selection in Unsupervised Fuzzy Clustering, citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.51.2642.

[9] X. Xiong, K.L.Chan, K.L. Tan, Similarity-driven cluster merging method for unsupervised fuzzy clustering, Proceedings of the 20th conference on Uncertainty in artificial intelligence AUAI Press Arlington, Virginia, United States, 2004.

[10] J.Srivastava, R.Cooley, M. Deshpande, and P.-N. Tan. Web Usage Mining: Discover and Applications of Usage Patterns from Web Data. In ACM SIGKDD Explorations, 2000, 1(2) , pp 12-23.

[11] J. Pei, J. Han, B. Mortazavi, H. Zhu: Mining Access Patterns Efficiently from Web Logs. Proceedings of the 4th PAKDD, Kyoto, Japan, 2000, pp.396-407.

[12] E. Cox, Fuzzy Modeling and Genetic Algorithms for Data Mining And Exploration, Elsevier, 2005.

[13] Sankar K. Pal, P. Mitra, Data Mining in Soft Computing Framework: A Survey, IEEE transactions on neural networks, 2002, 13(1).

[14] Binu Thomas, Raju G., and Sonam Wangmo, A Modified Fuzzy C-Means Algorithm for Natural Data Exploration, www.waset.org/journals/waset/v49/v49-88.pdf.

[15] B. Thomas and G. Raju, A Fuzzy Threshold Based Modified Clustering Algorithm for Natural Data Exploration, Lecture Notes in Computer Science, 2010, 6122, pp. 167-172.

[16] J. Pei, J. Han, B. Mortazavi-asl, and H. Zhu. Mining Access Pattern Efficiently from Web Logs in Proc. 2000 Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD`00), Kyoto, Japan, April 2000

[17] M. Spiliopoulou, L. C. Faulstich, K. Winkler, A Data Miner Analyzing the Navigational Behaviour of Web Users in Proc. of the Workshop on Machine Learning in User Modelling of the ACAI`99 Int. Conf., Creta, Greece, July 1999

[18] R.Cooley, B. Mobasher and J. Srivastava. Web Mining: Information and Pattern Discovery on the World Wide Web in Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence (ICTAI`97), November 1997.

[19] B. Mobasher, R.Cooley, and J. Srivastava. Automatic Personalization Based on Web Usage Mining in Communication of ACM, August, 2000 (Volume 43 , Issue 8)

[20] T. Joachims, D. Freitag, T. Mitchell WebWatcher: A Tour Guide for the World Wide Web in IJCAI97 -- Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pages 770--775, Nagoya, Japan.

[21] N.Sujatha, K. Prakash, "An Efficient and Scalable Auto Recommender System Based on Users Behavior", International Journal of Scientific Research in Computer Science and Engineering, Vol.6, Issue.6, pp.35-40, 2018.

[22] D.S. Rajput, R.S. Thakur, G.S. Thakur, "Clustering approach based on Efficient Coverage with Minimum Weight for Document Data", International Journal of Computer Sciences and Engineering, Vol.1, Issue.1, pp.6-13, 2013.

[23] http://kdd.ics.uci.edu/databases/msnbc/

Downloads

Published

2019-02-28
CITATION
DOI: 10.26438/ijcse/v7i2.216222
Published: 2019-02-28

How to Cite

[1]
A. Rajimol, “Clustering of Web Access Patterns for Segmenting Web Users Using a Fuzzy Based Cluster Estimation Method”, Int. J. Comp. Sci. Eng., vol. 7, no. 2, pp. 216–222, Feb. 2019.

Issue

Section

Research Article