An Approach For Web Log Pre-Processing And Evidence Preservation For Web Mining

Authors

  • Richa Chourasia Department of CSE, Infinity Management & Engineering Institute, Sagar, M.P., India
  • Preeti Choudhary Department of CSE, Infinity Management & Engineering Institute, Sagar, M.P., India

Keywords:

Preprocessing, Web usage, Web log

Abstract

The time needed to scrape out any true information is for the most part used on information preprocessing. The information preprocessing stage lays the foundation for information mining with which, the client extricate and distinguish pertinent data from the World Wide Web. In this paper, we examine information preprocessing systems and different steps included in getting the obliged substance adequately. A powerful web log preprocessing technique is constantly proposed for web log preprocessing to concentrate the client designs. The information cleaning method uproots the unessential passages from web log and sifting calculation disposes of the uninterested characteristics from log record.

References

Muhammad Kamran Ahmed, Mukhtar Hussain and Asad Raza “An Automated User Transparent Approach to log Web URLs for Forensic Analysis” Fifth International Conference on IT Security Incident Management and IT Forensics 2009.

Yan LI, Boqin FENG and Qinjiao MAO, “Research on Path Completion Technique In Web Usage Mining”, IEEE International Symposium On Computer Science and Computational Technology, pp. 554-559, 2008.

Tasawar Hussain, Dr. Sohail Asghar and Nayyer Masood, “Hierarchical Sessionization at Preprocessing Level of WUM Based on Swarm Intelligence ”, 6th International Conference on Emerging Technologies (ICET) IEEE, pp. 21-26, 2010.

Doru Tanasa and Brigitte Trousse, ”Advanced Data Preprocessing for Intersites Web Usage Mining “, Published by the IEEE Computer Society, pp. 59-65, March/April 2004.

Huiping Peng, “Discovery of Interesting Association Rules Based On Web Usage Mining”, IEEE Coference, pp.272-275, 2010.

Ling Zheng, Hui Gui and Feng Li, “ Optimized Data Preprocessing Technology For Web Log Mining”, IEEE International Conference On Computer Design and Applications( ICCDA ), pp. VI-19-VI-21,2010.

JING Chang-bin and Chen Li, “ Web Log Data Preprocessing Based On Collaborative Filtering ”, IEEE 2nd International Workshop On Education Technology and Computer Science, pp.118-121, 2010.

Bzip2 and libbzip2 project official home page, http://www.bzip.org/ .

gzip official home page, algorithm description, http://www.gzip.org/algorithm.txt.

M. Nelson. Data Compression with the Burrows-Wheeler Transform. In Dr. Dobbs Journal September 1996.

J. Ziv, A. Lamapel. A Universal Algorithm for Sequential Data Compression. In IEEE Transactions on Information Theory, May 1977.

7 zip project official home page, http://www.7-zip.org.

M. Drini´c, D. Kirovski et al. PPMexe: PPM for Compressing Software. In Proceedings of the Data Compression Conference, IEEE, 2002.

Bal´azs R´ACZ, A. Luk ´acs. High density compression of log files. In Proceedings of Data Compression Conference (DCC'04), IEEE Page 557, 2004.

Y. Liang, Y. Y. Zhang et al. Filtering Failure Logs for a Blue Gene/L Prototype. In Proceedings of IEEE International Conference on Dependable Systems and Networks , 2005.

Vijayashri Losarwar, Dr. Madhuri Joshi, Data Preprocessing in Web Usage Mining, International Conference on Artificial Intelligence and Embedded Systems (ICAIES'2012) July 15-16, 2012 Singapore.

D.Vasumathi, D.Vasumathi and K.Suresh, “Effective Web Personalization Using Clustering”, IEEE IAMA, 2009.

Richa Chourasia, Prof. Preeti Choudhary, “A Survey On Web Log Pre-Processing And Evidence Preservation For Web Mining”, International Journal Of Innovative Research In Technology & Science, Volume1, Issue 4, Issn:2321-1156.

Downloads

Published

2014-04-30

How to Cite

[1]
R. Chourasia and P. Choudhary, “An Approach For Web Log Pre-Processing And Evidence Preservation For Web Mining”, Int. J. Comp. Sci. Eng., vol. 2, no. 4, pp. 210–216, Apr. 2014.

Issue

Section

Research Article