Implementation of Web Content Extraction of Structured Data Using DotNet Framework

Authors

  • Florence Dayana M Department of Computer Science, A.V.V.M Sri Pushpam College (Autonomous), Poondi, Thanjavur, India
  • Chidambaram M Department of Computer Science, Rajah Serfoji Government College (Autonomous), Thanjavur, India

DOI:

https://doi.org/10.26438/ijcse/v6i5.659663

Keywords:

Web Content Mining, Structured Data, Web Data Extraction, HTML, Data mining, Web Mining

Abstract

This paper deals in Web Content Mining for extraction of structured data. While perusing the web, the client needs to experience numerous pages of the Internet, channel the information and download related records and documents. This errand of seeking and downloading is tedious. Now and again the look inquiries call for particular choice, say, restricting inquiry to few connections. To lessen the time spent by clients, a web extraction and capacity apparatus has been composed and executed in .Net framework, that robotizes the downloading task from a given client question. The Test Scenario has been given different catchphrases. The present work can be a valuable contribution to Web Manipulators, Staff, Students and Web Administrators in an Academic Environment.

References

U. Moulali, V. Sasidhar, “Competent pattern innovation designed for textual content mining”, 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), pp. 572 – 577.

Farman Ali, Pervez Khan, Kashif Riaz, Daehan Kwak, Tamer Abuhmed, Daeyoung Park, Kyung Sup Kwak, “A Fuzzy Ontology and SVM–Based Web Content Classification System”, IEEE Access, Vol. 5, pp. 25781 – 25797.

Yeongsu Kim, Seungwoo Lee, “SVM-based web content mining with leaf classification unit from DOM-tree”, 2017 9th International Conference on Knowledge and Smart Technology (KST), pp. 359 – 364.

Tak-Lam Wong, Wai Lam, “Learning to Adapt Web Information Extraction Knowledge and Discovering New Attributes via a Bayesian Approach” IEEE Trans. on Knowledge and Data Engineering, Vol. 22, No. 4, pp. 523 – 536, 2010.

Charu C. Aggarwal, Yuchen Zhao, Philip S. Yu, “On the Use of Side Information for Mining Text Data”, IEEE Trans. on Knowledge and Data Engineering, Vol. 26, No. 6, pp. 1415 – 1429, 2014.

Kaveh Hassani, Won-Sook Lee, “Adaptive animation generation using web content mining”, 2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS), pp. 1 – 8.

G. Dhivya, K. Deepika, J. Kavitha, V. Nithya Kumari, “Enriched content mining for web applications”, 2015 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS), pp. 1 – 5.

Tao Jiang, Ah-hwee Tan, Ke Wang, “Mining Generalized Associations of Semantic Relations from Textual Web Content”, IEEE Trans. on Knowledge and Data Engineering, Vol. 19, No. 2, pp. 164 – 179.

Hung-Yu Kao, Shian-Hua Lin, Jan-Ming Ho, Ming-Syan Chen, “Mining Web informative structures and contents based on entropy analysis”, IEEE Trans. on Knowledge and Data Engineering, Vol. 16, No. 1, pp. 41 – 55, 2004.

F. de la Rosa Troyano, S. del Pozo Hidalgo, R. Martinez Gasca, “Analysis and Visualization of Scientific Communities with Information Extracted from the Web”, IEEE Latin America Transactions, Vol. 3, No. 1, pp. 56 – 61.

I-Jen Chiang, Charles Chih-Ho Liu, Yi-Hsin Tsai, Ajit Kumar, “Discovering Latent Semantics in Web Documents Using Fuzzy Clustering”, IEEE Trans. on Fuzzy Systems, Vol. 23, No. 6, pp. 2122 – 2134.

Hao Ma, Irwin King, Michael R. Lyu, “Mining Web Graphs for Recommendations”, IEEE Trans. on Knowledge and Data Engineering, Vol. 24, No. 6, pp. 1051 – 1064, 2012.

Tak-Lam Wong, Wai Lam, “Learning to Adapt Web Information Extraction Knowledge and Discovering New Attributes via a Bayesian Approach”, IEEE Trans. on Knowledge and Data Engineering, Vol. 22, No. 4, pp. 523 – 536.

Wei Liu, Xiaofeng Meng, Weiyi Meng, “ViDE: A Vision-Based Approach for Deep Web Data Extraction”, IEEE Trans. on Knowledge and Data Engineering, Vol. 22, No. 3, pp. 447 – 460.

Downloads

Published

2025-11-13
CITATION
DOI: 10.26438/ijcse/v6i5.659663
Published: 2025-11-13

How to Cite

[1]
M. Florence Dayana and M. Chidambaram, “Implementation of Web Content Extraction of Structured Data Using DotNet Framework”, Int. J. Comp. Sci. Eng., vol. 6, no. 5, pp. 659–663, Nov. 2025.

Issue

Section

Research Article