Implementation of Web Content Extraction of Structured Data Using DotNet Framework
DOI:
https://doi.org/10.26438/ijcse/v6i5.659663Keywords:
Web Content Mining, Structured Data, Web Data Extraction, HTML, Data mining, Web MiningAbstract
This paper deals in Web Content Mining for extraction of structured data. While perusing the web, the client needs to experience numerous pages of the Internet, channel the information and download related records and documents. This errand of seeking and downloading is tedious. Now and again the look inquiries call for particular choice, say, restricting inquiry to few connections. To lessen the time spent by clients, a web extraction and capacity apparatus has been composed and executed in .Net framework, that robotizes the downloading task from a given client question. The Test Scenario has been given different catchphrases. The present work can be a valuable contribution to Web Manipulators, Staff, Students and Web Administrators in an Academic Environment.
References
U. Moulali, V. Sasidhar, “Competent pattern innovation designed for textual content mining”, 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), pp. 572 – 577.
Farman Ali, Pervez Khan, Kashif Riaz, Daehan Kwak, Tamer Abuhmed, Daeyoung Park, Kyung Sup Kwak, “A Fuzzy Ontology and SVM–Based Web Content Classification System”, IEEE Access, Vol. 5, pp. 25781 – 25797.
Yeongsu Kim, Seungwoo Lee, “SVM-based web content mining with leaf classification unit from DOM-tree”, 2017 9th International Conference on Knowledge and Smart Technology (KST), pp. 359 – 364.
Tak-Lam Wong, Wai Lam, “Learning to Adapt Web Information Extraction Knowledge and Discovering New Attributes via a Bayesian Approach” IEEE Trans. on Knowledge and Data Engineering, Vol. 22, No. 4, pp. 523 – 536, 2010.
Charu C. Aggarwal, Yuchen Zhao, Philip S. Yu, “On the Use of Side Information for Mining Text Data”, IEEE Trans. on Knowledge and Data Engineering, Vol. 26, No. 6, pp. 1415 – 1429, 2014.
Kaveh Hassani, Won-Sook Lee, “Adaptive animation generation using web content mining”, 2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS), pp. 1 – 8.
G. Dhivya, K. Deepika, J. Kavitha, V. Nithya Kumari, “Enriched content mining for web applications”, 2015 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS), pp. 1 – 5.
Tao Jiang, Ah-hwee Tan, Ke Wang, “Mining Generalized Associations of Semantic Relations from Textual Web Content”, IEEE Trans. on Knowledge and Data Engineering, Vol. 19, No. 2, pp. 164 – 179.
Hung-Yu Kao, Shian-Hua Lin, Jan-Ming Ho, Ming-Syan Chen, “Mining Web informative structures and contents based on entropy analysis”, IEEE Trans. on Knowledge and Data Engineering, Vol. 16, No. 1, pp. 41 – 55, 2004.
F. de la Rosa Troyano, S. del Pozo Hidalgo, R. Martinez Gasca, “Analysis and Visualization of Scientific Communities with Information Extracted from the Web”, IEEE Latin America Transactions, Vol. 3, No. 1, pp. 56 – 61.
I-Jen Chiang, Charles Chih-Ho Liu, Yi-Hsin Tsai, Ajit Kumar, “Discovering Latent Semantics in Web Documents Using Fuzzy Clustering”, IEEE Trans. on Fuzzy Systems, Vol. 23, No. 6, pp. 2122 – 2134.
Hao Ma, Irwin King, Michael R. Lyu, “Mining Web Graphs for Recommendations”, IEEE Trans. on Knowledge and Data Engineering, Vol. 24, No. 6, pp. 1051 – 1064, 2012.
Tak-Lam Wong, Wai Lam, “Learning to Adapt Web Information Extraction Knowledge and Discovering New Attributes via a Bayesian Approach”, IEEE Trans. on Knowledge and Data Engineering, Vol. 22, No. 4, pp. 523 – 536.
Wei Liu, Xiaofeng Meng, Weiyi Meng, “ViDE: A Vision-Based Approach for Deep Web Data Extraction”, IEEE Trans. on Knowledge and Data Engineering, Vol. 22, No. 3, pp. 447 – 460.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
