Web information extraction for content augmentation

As public concern for food safety grows, the relevant laws and policies are gradually being introduced and improved. Agricultural product quality and safety systems have become a research hot spot, and a central question is how to obtain the Web information of such systems effectively and quickly, which makes intelligent extraction of that information essential. This paper addresses the problem of efficiently extracting Web information for agricultural product quality and safety systems. It reviews the Web information extraction methods used in various systems, analyzes in detail the template-based information extraction algorithms currently in use, and presents a scheme that automatically extracts the Web information of an agricultural product quality and safety system according to a template. The results show that the proposed scheme is a dynamically extensible information extraction system in which templates can be configured dynamically for different requirements without changing the code. Compared with the general approach, Web information extraction speed for the agricultural product quality and safety system increases by 25%, accuracy by 12%, and recall by 30%.
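The idea of configuring templates without changing code can be illustrated with a minimal sketch. It is not the paper's implementation: the `extract` function, the `TEMPLATE` field names, the regular expressions, and the sample page are all hypothetical, chosen only to show how extraction rules can live in configuration data.

```python
import re
from typing import Dict, Optional


def extract(html: str, template: Dict[str, str]) -> Dict[str, Optional[str]]:
    """Apply each field's pattern to the page; missing fields map to None."""
    record: Dict[str, Optional[str]] = {}
    for field_name, pattern in template.items():
        # Each pattern is expected to contain exactly one capture group.
        match = re.search(pattern, html, flags=re.DOTALL)
        record[field_name] = match.group(1).strip() if match else None
    return record


# Hypothetical template: field name -> regex with one capture group.
# Because it is plain data, it could equally be loaded from a JSON file,
# so supporting a new page layout means editing configuration, not code.
TEMPLATE = {
    "product": r'<h1 class="name">(.*?)</h1>',
    "inspection_date": r'<span class="date">(.*?)</span>',
    "result": r'<td class="result">(.*?)</td>',
}

# Hypothetical sample page, invented for this sketch.
SAMPLE_PAGE = """
<html><body>
  <h1 class="name">Spinach</h1>
  <span class="date">2020-05-01</span>
  <table><tr><td class="result"> pass </td></tr></table>
</body></html>
"""

record = extract(SAMPLE_PAGE, TEMPLATE)
print(record)
```

Adding a field here only requires a new entry in `TEMPLATE`; the `extract` function stays unchanged. Regular expressions are used to keep the sketch short, and a production system would more likely rely on a proper HTML parser.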

PSO: A Language for Web Information Extraction and Web Page Clipping

Lecture Notes in Computer Science - Adaptive Hypermedia and Adaptive Web-Based Systems
DOI: 10.1007/978-3-540-27780-4_45
2004, pp. 332-335
Author(s): Tetsuya Suzuki, Takehiro Tokuda
Keyword(s): Information Extraction, Web Page, Web Information Extraction, Web Information

The Ex Project: Web Information Extraction Using Extraction Ontologies

Knowledge Discovery Enhanced with Semantic and Social Information - Studies in Computational Intelligence
DOI: 10.1007/978-3-642-01891-6_5
2009, pp. 71-88
Cited By ~ 2
Author(s): Martin Labský, Vojtěch Svátek, Marek Nekvasil, Dušan Rak
Keyword(s): Information Extraction, Web Information Extraction, Web Information

A Research of the Internet Based on Web Information Extraction and Data Fusion

Lecture Notes in Computer Science - New Horizons in Web-Based Learning - ICWL 2010 Workshops
DOI: 10.1007/978-3-642-20539-2_22
2011, pp. 195-206
Author(s): Yajun Jiang, Zaoliang Wu, Zengrong Zhan, Lingyu Xu
Keyword(s): Data Fusion, Information Extraction, The Internet, Web Information Extraction, Web Information

Combining Classification Algorithm with DOM Algorithm for Web Information Extraction – A Hybrid Approach

Advances in Intelligent and Soft Computing - Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012
DOI: 10.1007/978-3-642-27443-5_68
2012, pp. 591-596
Author(s): Venkat Ramana Bhavanasi, A. Damodaram
Keyword(s): Information Extraction, Hybrid Approach, Classification Algorithm, Web Information Extraction, Web Information

Web Information Extraction via Web Views

Web Information Systems
DOI: 10.4018/978-1-59140-208-4.ch007
2004, pp. 227-267
Author(s): Wee Keong Ng, Zehua Liu, Zhao Li, Ee Peng Lim
Keyword(s): Information Extraction, Data Model, Information Source, Extraction Process, Web Pages, Efficient Manner, Web Information Extraction, Web Information, Definition Of, The Web

With the explosion of information on the Web, traditional ways of browsing and keyword searching over web pages no longer satisfy the demanding needs of web surfers. Web information extraction has emerged as an important research area that aims to automatically extract information from target web pages and convert it into a structured format for further processing. The main issues involved in the extraction process include: (1) the definition of a suitable extraction language; (2) the definition of a data model representing the web information source; (3) the generation of the data model, given a target source; and (4) the extraction and presentation of information according to a given data model. In this chapter, we discuss the challenges of these issues and the approaches that current research activities have taken to resolve them. We propose several classification schemes to classify existing approaches to information extraction from different perspectives. Among the existing works, we focus on the Wiccap system, a software system that enables ordinary end-users to obtain information of interest in a simple and efficient manner by constructing personalized web views of information sources.
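Issues (2) and (4) above, a data model for the source and extraction according to that model, can be made concrete with a small sketch. It is not the Wiccap data model or extraction language: the `Node` class, the `extract_view` function, the patterns, and the sample page are hypothetical and serve only to show a tree-shaped model in which each node scopes the text its children see.

```python
import re
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """One element of a hypothetical tree-shaped data model."""
    name: str
    pattern: str  # regex with one capture group delimiting this node's content
    children: List["Node"] = field(default_factory=list)


def extract_view(node: Node, text: str) -> list:
    """Return one item per match of the node's pattern within the text."""
    items = []
    for match in re.finditer(node.pattern, text, flags=re.DOTALL):
        scope = match.group(1)
        if node.children:
            # Inner nodes produce a dict; children only see the matched scope.
            items.append({c.name: extract_view(c, scope) for c in node.children})
        else:
            # Leaf nodes produce the captured text itself.
            items.append(scope.strip())
    return items


# Hypothetical model of a news page: sections containing a title and headlines.
MODEL = Node("section", r"<div class='section'>(.*?)</div>", [
    Node("title", r"<h2>(.*?)</h2>"),
    Node("headline", r"<li>(.*?)</li>"),
])

# Hypothetical sample page, invented for this sketch.
PAGE = """
<div class='section'><h2>World</h2><ul><li>Story A</li><li>Story B</li></ul></div>
<div class='section'><h2>Sport</h2><ul><li>Story C</li></ul></div>
"""

view = extract_view(MODEL, PAGE)
print(view)
```

Separating the model (`MODEL`) from the generic walker (`extract_view`) mirrors the split between defining a data model and extracting according to it. The sketch assumes sections are not nested, since a non-greedy regex cannot match balanced tags.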

Web information extraction for content augmentation

An approach of semi-supervised Web information extraction

Cross domain web information extraction with multi-level feature model

An agent-based system framework for multi-slot Web information extraction

Web Information Extraction System

Intelligent Web Information Extraction Model for Agricultural Product Quality and Safety System

PSO: A Language for Web Information Extraction and Web Page Clipping

The Ex Project: Web Information Extraction Using Extraction Ontologies

A Research of the Internet Based on Web Information Extraction and Data Fusion

Combining Classification Algorithm with DOM Algorithm for Web Information Extraction – A Hybrid Approach

Web Information Extraction via Web Views
