An Approach for Ontology Based Information Extraction

N. Borisova

doi:10.1515/itc-2015-0007

An Approach for Ontology Based Information Extraction

Information Technologies and Control ◽

10.1515/itc-2015-0007 ◽

2014 ◽

Vol 12 (1) ◽

pp. 15-20 ◽

Cited By ~ 3

Author(s):

N. Borisova

Keyword(s):

Information Extraction ◽

Data Extraction ◽

Free Software ◽

Language Resources ◽

Text Documents ◽

Automatic Data ◽

Content Creation ◽

Pos Tagger ◽

The Given ◽

Processing Language

Abstract An approach for Ontology based Information Extraction (OBIE) from unstructured text in the Bulgarian language is presented in this paper. The presented method and algorithm provide a solution for automatic data extraction from text documents exploiting ontologies. To this end, in addition to the standard tools for processing language resources in an open source free software, a dictionary-based lemmatizer for Bulgarian has been developed and integrated. It is distributed as free software, publicly available to download and use under the GPL v3 license. Due to the specifics of inflection in Bulgarian the developed tools for lemmatization will contribute to improving the results of the POS tagger. This approach will offer opportunities for developing a dynamically created gazetteer that is, in combination with a few other generic GATE resources, capable of producing ontologybased annotations over the given content with regards to the given ontology. This algorithm can also be used in the processes of content creation and management of information and knowledge.

Download Full-text

A FRAME WORK FOR WEB INFORMATION EXTRACTION AND ANALYSIS

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v7i2.3459 ◽

2013 ◽

Vol 7 (2) ◽

pp. 574-579 ◽

Cited By ~ 3

Author(s):

Dr Sunitha Abburu ◽

G. Suresh Babu

Keyword(s):

Information Extraction ◽

Data Extraction ◽

Research Work ◽

Web Pages ◽

Web Documents ◽

E Learning ◽

Structured Information ◽

Frame Work ◽

Effective Decision ◽

The Web

Day by day the volume of information availability in the web is growing significantly. There are several data structures for information available in the web such as structured, semi-structured and unstructured. Majority of information in the web is presented in web pages. The information presented in web pages is semi-structured.Â But the information required for a context are scattered in different web documents. It is difficult to analyze the large volumes of semi-structured information presented in the web pages and to make decisions based on the analysis. The current research work proposed a frame work for a system that extracts information from various sources and prepares reports based on the knowledge built from the analysis. This simplifies Â data extraction, data consolidation, data analysis and decision making based on the information presented in the web pages.The proposed frame work integrates web crawling, information extraction and data mining technologies for better information analysis that helps in effective decision making.Â Â It enables people and organizations to extract information from various sourses of web and to make an effective analysis on the extracted data for effective decision making.Â The proposed frame work is applicable for any application domain. Manufacturing,sales,tourisum,e-learning are various application to menction few.The frame work is implemetnted and tested for the effectiveness of the proposed system and the results are promising.

Download Full-text

Research on Framework Load Correlations Based on Automatic Data Extraction Algorithm

Advances in Intelligent Systems and Computing - Advances in Intelligent Systems and Interactive Applications ◽

10.1007/978-3-030-34387-3_74 ◽

2019 ◽

pp. 604-613

Author(s):

Meiwen Hu ◽

Binjie Wang ◽

Shouguang Sun

Keyword(s):

Data Extraction ◽

Automatic Data ◽

Extraction Algorithm

Download Full-text

Automatic data extraction: A prerequisite for productivity measurement

2008 IEEE International Engineering Management Conference ◽

10.1109/iemce.2008.4617971 ◽

2008 ◽

Author(s):

D. Zaum ◽

M. Olbrich ◽

E. Barke

Keyword(s):

Data Extraction ◽

Productivity Measurement ◽

Automatic Data

Download Full-text

Automatic data extraction from 24 hour blood pressure measurement reports of a large multicenter clinical trial

Computer Methods and Programs in Biomedicine ◽

10.1016/j.cmpb.2021.106588 ◽

2021 ◽

pp. 106588

Author(s):

Janis M Nolde ◽

Ajmal Mian ◽

Luca Schlaich ◽

Justine Chan ◽

Leslie Marisol Lugo-Gavidia ◽

...

Keyword(s):

Blood Pressure ◽

Clinical Trial ◽

Pressure Measurement ◽

Data Extraction ◽

Blood Pressure Measurement ◽

Multicenter Clinical Trial ◽

Automatic Data

Download Full-text

Natural Language Processing-Based Information Extraction and Abstraction for Lease Documents

Advances in Computer and Electrical Engineering - Neural Networks for Natural Language Processing ◽

10.4018/978-1-7998-1159-6.ch011 ◽

2020 ◽

pp. 170-187

Author(s):

Sumathi S. ◽

Rajkumar S. ◽

Indumathi S.

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Information Extraction ◽

Language Processing ◽

Data Extraction ◽

Easy Access ◽

Property A ◽

Key Events

Lease abstraction is the method of compartmentalization of key data from a lease document. Lease document for a property contains key business, money, and legal data about a property. A lease abstract report contains details concerning the property location and basic lease details, price schedules, key events, terms and conditions, automobile parking arrangements, and landowner and tenant obligations. Abstracting a true estate contract into electronic type facilitates easy access to key data, exchanging the tedious method of reading the whole contents of the contract every time. Language process may be used for data extraction and abstraction of knowledge from lease documents.

Download Full-text

Information Extraction in Biomedical Literature

Encyclopedia of Data Warehousing and Mining ◽

10.4018/978-1-59140-557-3.ch116 ◽

2011 ◽

pp. 615-620

Author(s):

Min Song ◽

Il-Yeol Song ◽

Xiaohua Hu ◽

Hyoil Han

Keyword(s):

Natural Language ◽

Information Extraction ◽

Latin American ◽

Relational Databases ◽

Biomedical Literature ◽

Text Documents ◽

Plain Text ◽

Natural Language Text ◽

Relational Form ◽

Structured Representation

Information extraction (IE) technology has been defined and developed through the US DARPA Message Understanding Conferences (MUCs). IE refers to the identification of instances of particular events and relationships from unstructured natural language text documents into a structured representation or relational table in databases. It has proved successful at extracting information from various domains, such as the Latin American terrorism, to identify patterns related to terrorist activities (MUC-4). Another domain, in the light of exploiting the wealth of natural language documents, is to extract the knowledge or information from these unstructured plain-text files into a structured or relational form. This form is suitable for sophisticated query processing, for integration with relational databases, and for data mining. Thus, IE is a crucial step for fully making text files more easily accessible.

Download Full-text

Automatic Data Extraction from Data-Rich Web Pages

Database Systems for Advanced Applications - Lecture Notes in Computer Science ◽

10.1007/11408079_75 ◽

2005 ◽

pp. 828-839 ◽

Cited By ~ 4

Author(s):

Dongdong Hu ◽

Xiaofeng Meng

Keyword(s):

Data Extraction ◽

Web Pages ◽

Automatic Data

Download Full-text

Wrapper Generation for Automatic Data Extraction from Large Web Sites

Databases in Networked Information Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-540-31970-2_3 ◽

2005 ◽

pp. 34-53 ◽

Cited By ~ 1

Author(s):

Nitin Jindal

Keyword(s):

Web Sites ◽

Data Extraction ◽

Automatic Data ◽

Wrapper Generation

Download Full-text

Automatic Data Extraction from Web Discussion Forums

2009 Fourth International Conference on Frontier of Computer Science and Technology ◽

10.1109/fcst.2009.20 ◽

2009 ◽

Cited By ~ 4

Author(s):

Suke Li ◽

Liyong Tang ◽

Jianbin Hu ◽

Zhong Chen

Keyword(s):

Data Extraction ◽

Discussion Forums ◽

Automatic Data

Download Full-text

HTML Pattern Generator--Automatic Data Extraction from Web Pages

2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing ◽

10.1109/synasc.2006.43 ◽

2006 ◽

Author(s):

Mirel Cosulschi ◽

Adrian Giurca ◽

Bogdan Udrescu ◽

Nicolae Constantinescu ◽

Mihai Gabroveanu

Keyword(s):

Data Extraction ◽

Pattern Generator ◽

Web Pages ◽

Automatic Data

Download Full-text