ONTOLOGY-BASED INFORMATION EXTRACTION FROM PDF DOCUMENTS WITH XONTO

Information extraction is of paramount importance in several real world applications in the areas of business, competitive and military intelligence because it enables to acquire information contained in unstructured documents and store them in structured forms. Unstructured documents have different internal encodings, one of the most diffused encoding is the visualization-oriented Adobe portable document format (PDF). Although several sophisticated and indeed complex approaches were proposed, they are still limited in many aspects. In particular, existing information extraction systems cannot be applied to PDF documents because of their completely unstructured nature that pose many issues in defining IE approaches. In this paper the novel ontology-based system named XONTO, that allows the semantic extraction of information from PDF documents, is presented. The XONTO system is founded on the idea of self-describing ontologies in which objects and classes can be equipped by a set of rules named descriptors. These rules represent patterns that allow to automatically recognize and extract ontology objects contained in PDF documents also when information is arranged in tabular form. This way a self-describing ontology expresses the semantic of the information to extract and the rules that, in turn, populate itself. In the paper XONTO system behaviors and structure are sketched by means of a running example.

Download Full-text

Linear Bandits with Feature Feedback

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5980 ◽

2020 ◽

Vol 34 (04) ◽

pp. 5331-5338

Author(s):

Urvashi Oswal ◽

Aniruddha Bhargava ◽

Robert Nowak

Keyword(s):

Computational Complexity ◽

Prior Knowledge ◽

Real World ◽

Time Horizon ◽

The Novel ◽

Bandit Problem ◽

Real World Applications ◽

New Form ◽

Feature Feedback ◽

Over Time

This paper explores a new form of the linear bandit problem in which the algorithm receives the usual stochastic rewards as well as stochastic feedback about which features are relevant to the rewards, the latter feedback being the novel aspect. The focus of this paper is the development of new theory and algorithms for linear bandits with feature feedback which can achieve regret over time horizon T that scales like k√T, without prior knowledge of which features are relevant nor the number k of relevant features. In comparison, the regret of traditional linear bandits is d√T, where d is the total number of (relevant and irrelevant) features, so the improvement can be dramatic if k ≪ d. The computational complexity of the algorithm is proportional to k rather than d, making it much more suitable for real-world applications compared to traditional linear bandits. We demonstrate the performance of the algorithm with synthetic and real human-labeled data.

Download Full-text

LearningPinocchio: adaptive information extraction for real world applications

Natural Language Engineering ◽

10.1017/s135132490400333x ◽

2004 ◽

Vol 10 (2) ◽

pp. 145-165 ◽

Cited By ~ 12

Author(s):

F. CIRAVEGNA ◽

A. LAVELLI

Keyword(s):

Information Extraction ◽

Real World ◽

Real World Applications

Download Full-text

Low frequency of drug-drug interactions (DDIs) with the novel all oral antivirals elbasvir (EBV) and grazoprevir (GRZ) in patients with HCV genotype 1 infection in German real-world

Zeitschrift für Gastroenterologie ◽

10.1055/s-0036-1587006 ◽

2016 ◽

Vol 54 (08) ◽

Author(s):

P Buggisch ◽

H Löhr ◽

G Teuber ◽

H Steffens ◽

M Kraus ◽

...

Keyword(s):

Drug Interactions ◽

Real World ◽

Low Frequency ◽

The Novel ◽

Genotype 1 ◽

Hcv Genotype ◽

Hcv Genotype 1

Download Full-text

Designing Stochastic Optimization Algorithms for Real-world Applications

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.132.2 ◽

2012 ◽

Vol 132 (1) ◽

pp. 2-5

Author(s):

Hiroshi Someya ◽

Hisashi Handa ◽

Seiichi Koakutsu

Keyword(s):

Stochastic Optimization ◽

Real World ◽

Optimization Algorithms ◽

Real World Applications ◽

Stochastic Optimization Algorithms

Download Full-text

Document management. Portable document format

10.3403/30167699 ◽

2008 ◽

Keyword(s):

Portable Document Format ◽

Document Management ◽

Document Format

Download Full-text

Literature Survey on Real World Applications Using Internet of Things

SSRN Electronic Journal ◽

10.2139/ssrn.3165327 ◽

2018 ◽

Author(s):

Dr. K. Sailaja ◽

M. Rohitha

Keyword(s):

Internet Of Things ◽

Real World ◽

Literature Survey ◽

Real World Applications

Download Full-text

Blockchain Beyond Bitcoin: Blockchain Technology Challenges and Real-World Applications

2018 International Conference on Computing, Electronics & Communications Engineering (iCCECE) ◽

10.1109/iccecome.2018.8658518 ◽

2018 ◽

Cited By ~ 5

Author(s):

Muniba Memon ◽

Syed Shahbaz Hussain ◽

Umair Ahmed Bajwa ◽

Asad Ikhlas

Keyword(s):

Real World ◽

Blockchain Technology ◽

Real World Applications

Download Full-text

A Review of Machine Learning Classification Using Quantum Annealing for Real-World Applications

SN Computer Science ◽

10.1007/s42979-021-00751-0 ◽

2021 ◽

Vol 2 (5) ◽

Author(s):

Rajdeep Kumar Nath ◽

Himanshu Thapliyal ◽

Travis S. Humble

Keyword(s):

Machine Learning ◽

Real World ◽

Quantum Annealing ◽

Machine Learning Classification ◽

Real World Applications

Download Full-text

Electronic Phenomena of Transition Metal Oxides

Crystals ◽

10.3390/cryst11030256 ◽

2021 ◽

Vol 11 (3) ◽

pp. 256

Author(s):

Christian Rodenbücher ◽

Kristof Szot

Keyword(s):

Solid State ◽

Transition Metal ◽

Metal Oxides ◽

Real World ◽

Transition Metal Oxides ◽

Major Research ◽

Research Fields ◽

Real World Applications

Transition metal oxides with ABO3 or BO2 structures have become one of the major research fields in solid state science, as they exhibit an impressive variety of unusual and exotic phenomena with potential for their exploitation in real-world applications [...]

Download Full-text

Designing Motion Matching for Real-World Applications

Proceedings of the Thirteenth International Conference on Tangible, Embedded, and Embodied Interaction - TEI '19 ◽

10.1145/3294109.3295628 ◽

2019 ◽

Cited By ~ 1

Author(s):

David Verweij ◽

Augusto Esteves ◽

Saskia Bakker ◽

Vassilis-Javed Khan

Keyword(s):

Real World ◽

Real World Applications

Download Full-text