Building chatbots from large scale domain-specific knowledge bases: challenges and opportunities

Author(s):  
Walid Shalaby ◽  
Adriano Arantes ◽  
Teresa Gonzalez Diaz ◽  
Chetan Gupta
2017 ◽  
Vol 10 (12) ◽  
pp. 1965-1968 ◽  
Author(s):  
S. Bharadwaj ◽  
L. Chiticariu ◽  
M. Danilevsky ◽  
S. Dhingra ◽  
S. Divekar ◽  
...  

2004 ◽  
Vol 13 (03) ◽  
pp. 721-738 ◽  
Author(s):  
XIAOYING GAO ◽  
MENGJIE ZHANG

This paper describes a learning/adaptive approach to automatically building knowledge bases for information extraction from text-based web pages. A frame-based representation is introduced to represent domain knowledge as knowledge unit frames, and a frame learning algorithm is developed to learn these frames automatically from training examples. Some training examples can be obtained by automatically parsing a number of tabular web pages in the same domain, which greatly reduces the amount of time-consuming manual work. The approach was evaluated on ten web sites of real-estate and car advertisements, where nearly all of the information was successfully extracted with very few false alarms. These results suggest that the knowledge unit frame representation and the frame learning algorithm both work well, that domain-specific knowledge bases can be learned from training examples, and that the learned knowledge bases can be used for information extraction from flexible, text-based, semi-structured web pages across multiple sites. An investigation of the knowledge representation in five other domains suggests that the approach can be applied to new domains simply by changing the training examples.
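As a rough illustration of the idea, the sketch below shows what a knowledge unit frame might look like as executable code: a concept plus a set of slot-filling patterns applied to page text. The class name, slot patterns, and real-estate example are hypothetical stand-ins; the paper's actual frame representation and learning algorithm are not reproduced here.

```python
import re
from dataclasses import dataclass, field

# Hypothetical sketch of a knowledge unit frame; the paper's actual
# representation may differ. Each slot pairs a name with a pattern
# that could be learned from training examples (e.g., parsed tabular pages).
@dataclass
class KnowledgeUnitFrame:
    concept: str                                # e.g., "price" in real-estate ads
    slots: dict = field(default_factory=dict)   # slot name -> regex filler

    def extract(self, text: str) -> dict:
        """Apply each slot's pattern to the page text and collect matches."""
        results = {}
        for name, pattern in self.slots.items():
            m = re.search(pattern, text)
            if m:
                results[name] = m.group(1)
        return results

# Illustrative frame for a real-estate advertisement.
price_frame = KnowledgeUnitFrame(
    concept="price",
    slots={"amount": r"\$\s?([\d,]+)", "bedrooms": r"(\d+)\s+bedrooms?"},
)

print(price_frame.extract("Sunny cottage, 3 bedrooms, offered at $249,000."))
# {'amount': '249,000', 'bedrooms': '3'}
```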


2001 ◽  
Vol 10 (01n02) ◽  
pp. 65-86 ◽  
Author(s):  
DAN I. MOLDOVAN ◽  
ROXANA C. GÎRJU

It is widely accepted that more knowledge means more intelligence. Many knowledge-intensive applications require extensive domain-specific knowledge in addition to general-purpose knowledge bases. This paper presents a methodology for discovering domain-specific concepts and relationships in an attempt to extend WordNet. The method was tested on five seed concepts selected from the financial domain: interest rate, stock market, inflation, economic growth, and employment. Queries were formed with each of these concepts, and a corpus of 5000 sentences was extracted automatically from the Internet and the TREC-8 corpora. On this corpus, the system discovered a total of 264 new concepts not defined in WordNet, of which 221 contain the seeds and 43 are other related concepts. The system also discovered 64 relationships that link these concepts to WordNet concepts or to each other. The relationships were extracted with the help of 22 distinct lexico-syntactic patterns representing four semantic relations. Working in interactive mode, the system takes approximately 40 minutes per seed to discover the new concepts and relationships in the 5000-sentence corpus.
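The pattern-based extraction step can be pictured with a short sketch in the spirit of Hearst-style lexico-syntactic patterns. The two patterns and relation labels below are illustrative assumptions; the paper's 22 actual patterns and four semantic relations are not reproduced.

```python
import re

# Illustrative lexico-syntactic patterns in the spirit of Hearst (1992);
# each pattern maps a surface template to a semantic relation label.
PATTERNS = [
    # "NP such as NP" -> second NP is a kind of the first
    (re.compile(r"(\w[\w ]*?)\s+such as\s+(\w[\w ]*)"), "HYPERNYM"),
    # "NP, a kind of NP" -> first NP is a kind of the second
    (re.compile(r"(\w[\w ]*?),\s+a kind of\s+(\w[\w ]*)"), "HYPONYM"),
]

def extract_relations(sentence: str):
    """Return (relation, arg1, arg2) triples found in one sentence."""
    triples = []
    for pattern, relation in PATTERNS:
        for m in pattern.finditer(sentence):
            triples.append((relation, m.group(1).strip(), m.group(2).strip()))
    return triples

print(extract_relations("economic indicators such as inflation"))
# [('HYPERNYM', 'economic indicators', 'inflation')]
```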


2020 ◽  
Author(s):  
Abeed Sarker ◽  
Yuan-Chi Yang ◽  
Mohammed Ali Al-Garadi

The performance of current medical text summarization systems relies on resource-heavy domain-specific knowledge sources and on preprocessing methods (e.g., classification or deep learning) for deriving semantic information. Consequently, these systems are often difficult to customize, extend, or deploy in low-resource settings, and are operationally slow. We propose a fast summarization system that can aid practitioners at the point of care and thus improve evidence-based healthcare. At runtime, our system uses similarity measurements derived from pre-trained domain-specific word embeddings together with simple features, rather than bulky knowledge bases and resource-heavy preprocessing. Automatic evaluation on a public dataset for evidence-based medicine shows that, despite the simple implementation, our system's performance is statistically comparable with the state of the art.
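The runtime scoring idea lends itself to a compact sketch: rank candidate sentences by the cosine similarity between averaged word vectors for the question and for each sentence. The helper names and the toy embedding table are assumptions for illustration; the actual system uses pre-trained domain-specific embeddings plus additional simple features.

```python
import numpy as np

def sentence_vector(tokens, embeddings):
    """Average the word vectors of the tokens that have embeddings."""
    dim = len(next(iter(embeddings.values())))  # all vectors share one dimension
    vecs = [embeddings[t] for t in tokens if t in embeddings]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

def rank_sentences(question, sentences, embeddings, top_k=3):
    """Score each candidate sentence against the question and keep the top k."""
    q = sentence_vector(question.lower().split(), embeddings)
    scored = [(cosine(q, sentence_vector(s.lower().split(), embeddings)), s)
              for s in sentences]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [s for _, s in scored[:top_k]]

# Toy 2-d embeddings, purely for demonstration.
toy = {"fever": np.array([1.0, 0.0]),
       "temperature": np.array([0.9, 0.1]),
       "fracture": np.array([0.0, 1.0])}
print(rank_sentences("fever",
                     ["high temperature was noted", "no fracture seen"],
                     toy, top_k=1))
# ['high temperature was noted']
```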


Author(s):  
Nidhi Goyal ◽  
Niharika Sachdeva ◽  
Vijay Choudhary ◽  
Rijula Kar ◽  
Ponnurangam Kumaraguru ◽  
...  

2020 ◽  
Vol 34 (03) ◽  
pp. 2901-2908 ◽  
Author(s):  
Weijie Liu ◽  
Peng Zhou ◽  
Zhe Zhao ◽  
Zhiruo Wang ◽  
Qi Ju ◽  
...  

Pre-trained language representation models such as BERT capture a general language representation from large-scale corpora but lack domain-specific knowledge. When reading a domain text, experts make inferences with relevant knowledge. To give machines this capability, we propose a knowledge-enabled language representation model (K-BERT) with knowledge graphs (KGs), in which triples are injected into sentences as domain knowledge. However, injecting too much knowledge may divert a sentence from its correct meaning, an issue called knowledge noise (KN). To overcome KN, K-BERT introduces soft-position embeddings and a visible matrix that limit the impact of the injected knowledge. Because K-BERT can load the model parameters of a pre-trained BERT, it can inject domain knowledge simply by being equipped with a KG, without any pre-training of its own. Our investigation reveals promising results on twelve NLP tasks. In domain-specific tasks (including finance, law, and medicine) in particular, K-BERT significantly outperforms BERT, demonstrating that K-BERT is an excellent choice for knowledge-driven problems that require experts.
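The visible-matrix idea can be made concrete with a simplified sketch: tokens injected from a KG triple attend only to the entity they hang off, so the extra knowledge cannot leak into the rest of the sentence. The function below is an illustrative reconstruction under that simplified rule, not the authors' implementation, and it omits the soft-position indices.

```python
import numpy as np

# Simplified sketch of K-BERT's visible matrix (illustrative, not the
# authors' code). Knowledge tokens from an injected triple should see
# only the entity they attach to, which limits knowledge noise (KN).

def build_visible_matrix(sentence_tokens, injections):
    """
    sentence_tokens: tokens of the original sentence (the "trunk").
    injections: {sentence_position: [knowledge tokens]} injected after
                that position, e.g. {0: ["CEO", "Apple"]} for the triple
                (Cook, CEO, Apple) attached to token 0.
    Returns the flattened token list and an n x n 0/1 visible matrix.
    """
    tokens, anchor = [], []          # anchor[i] = (trunk position, is_trunk)
    for pos, tok in enumerate(sentence_tokens):
        tokens.append(tok)
        anchor.append((pos, True))
        for k in injections.get(pos, []):
            tokens.append(k)
            anchor.append((pos, False))   # knowledge token hangs off pos

    n = len(tokens)
    visible = np.zeros((n, n), dtype=int)
    for i in range(n):
        for j in range(n):
            pi, trunk_i = anchor[i]
            pj, trunk_j = anchor[j]
            if trunk_i and trunk_j:
                visible[i, j] = 1    # sentence tokens see each other
            elif pi == pj:
                visible[i, j] = 1    # a branch sees its own entity and itself
    return tokens, visible

tokens, vis = build_visible_matrix(
    ["Cook", "visited", "Beijing"],
    {0: ["CEO", "Apple"], 2: ["capital", "China"]})
print(tokens)
# ['Cook', 'CEO', 'Apple', 'visited', 'Beijing', 'capital', 'China']
```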

