Bag-of-Concepts representation for document classification based on automatic knowledge acquisition from probabilistic knowledge base

2020, Vol 193, pp. 105436
Author(s): Pengfei Li, Kezhi Mao, Yuecong Xu, Qi Li, Jiaheng Zhang
Author(s): B Sathiya, T.V. Geetha

The prime textual sources used for ontology learning are a domain corpus and large, dynamic text from web pages. The first source is limited and possibly outdated, while the second is uncertain. To overcome these shortcomings, a novel ontology learning methodology is proposed that utilizes different sources of text, namely a corpus, web pages and the massive probabilistic knowledge base Probase, for effective automated construction of an ontology. Specifically, to discover taxonomical relations among the concepts of the ontology, a new web-page-based two-level semantic query formation methodology using lexical syntactic patterns (LSP) and a novel scoring measure, Fitness, built on Probase are proposed. Also, a syntactic and statistical measure called COS (Co-occurrence Strength) scoring and the Domain and Range-NTRD (Non-Taxonomical Relation Discovery) algorithms are proposed to accurately identify non-taxonomical relations (NTR) among concepts, using evidence from the corpus and web pages.


Author(s): Samir Rohatgi, James H. Oliver, Stuart S. Chen

This paper describes the development of OPGEN (Opportunity Generator), a computer-based system to help identify areas where a knowledge-based system (KBS) might be beneficial, and to evaluate whether a suitable system could be developed in that area. The core of the system is a knowledge base used to carry out the identification and evaluation functions. Ancillary functions introduce and demonstrate KBS technology to enhance the overall effectiveness of the system. All aspects of the development, from knowledge acquisition through to testing, are presented in this paper.


Author(s): Alfio Massimiliano Gliozzo, Aditya Kalyanpur

Automatic open-domain Question Answering has been a long-standing research challenge in the AI community. IBM Research undertook this challenge with the design of the DeepQA architecture and the implementation of Watson. This paper addresses a specific subtask of DeepQA: predicting the Lexical Answer Type (LAT) of a question. Our approach is completely unsupervised and is based on PRISMATIC, a large-scale lexical knowledge base automatically extracted from a Web corpus. Experiments on the Jeopardy! data show that it is possible to correctly predict the LAT in a substantial number of questions. This approach can be used for general-purpose knowledge acquisition tasks such as frame induction from text.


Author(s): Yingxu Wang

A cognitive knowledge base (CKB) is a novel structure of intelligent knowledge base that represents and manipulates knowledge as a dynamic concept network mimicking human knowledge processing. The essence of CKB is the denotational mathematical model of a formal concept that is dynamically associated with other concepts in a CKB, going beyond conventional rule-based or ontology-based knowledge bases. This paper presents a formal CKB and an autonomous knowledge manipulation system based on recent advances in neuroinformatics, concept algebra, semantic algebra, and cognitive computing. An item of knowledge in CKB is represented by a formal concept, while the entire knowledge base is embodied by a dynamic concept network. The CKB system is manipulated by algorithms of knowledge acquisition and retrieval on the basis of concept algebra. CKB serves as a kernel of cognitive learning engines for cognitive robots and machine learning systems. CKB plays a central role not only in explaining the mechanisms of human knowledge acquisition and learning, but also in the development of cognitive robots, cognitive learning engines, and knowledge-based systems.

