Aspects of Intelligence in an "SP" Database System

2009 ◽  
pp. 725-754
Author(s):  
J. Gerard Wolff

This chapter describes some of the kinds of “intelligence” that may be exhibited by an intelligent database system based on the SP theory of computing and cognition. The chapter complements an earlier paper on the SP theory as the basis for an intelligent database system (Wolff, forthcoming b) but it does not depend on a reading of that earlier paper. The chapter introduces the SP theory and its main attractions as the basis for an intelligent database system: that it uses a simple but versatile format for diverse kinds of knowledge, that it integrates and simplifies a range of AI functions, and that it supports established database models when that is required. Then with examples and discussion, the chapter illustrates aspects of “intelligence” in the system: pattern recognition and information retrieval, several forms of probabilistic reasoning, the analysis and production of natural language, and the unsupervised learning of new knowledge.

2011 ◽  
pp. 197-237
Author(s):  
J. Gerard Wolff

This chapter describes some of the kinds of “intelligence” that may be exhibited by an intelligent database system based on the SP theory of computing and cognition. The chapter complements an earlier paper on the SP theory as the basis for an intelligent database system (Wolff, forthcoming b) but it does not depend on a reading of that earlier paper. The chapter introduces the SP theory and its main attractions as the basis for an intelligent database system: that it uses a simple but versatile format for diverse kinds of knowledge, that it integrates and simplifies a range of AI functions, and that it supports established database models when that is required. Then with examples and discussion, the chapter illustrates aspects of “intelligence” in the system: pattern recognition and information retrieval, several forms of probabilistic reasoning, the analysis and production of natural language, and the unsupervised learning of new knowledge.


2019 ◽  
Vol 53 (2) ◽  
pp. 3-10
Author(s):  
Muthu Kumar Chandrasekaran ◽  
Philipp Mayr

The 4 th joint BIRNDL workshop was held at the 42nd ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2019) in Paris, France. BIRNDL 2019 intended to stimulate IR researchers and digital library professionals to elaborate on new approaches in natural language processing, information retrieval, scientometrics, and recommendation techniques that can advance the state-of-the-art in scholarly document understanding, analysis, and retrieval at scale. The workshop incorporated different paper sessions and the 5 th edition of the CL-SciSumm Shared Task.


Author(s):  
Sijia Liu ◽  
Yanshan Wang ◽  
Andrew Wen ◽  
Liwei Wang ◽  
Na Hong ◽  
...  

BACKGROUND Widespread adoption of electronic health records has enabled the secondary use of electronic health record data for clinical research and health care delivery. Natural language processing techniques have shown promise in their capability to extract the information embedded in unstructured clinical data, and information retrieval techniques provide flexible and scalable solutions that can augment natural language processing systems for retrieving and ranking relevant records. OBJECTIVE In this paper, we present the implementation of a cohort retrieval system that can execute textual cohort selection queries on both structured data and unstructured text—Cohort Retrieval Enhanced by Analysis of Text from Electronic Health Records (CREATE). METHODS CREATE is a proof-of-concept system that leverages a combination of structured queries and information retrieval techniques on natural language processing results to improve cohort retrieval performance using the Observational Medical Outcomes Partnership Common Data Model to enhance model portability. The natural language processing component was used to extract common data model concepts from textual queries. We designed a hierarchical index to support the common data model concept search utilizing information retrieval techniques and frameworks. RESULTS Our case study on 5 cohort identification queries, evaluated using the precision at 5 information retrieval metric at both the patient-level and document-level, demonstrates that CREATE achieves a mean precision at 5 of 0.90, which outperforms systems using only structured data or only unstructured text with mean precision at 5 values of 0.54 and 0.74, respectively. CONCLUSIONS The implementation and evaluation of Mayo Clinic Biobank data demonstrated that CREATE outperforms cohort retrieval systems that only use one of either structured data or unstructured text in complex textual cohort queries.


Sign in / Sign up

Export Citation Format

Share Document