Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications

Abstract

Objective: Natural language processing (NLP) engines such as the clinical Text Analysis and Knowledge Extraction System are a solution for processing notes for research, but optimizing their performance for a clinical data warehouse (CDW) remains a challenge. We aim to develop a high throughput NLP architecture using the clinical Text Analysis and Knowledge Extraction System and present a predictive model use case.

Materials and Methods: The CDW comprised 1 103 038 patients across 10 years. The architecture was constructed using the Hadoop data repository for source data and 3 large-scale symmetric processing servers for NLP. Each named entity mention in a clinical document was mapped to a Unified Medical Language System concept unique identifier (CUI).

Results: The NLP architecture processed 83 867 802 clinical documents in 13.33 days and produced 37 721 886 606 CUIs across 8 standardized medical vocabularies. Performance of the architecture exceeded 500 000 documents per hour across 30 parallel instances of the clinical Text Analysis and Knowledge Extraction System, including 10 instances dedicated to documents greater than 20 000 bytes. In a use-case example for predicting 30-day hospital readmission, a CUI-based model had similar discrimination to n-grams, with an area under the receiver operating characteristic curve of 0.75 (95% CI, 0.74–0.76).

Discussion and Conclusion: Our health system’s high throughput NLP architecture may serve as a benchmark for large-scale clinical research using a CUI-based approach.
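The figures in the abstract above can be checked with a little arithmetic. The following is a minimal sketch, not the authors' code: it computes the average throughput implied by the totals and shows one way documents might be routed by size. The pool names and the `route` helper are hypothetical; only the numbers and the 20 000-byte threshold come from the abstract.

```python
# Figures quoted in the abstract above.
TOTAL_DOCUMENTS = 83_867_802
TOTAL_CUIS = 37_721_886_606
ELAPSED_DAYS = 13.33
LARGE_DOC_BYTES = 20_000  # size threshold named in the abstract


def average_docs_per_hour(documents: int, days: float) -> float:
    """Mean throughput over the whole run, in documents per hour."""
    return documents / (days * 24)


def route(document: str) -> str:
    """Pick a worker pool by UTF-8 size (pool names are hypothetical)."""
    size = len(document.encode("utf-8"))
    return "large_pool" if size > LARGE_DOC_BYTES else "standard_pool"


if __name__ == "__main__":
    print(round(average_docs_per_hour(TOTAL_DOCUMENTS, ELAPSED_DAYS)))
    print(round(TOTAL_CUIS / TOTAL_DOCUMENTS))
    print(route("short note"), route("x" * 25_000))
```

The whole-run average works out to roughly 262 000 documents per hour and about 450 CUIs per document. The average is lower than the quoted rate of more than 500 000 documents per hour; the abstract does not say how the two figures relate.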

Knowledge Extraction System from Reports in Fabrication Workshops

Global Perspective for Competitive Enterprise, Economy and Ecology - Advanced Concurrent Engineering ◽

10.1007/978-1-84882-762-2_29 ◽

2009 ◽

pp. 317-325

Author(s):

Kazuo Hiekata ◽

Hiroyuki Yamato ◽

Sho Tsujimoto

Keyword(s):

Knowledge Extraction ◽

Extraction System

Clinical text analysis using machine learning methods

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS) ◽

10.1109/icis.2016.7550908 ◽

2016 ◽

Cited By ~ 2

Author(s):

Krishna Prasad Chodey ◽

Gongzhu Hu

Keyword(s):

Machine Learning ◽

Text Analysis ◽

Learning Methods ◽

Clinical Text ◽

Machine Learning Methods

A knowledge extraction system from online reviews using fuzzy logic

2012 Ninth International Conference on Computer Science and Software Engineering (JCSSE) ◽

10.1109/jcsse.2012.6261950 ◽

2012 ◽

Author(s):

Phichayasini Kitwatthanathawon ◽

Thara Angskun ◽

Jitimon Angskun

Keyword(s):

Fuzzy Logic ◽

Online Reviews ◽

Knowledge Extraction ◽

Extraction System

A Knowledge Extraction System from Manager's Operation Sequences in System Development Project

2016 5th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI) ◽

10.1109/iiai-aai.2016.131 ◽

2016 ◽

Author(s):

Masaki Samejima ◽

Masanori Akiyoshi

Keyword(s):

System Development ◽

Knowledge Extraction ◽

Development Project ◽

Extraction System

KEYS: A Knowledge Extraction System Based on UNL Knowledge Infrastructure

The Egyptian Journal of Language Engineering ◽

10.21608/ejle.2015.60255 ◽

2015 ◽

Vol 2 (1) ◽

pp. 25-42

Author(s):

Sameh Alansary ◽

Magdy Nagi

Keyword(s):

Knowledge Extraction ◽

Extraction System ◽

Knowledge Infrastructure

WIKE: A Web Information/Knowledge Extraction System for Web Service Generation

10.1109/icwe.2008.30 ◽

2008 ◽

Cited By ~ 7

Author(s):

Hao Han ◽

Takehiro Tokuda

Keyword(s):

Web Service ◽

Knowledge Extraction ◽

Extraction System ◽

Web Information

Search for Information in Text Files

Advances in Library and Information Science - Critical Approaches to Information Retrieval Research ◽

10.4018/978-1-7998-1021-6.ch004 ◽

2020 ◽

pp. 69-77

Author(s):

Mouhcine El Hassani ◽

Noureddine Falih ◽

Belaid Bouikhalene

Keyword(s):

Information Retrieval ◽

Retrieval System ◽

Knowledge Extraction ◽

Information Retrieval System ◽

Extraction System ◽

Text Documents ◽

The Subject ◽

Search For Information ◽

General Architecture ◽

The Web

As information becomes increasingly abundant and accessible on the web, researchers no longer need to dig through books in libraries. Instead, they require a knowledge extraction system from text (KEST). The goal of this chapter is to identify what a person needs in order to search a text, which may be unstructured, retrieve the terms related to the subject of research, and then structure them into classes of useful information. These needs can then be used to identify the general architecture of an information retrieval system for text documents, to develop that system, and finally to identify the parameters for evaluating its performance and the results retrieved.
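The abstract does not name the evaluation parameters it has in mind. As an illustration only, the sketch below assumes the two most common retrieval measures, precision and recall, computed over sets of document identifiers. The function name and the sample identifiers are hypothetical.

```python
from __future__ import annotations


def precision_recall(retrieved: set[str], relevant: set[str]) -> tuple[float, float]:
    """Return (precision, recall) for one query.

    precision = relevant documents retrieved / all documents retrieved
    recall    = relevant documents retrieved / all relevant documents
    """
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    # Hypothetical result set and ground truth for a single query.
    retrieved = {"d1", "d2", "d3", "d4"}
    relevant = {"d2", "d4", "d5"}
    precision, recall = precision_recall(retrieved, relevant)
    print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this example two of the four retrieved documents are relevant, so precision is 0.50, and two of the three relevant documents were found, so recall is about 0.67.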
