scholarly journals Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications

2010 ◽  
Vol 17 (5) ◽  
pp. 507-513 ◽  
Author(s):  
Guergana K Savova ◽  
James J Masanz ◽  
Philip V Ogren ◽  
Jiaping Zheng ◽  
Sunghwan Sohn ◽  
...  
2019 ◽  
Vol 26 (11) ◽  
pp. 1364-1369 ◽  
Author(s):  
Majid Afshar ◽  
Dmitriy Dligach ◽  
Brihat Sharma ◽  
Xiaoyuan Cai ◽  
Jason Boyda ◽  
...  

AbstractObjectiveNatural language processing (NLP) engines such as the clinical Text Analysis and Knowledge Extraction System are a solution for processing notes for research, but optimizing their performance for a clinical data warehouse remains a challenge. We aim to develop a high throughput NLP architecture using the clinical Text Analysis and Knowledge Extraction System and present a predictive model use case.Materials and MethodsThe CDW was comprised of 1 103 038 patients across 10 years. The architecture was constructed using the Hadoop data repository for source data and 3 large-scale symmetric processing servers for NLP. Each named entity mention in a clinical document was mapped to the Unified Medical Language System concept unique identifier (CUI).ResultsThe NLP architecture processed 83 867 802 clinical documents in 13.33 days and produced 37 721 886 606 CUIs across 8 standardized medical vocabularies. Performance of the architecture exceeded 500 000 documents per hour across 30 parallel instances of the clinical Text Analysis and Knowledge Extraction System including 10 instances dedicated to documents greater than 20 000 bytes. In a use–case example for predicting 30-day hospital readmission, a CUI-based model had similar discrimination to n-grams with an area under the curve receiver operating characteristic of 0.75 (95% CI, 0.74–0.76).Discussion and ConclusionOur health system’s high throughput NLP architecture may serve as a benchmark for large-scale clinical research using a CUI-based approach.


Author(s):  
Mouhcine El Hassani ◽  
Noureddine Falih ◽  
Belaid Bouikhalene

As information becomes increasingly abundant and accessible on the web, researchers do not have a need to go to excavate books in the libraries. These require a knowledge extraction system from the text (KEST). The goal of authors in this chapter is to identify the needs of a person to do a search in a text, which can be unstructured, and retrieve the terms of information related to the subject of research then structure them into classes of useful information. These may subsequently identify the general architecture of an information retrieval system from text documents in order to develop it and finally identify the parameters to evaluate its performance and the results retrieved.


2017 ◽  
Vol 10 ◽  
pp. 829-840
Author(s):  
Taniana Rodriguez ◽  
Jose Aguilar ◽  
Alexandra Gonzalez

2009 ◽  
Vol 15 (4) ◽  
pp. 31-39 ◽  
Author(s):  
Jesús Pascual Mena-Chalco ◽  
Roberto Marcondes Cesar Junior

Sign in / Sign up

Export Citation Format

Share Document