scholarly journals LncRNA and predictive model to improve the diagnosis of clinically diagnosed pulmonary tuberculosis

2019 ◽  
Author(s):  
Xuejiao Hu ◽  
Hao Chen ◽  
Shun Liao ◽  
Hao Bai ◽  
Shubham Gupta ◽  
...  

ABSTRACTBackgroundClinically diagnosed pulmonary tuberculosis (PTB) patients lack Mycobacterium tuberculosis (MTB) microbiologic evidence, and misdiagnosis or delayed diagnosis often occurs as a consequence. We investigated the potential of lncRNAs and corresponding predictive models to diagnose these patients.MethodsWe enrolled 1372 subjects, including clinically diagnosed PTB patients, non-TB disease controls and healthy controls, in three cohorts (Screening, Selection and Validation). Candidate lncRNAs differentially expressed in blood samples of the PTB and healthy control groups were identified by microarray and qRT-PCR in the Screening Cohort. Logistic regression models were developed using lncRNAs and/or electronic health records (EHRs) from clinically diagnosed PTB patients and non-TB disease controls in the Selection Cohort. These models were evaluated by AUC and decision curve analysis, and the optimal model was presented as a Web-based nomogram, which was evaluated in the Validation Cohort. The biological function of lncRNAs was interrogated using ELISA, lactate dehydrogenase release analysis and flow cytometry.ResultsThree differentially expressed lncRNAs (ENST00000497872, n333737, n335265) were identified. The optimal model (i.e., nomogram) incorporated these three lncRNAs and six EHR variables (age, hemoglobin, weight loss, low-grade fever, CT calcification and TB-IGRA). The nomogram showed an AUC of 0.89, sensitivity of 0.86 and specificity of 0.82 in the Validation Cohort, which demonstrated better discrimination and clinical net benefit than the EHR model. ENST00000497872 may regulate inflammatory cytokine production, cell death and apoptosis during MTB infection.ConclusionLncRNAs and the user-friendly nomogram could facilitate the early identification of PTB cases among suspected patients with negative MTB microbiologic evidence.Key MessagesWhat is the key question?Does integrating immune-related lncRNA signatures and electronic health records (EHRs) promote the early identification of PTB patients who are symptomatic but lack microbiologic evidence of Mycobacterium tuberculosis (MTB)?What is the bottom line?We found three long non-coding RNAs (lncRNAs), i.e., ENST00000497872, n333737 and n335265, were potential diagnostic biomarkers for clinically diagnosed PTB patients; and we further developed and validated a novel nomogram incorporating these three lncRNAs and six electronic health records (EHRs), which were readily obtainable even in a resource-constrained setting and achieved a c-statistic of 0.89, sensitivity of 0.86 and specificity of 0.82 in a separate validation cohort.Why read on?This study focuses on the challenge of accurately diagnosing PTB patients with negative MTB microbiological evidence and serves as the first proof-of-concept that integrating lncRNA signatures and EHR data could be a more promising diagnostic approach for clinically diagnosed PTB patients.SUMMARYThis study developed and validated a novel nomogram that incorporated three lncRNAs and six EHR fields could be a useful predictive tool in identifying PTB patients who lack MTB microbiologic evidence.

2020 ◽  
Vol 58 (7) ◽  
Author(s):  
Xuejiao Hu ◽  
Shun Liao ◽  
Hao Bai ◽  
Shubham Gupta ◽  
Yi Zhou ◽  
...  

ABSTRACT Clinically diagnosed pulmonary tuberculosis (PTB) patients lack microbiological evidence of Mycobacterium tuberculosis, and misdiagnosis or delayed diagnosis often occurs as a consequence. We investigated the potential of long noncoding RNAs (lncRNAs) and corresponding predictive models to diagnose these patients. We enrolled 1,764 subjects, including clinically diagnosed PTB patients, microbiologically confirmed PTB cases, non-TB disease controls, and healthy controls, in three cohorts (screening, selection, and validation). Candidate lncRNAs differentially expressed in blood samples of the PTB and healthy control groups were identified by microarray and reverse transcription-quantitative PCR (qRT-PCR) in the screening cohort. Logistic regression models were developed using lncRNAs and/or electronic health records (EHRs) from clinically diagnosed PTB patients and non-TB disease controls in the selection cohort. These models were evaluated by area under the concentration-time curve (AUC) and decision curve analyses, and the optimal model was presented as a Web-based nomogram, which was evaluated in the validation cohort. Three differentially expressed lncRNAs (ENST00000497872, n333737, and n335265) were identified. The optimal model (i.e., nomogram) incorporated these three lncRNAs and six EHRs (age, hemoglobin, weight loss, low-grade fever, calcification detected by computed tomography [CT calcification], and interferon gamma release assay for tuberculosis [TB-IGRA]). The nomogram showed an AUC of 0.89, a sensitivity of 0.86, and a specificity of 0.82 in differentiating clinically diagnosed PTB cases from non-TB disease controls of the validation cohort, which demonstrated better discrimination and clinical net benefit than the EHR model. The nomogram also had a discriminative power (AUC, 0.90; sensitivity, 0.85; specificity, 0.81) in identifying microbiologically confirmed PTB patients. lncRNAs and the user-friendly nomogram could facilitate the early identification of PTB cases among suspected patients with negative M. tuberculosis microbiological evidence.


2019 ◽  
Author(s):  
Xuejiao Hu ◽  
Hao Chen ◽  
Shun Liao ◽  
Hao Bai ◽  
Shubham Gupta ◽  
...  

ABSTRACTBackgroundClinically diagnosed pulmonary tuberculosis (PTB) patients lack Mycobacterium tuberculosis (MTB) microbiologic evidence, and misdiagnosis or delayed diagnosis often occurs as a consequence. We investigated the potential of lncRNAs and corresponding predictive models to diagnose these patients.MethodsWe enrolled 1372 subjects, including clinically diagnosed PTB patients, non-TB disease controls and healthy controls, in three cohorts (Screening, Selection and Validation). Candidate lncRNAs differentially expressed in blood samples of the PTB and healthy control groups were identified by microarray and qRT-PCR in the Screening Cohort. Logistic regression models were developed using lncRNAs and/or electronic health records (EHRs) from clinically diagnosed PTB patients and non-TB disease controls in the Selection Cohort. These models were evaluated by AUC and decision curve analysis, and the optimal model was presented as a Web-based nomogram, which was evaluated in the Validation Cohort. The biological function of lncRNAs was interrogated using ELISA, lactate dehydrogenase release analysis and flow cytometry.ResultsThree differentially expressed lncRNAs (ENST00000497872, n333737, n335265) were identified. The optimal model (i.e., nomogram) incorporated these three lncRNAs and six EHR variables (age, hemoglobin, weight loss, low-grade fever, CT calcification and TB-IGRA). The nomogram showed an AUC of 0.89, sensitivity of 0.86 and specificity of 0.82 in the Validation Cohort, which demonstrated better discrimination and clinical net benefit than the EHR model. ENST00000497872 may regulate inflammatory cytokine production, cell death and apoptosis during MTB infection.ConclusionsLncRNAs and the user-friendly nomogram could facilitate the early identification of PTB cases among suspected patients with negative MTB microbiologic evidence.


2021 ◽  
Author(s):  
Nawar Shara ◽  
Kelley M. Anderson ◽  
Noor Falah ◽  
Maryam F. Ahmad ◽  
Darya Tavazoei ◽  
...  

BACKGROUND Healthcare data are fragmenting as patients seek care from diverse sources. Consequently, patient care is negatively impacted by disparate health records. Machine learning (ML) offers a disruptive force in its ability to inform and improve patient care and outcomes [6]. However, the differences that exist in each individual’s health records, combined with the lack of health-data standards, in addition to systemic issues that render the data unreliable and that fail to create a single view of each patient, create challenges for ML. While these problems exist throughout healthcare, they are especially prevalent within maternal health, and exacerbate the maternal morbidity and mortality (MMM) crisis in the United States. OBJECTIVE Maternal patient records were extracted from the electronic health records (EHRs) of a large tertiary healthcare system and made into patient-specific, complete datasets through a systematic method so that a machine-learning-based (ML-based) risk-assessment algorithm could effectively identify maternal cardiovascular risk prior to evidence of diagnosis or intervention within the patient’s record. METHODS We outline the effort that was required to define the specifications of the computational systems, the dataset, and access to relevant systems, while ensuring data security, privacy laws, and policies were met. Data acquisition included the concatenation, anonymization, and normalization of health data across multiple EHRs in preparation for its use by a proprietary risk-stratification algorithm designed to establish patient-specific baselines to identify and establish cardiovascular risk based on deviations from the patient’s baselines to inform early interventions. RESULTS Patient records can be made actionable for the goal of effectively employing machine learning (ML), specifically to identify cardiovascular risk in pregnant patients. CONCLUSIONS Upon acquiring data, including the concatenation, anonymization, and normalization of said data across multiple EHRs, the use of a machine-learning-based (ML-based) tool can provide early identification of cardiovascular risk in pregnant patients. CLINICALTRIAL N/A


2021 ◽  
Vol 10 (7) ◽  
pp. 1473
Author(s):  
Ru Wang ◽  
Zhuqi Miao ◽  
Tieming Liu ◽  
Mei Liu ◽  
Kristine Grdinovac ◽  
...  

Diabetic retinopathy (DR) is a leading cause for blindness among working-aged adults. The growing prevalence of diabetes urges for cost-effective tools to improve the compliance of eye examinations for early detection of DR. The objective of this research is to identify essential predictors and develop predictive technologies for DR using electronic health records. We conducted a retrospective analysis on a derivation cohort with 3749 DR and 94,127 non-DR diabetic patients. In the analysis, an ensemble predictor selection method was employed to find essential predictors among 26 variables in demographics, duration of diabetes, complications and laboratory results. A predictive model and a risk index were built based on the selected, essential predictors, and then validated using another independent validation cohort with 869 DR and 6448 non-DR diabetic patients. Out of the 26 variables, 10 were identified to be essential for predicting DR. The predictive model achieved a 0.85 AUC on the derivation cohort and a 0.77 AUC on the validation cohort. For the risk index, the AUCs were 0.81 and 0.73 on the derivation and validation cohorts, respectively. The predictive technologies can provide an early warning sign that motivates patients to comply with eye examinations for early screening and potential treatments.


2021 ◽  
Vol 8 ◽  
Author(s):  
Yao-Dan Liang ◽  
Yi-Bo Xie ◽  
Ming-Hui Du ◽  
Jing Shi ◽  
Jie-Fu Yang ◽  
...  

Background: This study aimed to develop and validate an electronic frailty index (eFI) based on routine electronic health records (EHR) for older adult inpatients and to analyze the correlations between frailty and hospitalized events and costs.Methods: We created an eFI from routine EHR and validated the effectiveness by the consistency of the comprehensive geriatric assessment-frailty index (CGA-FI) with an independent prospective cohort. Then, we analyzed the correlations between frailty and hospitalized events and costs by regressions.Results: During the study period, 49,226 inpatients were included in the analysis, 42,821 (87.0%) of which had enough data to calculate an eFI. A strong correlation between the CGA-FI and eFI was shown in the validation cohort of 685 subjects (Pearson's r = 0.716, P < 0.001). The sensitivity and specificity for an eFI≥0.15, the upper tertile, to identify frailty, defined as a CGA-FI≥0.25, were 64.8 and 88.7%, respectively. After adjusting for age, sex, and operation, an eFI≥0.15 showed an independent association with long hospital stay (odds ratio [OR] = 2.889, P < 0.001) and death in hospital (OR = 19.97, P < 0.001). Moreover, eFI values (per 0.1) were positively associated with total costs (β = 0.453, P < 0.001), examination costs (β = 0.269, P < 0.001), treatment costs (β = 0.414, P < 0.001), nursing costs (β = 0.381, P < 0.001), pharmacy costs (β = 0.524, P < 0.001), and material costs (β = 0.578, P < 0.001) after adjusting aforementioned factors.Conclusions: We successfully developed an effective eFI from routine EHR from a general hospital in China. Frailty is an independent risk factor for long hospital stay and death in hospital. As the degree of frailty increases, the hospitalized costs increase accordingly.


2019 ◽  
Vol 7 ◽  
Author(s):  
Hee Yun Seol ◽  
Sunghwan Sohn ◽  
Hongfang Liu ◽  
Chung-Il Wi ◽  
Euijung Ryu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document