scholarly journals Diagnostic Performance of Endoscopic Ultrasound-Artificial Intelligence Using Deep Learning Analysis of Gallbladder Polypoid Lesions: A Feasibility Study

Author(s):  
Sung Ill Jang ◽  
Young Jae Kim ◽  
Eui Joo Kim ◽  
Huapyong Kang ◽  
Seung Jin Shon ◽  
...  

Abstract Endoscopic ultrasound (EUS) is the most accurate diagnostic modality for polypoid lesions of the gallbladder (GB), but is limited by subjective interpretation. We evaluated the diagnostic performance of deep learning-based artificial intelligence (AI) in differentiating polypoid lesions using EUS images. The diagnostic performance of the EUS-AI system with ResNet50 architecture was evaluated via three processes: training, internal validation, and testing. The diagnostic performance was also verified using an external validation cohort and compared with the performance of EUS endoscopists. In the AI development cohort, the diagnostic performance of EUS-AI including sensitivity, specificity, positive predictive value, negative predictive value and accuracy. For the differential diagnosis of neoplastic and non-neoplastic GB polyps, these values for EUS-AI were 77.8%, 91.6%, 57.9%, 96.5%, and 89.8%, respectively. In the external validation cohort, the differential diagnosis of neoplastic and non-neoplastic GB polyps, these values were 60.3%, 77.4%, 36.2%, 90.2%, and 74.4%, respectively, for EUS-AI; they were 74.2%, 44.9%, 75.4%, 46.2%, and 65.3%, respectively, for the endoscopists. The accuracy of the EUS-AI was between the accuracies of mid-level (66.7%) and expert EUS endoscopists (77.5%). This EUS-AI system showed favorable performance for the diagnosis of neoplastic GB polyps, with a performance comparable to that of EUS endoscopists.

2020 ◽  
Vol 41 (Supplement_2) ◽  
Author(s):  
K.H Jeon ◽  
J.M Kwon ◽  
K.H Kim ◽  
M.J Kim ◽  
S.H Lee ◽  
...  

Abstract Background Anemia changed the morphology of electrocardiography (ECG), and researchers suggested that mismatching oxygen demand and supply in the myocardium affects the ECG Purpose A deep-learning-based algorithm (DLA) that enables non-invasive anemia screening from electrocardiograms (ECGs) may improve the detection of anemia. Methods A DLA was developed using 57,435 ECGs from 31,898 patients and was internally validated using 7,369 ECGs from 7,369 patients taken at one hospital. External validation was performed using 4,068 ECGs from 4,068 patients admitted at another hospital. Three types of DLA were developed using 12-lead ECGs to detect hemoglobin levels of 10 mg/dL or less. The DLA was built by a convolutional neural network and used 500-Hz raw ECG, age, and sex as input data. Results During internal and external validation, the area under the receiver operating characteristics curve (AUROC) of the DLA using a 12-lead ECG for detecting anemia was 0.941 and 0.904, respectively. Using a 90% sensitivity operating point, the specificity, negative predictive value, and positive predictive value of internal validation were 0.889, 0.998, and 0.151, respectively, and those of external validation were 0.785, 0.994, and 0.166, respectively. The predicted Hgb level based on the DLA was correlated with the actual Hgb level (r=0.891, 95% CI 0.890–0.893, P<0.0001). 57 patients of moderate to severe anemia were treated with appropriated blood transfusion and predicted DLA score of most patients who received transfusion decreased after transfusion according to increase in hemoglobin level. Conclusion In this study, using raw ECG data, a DLA accurately detected anemia. The application of artificial intelligence to ECGs may enable screening for anemia. Correlation between DLA and actual Hb Funding Acknowledgement Type of funding source: None


Author(s):  
Matthias Unterhuber ◽  
Karl-Philipp Rommel ◽  
Karl-Patrik Kresoja ◽  
Julia Lurz ◽  
Jelena Kornej ◽  
...  

Abstract Background Heart failure with preserved ejection fraction (HFpEF) is a rapidly growing global health problem. To date, diagnosis of HFpEF is based on clinical, invasive and laboratory examinations. Electrocardiographic findings may vary, and there are no known typical ECG features for HFpEF. Methods This study included two patient cohorts. In the derivation cohort, we included n = 1884 patients who presented with exertional dyspnea or equivalent and preserved ejection fraction (≥50%) and clinical suspicion for coronary artery disease. The ECGs were divided in segments, yielding a total of 77.558 samples. We trained a convolutional neural network (CNN) to classify HFpEF and control patients according to ESC criteria. An external group of 203 volunteers in a prospective heart failure screening program served as validation cohort of the CNN. Results The external validation of the CNN yielded an AUC of 0.80 (95% CI 0.74–0.86) for detection of HFpEF according to ESC criteria, with a sensitivity of 0.99 (CI 0.98–0.99) and a specificity of 0.60 (95% CI 0.56–0.64), with a positive predictive value of 0.68 (95%CI 0.64–0.72) and a negative predictive value of 0.98 (95% CI 0.95–0.99). Conclusion In this study, we report the first deep learning-enabled CNN for identifying patients with HFpEF according to ESC criteria including NT-proBNP measurements in the diagnostic algorithm among patients at risk. The suitability of the CNN was validated on an external validation cohort of patients at risk for developing heart failure, showing a convincing screening performance.


2021 ◽  
pp. 028418512110589
Author(s):  
Peijun Li ◽  
Bao Feng ◽  
Yu Liu ◽  
Yehang Chen ◽  
Haoyang Zhou ◽  
...  

Background Deep learning (DL) has been used on medical images to grade, differentiate, and predict prognosis in many tumors. Purpose To explore the effect of computed tomography (CT)-based deep learning nomogram (DLN) for predicting cervical cancer lymph node metastasis (LNM) before surgery. Material and Methods In total, 418 patients with stage IB-IIB cervical cancer were retrospectively enrolled for model exploration (n = 296) and internal validation (n = 122); 62 patients from another independent institution were enrolled for external validation. A convolutional neural network (CNN) was used for DL features extracting from all lesions. The least absolute shrinkage and selection operator (Lasso) logistic regression was used to develop a deep learning signature (DLS). A DLN incorporating the DLS and clinical risk factors was proposed to predict LNM individually. The performance of the DLN was evaluated on internal and external validation cohorts. Results Stage, CT-reported pelvic lymph node status, and DLS were found to be independent predictors and could be used to construct the DLN. The combination showed a better performance than the clinical model and DLS. The proposed DLN had an area under the curve (AUC) of 0.925 in the training cohort, 0.771 in the internal validation cohort, and 0.790 in the external validation cohort. Decision curve analysis and stratification analysis suggested that the DLN has potential ability to generate a personalized probability of LNM in cervical cancer. Conclusion The proposed CT-based DLN could be used as a personalized non-invasive tool for preoperative prediction of LNM in cervical cancer, which could facilitate the choice of clinical treatment methods.


2021 ◽  
Vol 11 ◽  
Author(s):  
Gumuyang Zhang ◽  
Zhe Wu ◽  
Lili Xu ◽  
Xiaoxiao Zhang ◽  
Daming Zhang ◽  
...  

BackgroundClinical treatment decision making of bladder cancer (BCa) relies on the absence or presence of muscle invasion and tumor staging. Deep learning (DL) is a novel technique in image analysis, but its potential for evaluating the muscular invasiveness of bladder cancer remains unclear. The purpose of this study was to develop and validate a DL model based on computed tomography (CT) images for prediction of muscle-invasive status of BCa.MethodsA total of 441 BCa patients were retrospectively enrolled from two centers and were divided into development (n=183), tuning (n=110), internal validation (n=73) and external validation (n=75) cohorts. The model was built based on nephrographic phase images of preoperative CT urography. Receiver operating characteristic (ROC) curves were performed and the area under the ROC curve (AUC) for discrimination between muscle-invasive BCa and non-muscle-invasive BCa was calculated. The performance of the model was evaluated and compared with that of the subjective assessment by two radiologists.ResultsThe DL model exhibited relatively good performance in all cohorts [AUC: 0.861 in the internal validation cohort, 0.791 in the external validation cohort] and outperformed the two radiologists. The model yielded a sensitivity of 0.733, a specificity of 0.810 in the internal validation cohort and a sensitivity of 0.710 and a specificity of 0.773 in the external validation cohort.ConclusionThe proposed DL model based on CT images exhibited relatively good prediction ability of muscle-invasive status of BCa preoperatively, which may improve individual treatment of BCa.


Author(s):  
Hongmei Wang ◽  
Lu Wang ◽  
Edward H. Lee ◽  
Jimmy Zheng ◽  
Wei Zhang ◽  
...  

Abstract Purpose High-dimensional image features that underlie COVID-19 pneumonia remain opaque. We aim to compare feature engineering and deep learning methods to gain insights into the image features that drive CT-based for COVID-19 pneumonia prediction, and uncover CT image features significant for COVID-19 pneumonia from deep learning and radiomics framework. Methods A total of 266 patients with COVID-19 and other viral pneumonia with clinical symptoms and CT signs similar to that of COVID-19 during the outbreak were retrospectively collected from three hospitals in China and the USA. All the pneumonia lesions on CT images were manually delineated by four radiologists. One hundred eighty-four patients (n = 93 COVID-19 positive; n = 91 COVID-19 negative; 24,216 pneumonia lesions from 12,001 CT image slices) from two hospitals from China served as discovery cohort for model development. Thirty-two patients (17 COVID-19 positive, 15 COVID-19 negative; 7883 pneumonia lesions from 3799 CT image slices) from a US hospital served as external validation cohort. A bi-directional adversarial network-based framework and PyRadiomics package were used to extract deep learning and radiomics features, respectively. Linear and Lasso classifiers were used to develop models predictive of COVID-19 versus non-COVID-19 viral pneumonia. Results 120-dimensional deep learning image features and 120-dimensional radiomics features were extracted. Linear and Lasso classifiers identified 32 high-dimensional deep learning image features and 4 radiomics features associated with COVID-19 pneumonia diagnosis (P < 0.0001). Both models achieved sensitivity > 73% and specificity > 75% on external validation cohort with slight superior performance for radiomics Lasso classifier. Human expert diagnostic performance improved (increase by 16.5% and 11.6% in sensitivity and specificity, respectively) when using a combined deep learning-radiomics model. Conclusions We uncover specific deep learning and radiomics features to add insight into interpretability of machine learning algorithms and compare deep learning and radiomics models for COVID-19 pneumonia that might serve to augment human diagnostic performance.


2021 ◽  
Author(s):  
Sang-Heon Lim ◽  
Young Jae Kim ◽  
Yeon-Ho Park ◽  
Doojin Kim ◽  
Kwang Gi Kim ◽  
...  

Abstract Pancreas segmentation is necessary for observing lesions, analyzing anatomical structures, and predicting patient prognosis. Therefore, various studies have designed segmentation models based on convolutional neural networks for pancreas segmentation. However, the deep learning approach is limited by a lack of data, and studies conducted on a large computed tomography dataset are scarce. Therefore, this study aims to perform deep-learning-based semantic segmentation on 1,006 participants and evaluate the automatic segmentation performance of the pancreas via four individual three-dimensional segmentation networks. In this study, we performed internal validation with 1,006 patients and external validation using the Cancer Imaging Archive (TCIA) pancreas dataset. We obtained mean precision, recall, and dice similarity coefficients of 0.869, 0.842, and 0.842, respectively, for internal validation via a relevant approach among the four deep learning networks. Using the external dataset, the deep learning network achieved mean precision, recall, and dice similarity coefficients of 0.779, 0.749, and 0.735, respectively. We expect that generalized deep-learning-based systems can assist clinical decisions by providing accurate pancreatic segmentation and quantitative information of the pancreas for abdominal computed tomography.


Author(s):  
Shaoxu Wu ◽  
Xiong Chen ◽  
Jiexin Pan ◽  
Wen Dong ◽  
Xiayao Diao ◽  
...  

Abstract Background Cystoscopy plays an important role in bladder cancer (BCa) diagnosis and treatment, but its sensitivity needs improvement. Artificial intelligence has shown promise in endoscopy, but few cystoscopic applications have been reported. We report a Cystoscopy Artificial Intelligence Diagnostic System (CAIDS) for BCa diagnosis. Methods In total, 69,204 images from 10,729 consecutive patients from six hospitals were collected and divided into training, internal validation, and external validation sets. The CAIDS was built using a pyramid scene parsing network and transfer learning. A subset (n = 260) of the validation sets was used for a performance comparison between the CAIDS and urologists for complex lesion detection. The diagnostic accuracy, sensitivity, specificity, and positive and negative predictive values and 95% confidence intervals (CIs) were calculated using the Clopper-Pearson method. Results The diagnostic accuracies of the CAIDS were 0.977 (95% CI = 0.974–0.979) in the internal validation set and 0.990 (95% CI = 0.979–0.996), 0.982 (95% CI = 0.974–0.988), 0.978 (95% CI = 0.959–0.989), and 0.991 (95% CI = 0.987–0.994) in different external validation sets. In the CAIDS versus urologists’ comparisons, the CAIDS showed high accuracy and sensitivity (accuracy = 0.939, 95% CI = 0.902–0.964; and sensitivity = 0.954, 95% CI = 0.902–0.983) with a short latency of 12 s, much more accurate and quicker than the expert urologists. Conclusions The CAIDS achieved accurate BCa detection with a short latency. The CAIDS may provide many clinical benefits, from increasing the diagnostic accuracy for BCa, even for commonly misdiagnosed cases such as flat cancerous tissue (carcinoma in situ), to reducing the operation time for cystoscopy.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Pei Yang ◽  
Yong Pi ◽  
Tao He ◽  
Jiangming Sun ◽  
Jianan Wei ◽  
...  

Abstract Background 99mTc-pertechnetate thyroid scintigraphy is a valid complementary avenue for evaluating thyroid disease in the clinic, the image feature of thyroid scintigram is relatively simple but the interpretation still has a moderate consistency among physicians. Thus, we aimed to develop an artificial intelligence (AI) system to automatically classify the four patterns of thyroid scintigram. Methods We collected 3087 thyroid scintigrams from center 1 to construct the training dataset (n = 2468) and internal validating dataset (n = 619), and another 302 cases from center 2 as external validating datasets. Four pre-trained neural networks that included ResNet50, DenseNet169, InceptionV3, and InceptionResNetV2 were implemented to construct AI models. The models were trained separately with transfer learning. We evaluated each model’s performance with metrics as following: accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), recall, precision, and F1-score. Results The overall accuracy of four pre-trained neural networks in classifying four common uptake patterns of thyroid scintigrams all exceeded 90%, and the InceptionV3 stands out from others. It reached the highest performance with an overall accuracy of 92.73% for internal validation and 87.75% for external validation, respectively. As for each category of thyroid scintigrams, the area under the receiver operator characteristic curve (AUC) was 0.986 for ‘diffusely increased,’ 0.997 for ‘diffusely decreased,’ 0.998 for ‘focal increased,’ and 0.945 for ‘heterogeneous uptake’ in internal validation, respectively. Accordingly, the corresponding performances also obtained an ideal result of 0.939, 1.000, 0.974, and 0.915 in external validation, respectively. Conclusions Deep convolutional neural network-based AI model represented considerable performance in the classification of thyroid scintigrams, which may help physicians improve the interpretation of thyroid scintigrams more consistently and efficiently.


Author(s):  
Baris Turkbey ◽  
Masoom A. Haider

Prostate cancer (PCa) is the most common cancer type in males in the Western World. MRI has an established role in diagnosis of PCa through guiding biopsies. Due to multistep complex nature of the MRI-guided PCa diagnosis pathway, diagnostic performance has a big variation. Developing artificial intelligence (AI) models using machine learning, particularly deep learning, has an expanding role in radiology. Specifically, for prostate MRI, several AI approaches have been defined in the literature for prostate segmentation, lesion detection and classification with the aim of improving diagnostic performance and interobserver agreement. In this review article, we summarize the use of radiology applications of AI in prostate MRI.


Sign in / Sign up

Export Citation Format

Share Document