scholarly journals Framework for large-scale automatic curation of heterogeneous cardiac MRI (ACUR MRI)

2021 ◽  
Vol 22 (Supplement_2) ◽  
Author(s):  
C Galazis ◽  
K Vimalesvaran ◽  
S Zaman ◽  
C Petri ◽  
J Howard ◽  
...  

Abstract Funding Acknowledgements Type of funding sources: Public grant(s) – National budget only. Main funding source(s): UKRI CDT in AI for Healthcare http://ai4health.io and British Heart Foundation Background  Data curation is an important process that structures and organises data, supporting research and the development of artificial intelligence models. However, manually curating a large volume of medical data is a time-consuming, repetitive and costly process that puts additional strain on clinical experts. The curation becomes more complex and demanding as more data sources are used. This leads to an introduction of disparity in the data structure and protocols.  Purpose  Here, we propose an automatic framework to curate large volumes of heterogenous cardiac MRI scans acquired across different sites and scanner vendors. Our framework requires minimal expert involvement throughout and works directly on DICOM images from the scanner or PACS.  The resulting structured standardised data allow for straightforward image analysis, hypothesis testing and the training and application of artificial intelligence models.  Methods  It is broken down into three main components anonymisation, cataloguing and outlier detection (see Figure 1). Anonymisation automatically removes any identifiable patient information from the DICOM image attributes. These data are replaced with anonymised labels, whilst maintaining relevant longitudinal information from each patient. DICOM attributes are also used to automatically group the different images according to imaging sequence (e.g. CINE, Delayed-Enhancement, T1 maps), acquisition geometry (e.g. short-axis, 2-chamber, 4-chamber) and imaging attributes (e.g. slice thickness, TE, TR), for easier querying. The sorting characteristics are flexible and can easily be defined by the user. Finally, we detect and flag, for subsequent manual inspection, any outliers within those groups, based on the similarity levels of chosen DICOM attributes. This framework additionally offers interactive image visualisation to allow users to assess its performance in real time.  Results  We tested the performance of ACUR CMRI on 26,668 CMR image series (723,531 images) from 858 patient examinations, which took place across two sites in four different scanners. With an average execution time per patient of 100 seconds, ACUR was able to sort imaging data with 1191 different sequence names into 43 categories. The framework can be freely downloaded from https://bitbucket.org/cmr-ai-working-group/acur/.  Conclusions  We present ACUR, an automatic framework to curate large volumes of heterogeneous cardiac MRI data. We show how it can quickly and automatically curate data, grouping it according to desired imaging characteristics defined in DICOM attributes. The proposed framework is flexible and ideally suited as a pre-processing tool for large biomedical imaging data studies.

Water ◽  
2019 ◽  
Vol 11 (2) ◽  
pp. 374 ◽  
Author(s):  
Taereem Kim ◽  
Ju-Young Shin ◽  
Hanbeen Kim ◽  
Sunghun Kim ◽  
Jun-Haeng Heo

Climate variability is strongly influencing hydrological processes under complex weather conditions, and it should be considered to forecast reservoir inflow for efficient dam operation strategies. Large-scale climate indices can provide potential information about climate variability, as they usually have a direct or indirect correlation with hydrologic variables. This study aims to use large-scale climate indices in monthly reservoir inflow forecasting for considering climate variability. For this purpose, time series and artificial intelligence models, such as Seasonal AutoRegressive Integrated Moving Average (SARIMA), SARIMA with eXogenous variables (SARIMAX), Artificial Neural Network (ANN), Adaptive Neural-based Fuzzy Inference System (ANFIS), and Random Forest (RF) models were employed with two types of input variables, autoregressive variables (AR-) and a combination of autoregressive and exogenous variables (ARX-). Several statistical methods, including ensemble empirical mode decomposition (EEMD), were used to select the lagged climate indices. Finally, monthly reservoir inflow was forecasted by SARIMA, SARIMAX, AR-ANN, ARX-ANN, AR-ANFIS, ARX-ANFIS, AR-RF, and ARX-RF models. As a result, the use of climate indices in artificial intelligence models showed a potential to improve the model performance, and the ARX-ANN and AR-RF models generally showed the best performance among the employed models.


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Albert T. Young ◽  
Kristen Fernandez ◽  
Jacob Pfau ◽  
Rasika Reddy ◽  
Nhat Anh Cao ◽  
...  

AbstractArtificial intelligence models match or exceed dermatologists in melanoma image classification. Less is known about their robustness against real-world variations, and clinicians may incorrectly assume that a model with an acceptable area under the receiver operating characteristic curve or related performance metric is ready for clinical use. Here, we systematically assessed the performance of dermatologist-level convolutional neural networks (CNNs) on real-world non-curated images by applying computational “stress tests”. Our goal was to create a proxy environment in which to comprehensively test the generalizability of off-the-shelf CNNs developed without training or evaluation protocols specific to individual clinics. We found inconsistent predictions on images captured repeatedly in the same setting or subjected to simple transformations (e.g., rotation). Such transformations resulted in false positive or negative predictions for 6.5–22% of skin lesions across test datasets. Our findings indicate that models meeting conventionally reported metrics need further validation with computational stress tests to assess clinic readiness.


2021 ◽  
Vol 7 (1) ◽  
Author(s):  
Xinran Wang ◽  
Liang Wang ◽  
Hong Bu ◽  
Ningning Zhang ◽  
Meng Yue ◽  
...  

AbstractProgrammed death ligand-1 (PD-L1) expression is a key biomarker to screen patients for PD-1/PD-L1-targeted immunotherapy. However, a subjective assessment guide on PD-L1 expression of tumor-infiltrating immune cells (IC) scoring is currently adopted in clinical practice with low concordance. Therefore, a repeatable and quantifiable PD-L1 IC scoring method of breast cancer is desirable. In this study, we propose a deep learning-based artificial intelligence-assisted (AI-assisted) model for PD-L1 IC scoring. Three rounds of ring studies (RSs) involving 31 pathologists from 10 hospitals were carried out, using the current guideline in the first two rounds (RS1, RS2) and our AI scoring model in the last round (RS3). A total of 109 PD-L1 (Ventana SP142) immunohistochemistry (IHC) stained images were assessed and the role of the AI-assisted model was evaluated. With the assistance of AI, the scoring concordance across pathologists was boosted to excellent in RS3 (0.950, 95% confidence interval (CI): 0.936–0.962) from moderate in RS1 (0.674, 95% CI: 0.614–0.735) and RS2 (0.736, 95% CI: 0.683–0.789). The 2- and 4-category scoring accuracy were improved by 4.2% (0.959, 95% CI: 0.953–0.964) and 13% (0.815, 95% CI: 0.803–0.827) (p < 0.001). The AI results were generally accepted by pathologists with 61% “fully accepted” and 91% “almost accepted”. The proposed AI-assisted method can help pathologists at all levels to improve the PD-L1 assay (SP-142) IC assessment in breast cancer in terms of both accuracy and concordance. The AI tool provides a scheme to standardize the PD-L1 IC scoring in clinical practice.


Sign in / Sign up

Export Citation Format

Share Document