Analysis of Tracheobronchial Diverticula Based on Semantic Segmentation of CT Images via the Dual-Channel Attention Network

2022 ◽  
Vol 9 ◽  
Author(s):  
Maoyi Zhang ◽  
Changqing Ding ◽  
Shuli Guo

Tracheobronchial diverticula (TD) are common cystic lesions that can be easily neglected; hence, accurate and rapid identification is critical for later diagnosis. There is a strong need to automate this diagnostic process because traditional manual observations are time-consuming and laborious. However, most studies have only focused on case reports or on listing the relationship between the disease and other physiological indicators, and few have adopted advanced technologies such as deep learning for automated identification and diagnosis. To fill this gap, this study interpreted TD recognition as semantic segmentation and proposed a novel attention-based network for TD semantic segmentation. Since TD lesions are small and similar in appearance to surrounding organs, we designed atrous spatial pyramid pooling (ASPP) and attention mechanisms that can efficiently complete the segmentation of TD with robust results. The proposed attention model can selectively gather features from different branches according to the amount of information they contain. Moreover, to the best of our knowledge, no public research data are available yet. For efficient network training, we constructed a data set containing 218 TD samples and the related ground truth (GT). We evaluated different models on the proposed data set, among which the highest mean intersection over union (MIoU) reached 0.92. The experiments show that our model outperforms state-of-the-art methods, indicating that deep learning has great potential for TD recognition.
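
As a concrete illustration of the ASPP idea mentioned in this abstract, the following is a minimal sketch in PyTorch; the channel sizes and dilation rates are illustrative assumptions, not the authors' exact configuration, and the attention-based branch fusion is omitted.

```python
# Minimal ASPP sketch: parallel atrous convolutions capture context at
# several scales, then a 1x1 convolution fuses the concatenated features.
# Channel counts and dilation rates are assumptions for illustration.
import torch
import torch.nn as nn

class ASPP(nn.Module):
    def __init__(self, in_ch=256, out_ch=256, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 3, padding=r, dilation=r, bias=False),
                nn.BatchNorm2d(out_ch),
                nn.ReLU(inplace=True),
            )
            for r in rates
        ])
        self.project = nn.Conv2d(out_ch * len(rates), out_ch, 1)

    def forward(self, x):
        # Concatenate the multi-scale branch outputs and project back down.
        return self.project(torch.cat([b(x) for b in self.branches], dim=1))
```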

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Marc Aubreville ◽  
Christof A. Bertram ◽  
Christian Marzahl ◽  
Corinne Gurtner ◽  
Martina Dettwiler ◽  
...  

Abstract Manual counting of mitotic figures, which is performed in the tumor region with the highest mitotic activity, is a key parameter of most tumor grading schemes. It can, however, depend strongly on the area selection due to the uneven distribution of mitotic figures in the tumor section. We aimed to assess how strongly area selection can impact the mitotic count, which is known to have high inter-rater disagreement. On a data set of 32 whole slide images of H&E-stained canine cutaneous mast cell tumors, fully annotated for mitotic figures, we asked eight veterinary pathologists (five board-certified, three in training) to select a field of interest for the mitotic count. To assess the potential effect on the mitotic count, we compared the mitotic count of the selected regions to the overall distribution on the slide. Additionally, we evaluated three deep learning-based methods for assessing the highest mitotic density: in the first approach, the model directly predicts the mitotic count for the presented image patches as a regression task; the second method derives a segmentation mask for mitotic figures, which is then used to obtain a mitotic density; finally, we evaluated a two-stage object-detection pipeline based on state-of-the-art architectures to identify individual mitotic figures. We found that the predictions by all models were, on average, better than those of the experts. The two-stage object detector performed best and outperformed most of the human pathologists on the majority of tumor cases. The correlation between the predicted and the ground truth mitotic count was also best for this approach (0.963–0.979). Further, we found considerable differences in position selection between pathologists, which could partially explain the high variance that has been reported for the manual mitotic count. To achieve better inter-rater agreement, we propose to use computer-based area selection to support the pathologist in the manual mitotic count.
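
To make the proposed computer-based area selection concrete, here is a minimal sketch: given a per-pixel density map of predicted mitotic figures for a slide, a window corresponding to the counting area (e.g. roughly 10 high-power fields) is slid over the map and the position with the highest estimated count is returned. The window size and stride are illustrative assumptions, not the authors' exact settings.

```python
# Sketch of computer-assisted field-of-interest selection from a predicted
# mitotic-figure density map. Window size is an illustrative assumption.
import numpy as np
from scipy.ndimage import uniform_filter

def select_field(density_map, window=(2048, 2048)):
    """density_map: 2D array of predicted mitotic-figure density per pixel."""
    # Mean filter times window area = expected mitotic count inside each window.
    counts = uniform_filter(density_map, size=window, mode="constant") * np.prod(window)
    y, x = np.unravel_index(np.argmax(counts), counts.shape)
    return (y, x), counts[y, x]  # window centre and estimated mitotic count
```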


2020 ◽  
Vol 39 (10) ◽  
pp. 734-741
Author(s):  
Sébastien Guillon ◽  
Frédéric Joncour ◽  
Pierre-Emmanuel Barrallon ◽  
Laurent Castanié

We propose new metrics to measure the performance of a deep learning model applied to seismic interpretation tasks such as fault and horizon extraction. Faults and horizons are thin geologic boundaries (1 pixel thick on the image) for which a small prediction error can lead to disproportionately large variations in common metrics (precision, recall, and intersection over union). Through two examples, we show how classical metrics can fail to indicate the true quality of fault or horizon extraction. Measuring the accuracy of reconstruction of thin objects or boundaries requires introducing a tolerance distance between ground truth and prediction images to manage the uncertainties inherent in their delineation. We therefore adapt our metrics by introducing a tolerance function and illustrate their ability to manage uncertainties in seismic interpretation. We compare classical and new metrics through different examples and demonstrate the robustness of our metrics. Finally, on a 3D West African data set, we show how our metrics are used to tune an optimal deep learning model.
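
The tolerance idea can be sketched as follows: a predicted pixel counts as correct if it lies within a tolerance distance of a ground-truth pixel, and a ground-truth pixel counts as recovered if a prediction lies within the same distance of it. This is a simplified illustration of the concept described above, not the authors' exact formulation.

```python
# Tolerance-aware precision/recall for 1-pixel-thick structures such as
# faults or horizons. `tol` is the tolerance distance in pixels.
import numpy as np
from scipy.ndimage import distance_transform_edt

def tolerant_precision_recall(pred, gt, tol=3):
    """pred, gt: boolean 2D masks of the thin structure."""
    dist_to_gt = distance_transform_edt(~gt)      # distance of every pixel to the GT curve
    dist_to_pred = distance_transform_edt(~pred)  # distance of every pixel to the prediction
    tp_pred = np.logical_and(pred, dist_to_gt <= tol).sum()
    tp_gt = np.logical_and(gt, dist_to_pred <= tol).sum()
    precision = tp_pred / max(pred.sum(), 1)
    recall = tp_gt / max(gt.sum(), 1)
    return precision, recall
```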


2019 ◽  
Vol 38 (11) ◽  
pp. 872a1-872a9 ◽  
Author(s):  
Mauricio Araya-Polo ◽  
Stuart Farris ◽  
Manuel Florez

Exploration seismic data are heavily manipulated before human interpreters are able to extract meaningful information regarding subsurface structures. This manipulation adds modeling and human biases and is limited by methodological shortcomings. Alternatively, using seismic data directly is becoming possible thanks to deep learning (DL) techniques. A DL-based workflow is introduced that uses analog velocity models and realistic raw seismic waveforms as input and produces subsurface velocity models as output. When insufficient data are used for training, DL algorithms tend to overfit or fail. Gathering large amounts of labeled and standardized seismic data sets is not straightforward. This shortage of quality data is addressed by building a generative adversarial network (GAN) to augment the original training data set, which is then used by DL-driven seismic tomography as input. The DL tomographic operator predicts velocity models with high statistical and structural accuracy after being trained with GAN-generated velocity models. Beyond the field of exploration geophysics, the use of machine learning in earth science is challenged by the lack of labeled data or properly interpreted ground truth, since we seldom know what truly exists beneath the earth's surface. The unsupervised approach (using GANs to generate labeled data) illustrates a way to mitigate this problem and opens geology, geophysics, and planetary sciences to more DL applications.
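
A minimal sketch in PyTorch of the augmentation step described above: once a GAN has been trained adversarially, its generator is sampled to enlarge the set of velocity models used to train the tomography network. The generator architecture, latent size, and output resolution are illustrative assumptions, not the authors' design.

```python
# Sketch of sampling a trained GAN generator to produce additional velocity
# models for training data augmentation. Architecture details are assumptions.
import torch
import torch.nn as nn

class VelocityGenerator(nn.Module):
    def __init__(self, latent_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(latent_dim, 256, 4, 1, 0), nn.ReLU(True),  # 4x4
            nn.ConvTranspose2d(256, 128, 4, 2, 1), nn.ReLU(True),         # 8x8
            nn.ConvTranspose2d(128, 64, 4, 2, 1), nn.ReLU(True),          # 16x16
            nn.ConvTranspose2d(64, 1, 4, 2, 1), nn.Sigmoid(),             # 32x32 velocity map
        )

    def forward(self, z):
        return self.net(z[:, :, None, None])

# After adversarial training, new labeled samples are drawn by sampling the
# latent space; the corresponding waveforms are then simulated for each model.
generator = VelocityGenerator()
z = torch.randn(16, 128)
synthetic_models = generator(z)  # 16 additional velocity models
```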


2021 ◽  
Vol 10 (8) ◽  
pp. 551
Author(s):  
Jiaxin Zhang ◽  
Tomohiro Fukuda ◽  
Nobuyoshi Yabuki

Precise measurement of urban façade color is necessary for urban color planning. The existing manual methods of measuring building façade color are limited by time and labor costs and can hardly be carried out at a city scale. These methods also make it challenging to identify the role of building function in controlling and guiding urban color planning. This paper explores a city-scale approach to façade color measurement with building functional classification using state-of-the-art deep learning techniques and street view images. Firstly, to pre-process the collected street view images, we used semantic segmentation to extract building façades and performed color calibration of the photos. Then, we proposed a color chart-based façade color measurement method and a multi-label deep learning-based building classification method. Next, the field survey data were used as the ground truth to verify the accuracy of the façade color measurement and building function classification. Finally, we applied our approach to generate façade color distribution maps with the building classification for three metropolises in China, and the results proved the transferability and effectiveness of the scheme. The proposed approach can provide city managers with an overall perception of urban façade color and building function across city-scale areas in a cost-efficient way, contributing to data-driven decision making for urban analytics and planning.
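
A minimal sketch of the chart-based color measurement idea: pixels inside the predicted façade mask are matched to the nearest color in a reference chart, and the most frequent chart color is reported. The chart values and distance metric here are placeholders for illustration, not the chart used in the paper.

```python
# Sketch of dominant façade colour extraction from a segmentation mask using
# nearest-chart-colour matching. Chart contents are illustrative placeholders.
import numpy as np

def dominant_facade_color(image, facade_mask, color_chart):
    """image: HxWx3 RGB array; facade_mask: HxW bool; color_chart: Nx3 array."""
    pixels = image[facade_mask].astype(float)  # façade pixels only
    # Assign each pixel to its nearest chart colour (Euclidean distance in RGB).
    dists = np.linalg.norm(pixels[:, None, :] - color_chart[None, :, :], axis=2)
    counts = np.bincount(dists.argmin(axis=1), minlength=len(color_chart))
    return color_chart[counts.argmax()], counts / counts.sum()
```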


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Dominik Müller ◽  
Frank Kramer

Abstract Background The increased availability and use of modern medical imaging has induced a strong need for automatic medical image segmentation. Still, current image segmentation platforms do not provide the required functionalities for a straightforward setup of medical image segmentation pipelines. Already implemented pipelines are commonly standalone software, optimized on a specific public data set. Therefore, this paper introduces the open-source Python library MIScnn. Implementation The aim of MIScnn is to provide an intuitive API allowing fast building of medical image segmentation pipelines, including data I/O, preprocessing, data augmentation, patch-wise analysis, metrics, a library of state-of-the-art deep learning models, and model utilization such as training, prediction, and fully automatic evaluation (e.g. cross-validation). In addition, high configurability and multiple open interfaces allow full pipeline customization. Results Running a cross-validation with MIScnn on the Kidney Tumor Segmentation Challenge 2019 data set (multi-class semantic segmentation with 300 CT scans) resulted in a powerful predictor based on the standard 3D U-Net model. Conclusions With this experiment, we could show that the MIScnn framework enables researchers to rapidly set up a complete medical image segmentation pipeline with just a few lines of code. The source code for MIScnn is available in the Git repository: https://github.com/frankkramer-lab/MIScnn.
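
As an illustration of the "few lines of code" pipeline setup the abstract describes, here is a sketch loosely adapted from the MIScnn documentation; the exact module paths, class names, and argument names are assumptions and may differ between library versions.

```python
# Sketch of a MIScnn-style segmentation pipeline for KiTS19 CT volumes.
# API details are assumptions based on the project's documentation and may
# not match the installed version exactly.
import miscnn
from miscnn.data_loading.interfaces import NIFTI_interface

# Data I/O for single-channel CT volumes with three segmentation classes.
interface = NIFTI_interface(channels=1, classes=3)
data_io = miscnn.Data_IO(interface, "kits19/data")

# Preprocessor: patch-wise analysis with batching (and default augmentation).
pp = miscnn.Preprocessor(data_io, batch_size=2, analysis="patchwise-crop",
                         patch_shape=(80, 160, 160))

# Neural network with the default standard 3D U-Net architecture.
model = miscnn.Neural_Network(preprocessor=pp)

# Train on the available samples.
sample_list = data_io.get_indiceslist()
model.train(sample_list, epochs=50)
```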


Author(s):  
Kitsuchart Pasupa ◽  
Phongsathorn Kittiworapanya ◽  
Napasin Hongngern ◽  
Kuntpong Woraratpanya

Abstract Evaluation of car damage from an accident is one of the most important processes in the car insurance business. Currently, it still requires a manual examination of every basic part. It is expected that a smart device will be able to do this evaluation more efficiently in the future. In this study, we evaluated and compared five deep learning algorithms for semantic segmentation of car parts. The baseline reference algorithm was Mask R-CNN, and the other algorithms were HTC, CBNet, PANet, and GCNet. Runs of instance segmentation were conducted with those five algorithms. HTC with ResNet-50 was the best algorithm for instance segmentation on various kinds of cars such as sedans, trucks, and SUVs. It achieved a mean average precision of 55.2 on our original data set, which assigned different labels to the left and right sides, and 59.1 when a single label was assigned to both sides. In addition, the models from every algorithm were tested for robustness by running them on images of parts in a real environment with various weather conditions, including snow, frost, and fog, and various lighting conditions. GCNet was the most robust; it achieved a mean performance under corruption, mPC = 35.2, and a relative degradation of performance on corrupted data compared to clean data, rPC = 64.4%, when left and right sides were assigned different labels, and mPC = 38.1 and rPC = 69.6% when left- and right-side parts were considered the same part. The findings from this study may directly benefit developers of automated car damage evaluation systems in their quest for the best design.
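
The robustness metrics mentioned above can be summarised in a few lines: mean performance under corruption (mPC) averages the mAP over corruption types and severity levels, and relative performance under corruption (rPC) is mPC divided by the mAP on clean data. The input values in this sketch are illustrative, not the study's measurements.

```python
# Sketch of mPC and rPC computation from per-corruption mAP scores.
# Numbers below are illustrative placeholders.
import numpy as np

def robustness_metrics(clean_map, corrupted_maps):
    """corrupted_maps: array of shape (n_corruptions, n_severities) of mAP scores."""
    mpc = np.mean(corrupted_maps)   # mean performance under corruption
    rpc = mpc / clean_map           # relative performance vs. clean data
    return mpc, rpc

mpc, rpc = robustness_metrics(clean_map=55.2,
                              corrupted_maps=np.array([[38.0, 35.5, 33.1],
                                                       [40.2, 36.4, 28.0]]))
print(f"mPC = {mpc:.1f}, rPC = {rpc:.1%}")
```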


2018 ◽  
Vol 6 (3) ◽  
pp. 93 ◽  
Author(s):  
Michael O’Byrne ◽  
Vikram Pakrashi ◽  
Franck Schoefs ◽  
Bidisha Ghosh

Recent breakthroughs in the computer vision community have led to the emergence of efficient deep learning techniques for end-to-end segmentation of natural scenes. Underwater imaging stands to gain from these advances; however, deep learning methods require large annotated datasets for model training, and these are typically unavailable for underwater imaging applications. This paper proposes the use of photorealistic synthetic imagery for training deep models that can be applied to interpret real-world underwater imagery. To demonstrate this concept, we look at the specific problem of biofouling detection on marine structures. A contemporary deep encoder–decoder network, termed SegNet, is trained using 2500 annotated synthetic images of size 960 × 540 pixels. The images were rendered in a virtual underwater environment under a wide variety of conditions and feature biofouling of various sizes, shapes, and colours. Each rendered image has a corresponding ground truth per-pixel label map. Once trained on the synthetic imagery, SegNet is applied to segment new real-world images. The initial segmentation is refined using an iterative support vector machine (SVM) based post-processing algorithm. The proposed approach achieves a mean intersection over union (IoU) of 87% and a mean accuracy of 94% when tested on 32 frames extracted from two distinct real-world subsea inspection videos. Inference takes several seconds for a typical image.
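
For reference, the evaluation metric reported above can be computed as follows: per-class intersection over union between the predicted and ground-truth label maps, averaged into a mean IoU. The class count here is an illustrative assumption.

```python
# Sketch of mean IoU computation for a segmentation result.
import numpy as np

def mean_iou(pred, gt, num_classes=2):
    """pred, gt: integer label maps of identical shape."""
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, gt == c).sum()
        union = np.logical_or(pred == c, gt == c).sum()
        if union > 0:                      # skip classes absent from both maps
            ious.append(inter / union)
    return float(np.mean(ious))
```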


Author(s):  
Gang Xue ◽  
Shifeng Liu ◽  
Yicao Ma

Abstract Image recognition supports several applications, for instance facial recognition and image classification, and accurate fruit and vegetable classification is very important in the fresh supply chain, factories, supermarkets, and other fields. In this paper, we develop a hybrid deep learning-based fruit image classification framework, named attention-based densely connected convolutional networks with convolution autoencoder (CAE-ADN), which uses a convolutional autoencoder to pre-train on the images and an attention-based DenseNet to extract image features. In the first part of the framework, an unsupervised method with a set of images is applied to pre-train the greedy layer-wise CAE. We use the CAE structure to initialize a set of weights and biases of the ADN. In the second part of the framework, the supervised ADN with the ground truth is implemented. The final part of the framework predicts the fruit category. We use two fruit data sets to test the effectiveness of the model. Experimental results show the effectiveness of the framework, which can improve the efficiency of fruit sorting and thereby reduce the costs of the fresh supply chain, factories, supermarkets, etc.
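
A minimal PyTorch sketch of the pre-training idea described above: a small convolutional autoencoder is trained without labels, and its encoder weights then initialize the feature extractor of the supervised classifier. Layer sizes and the class count are illustrative assumptions, not the CAE-ADN configuration.

```python
# Sketch of autoencoder pre-training followed by weight reuse in a classifier.
# Architecture details are illustrative assumptions.
import torch.nn as nn

encoder = nn.Sequential(
    nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
)
decoder = nn.Sequential(
    nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
    nn.ConvTranspose2d(32, 3, 4, stride=2, padding=1), nn.Sigmoid(),
)
autoencoder = nn.Sequential(encoder, decoder)
# ... train `autoencoder` on unlabeled fruit images with a reconstruction loss ...

# The encoder learned without labels is reused (shared by reference) as the
# feature extractor; the supervised stage fine-tunes it with ground-truth labels.
classifier = nn.Sequential(encoder, nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                           nn.Linear(64, 10))  # 10 fruit categories (illustrative)
```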


Author(s):  
M. Knott ◽  
R. Groenendijk

Abstract. This research is the first to apply MeshCNN – a deep learning model that is specifically designed for 3D triangular meshes – in the photogrammetry domain. We highlight the challenges that arise when applying a mesh-based deep learning model to a photogrammetric mesh, especially with respect to data set properties. We provide solutions for preparing a remotely sensed mesh for a machine learning task. The most notable pre-processing step proposed is a novel application of the breadth-first search algorithm for chunking a large mesh into computable pieces. Furthermore, this work extends MeshCNN such that photometric features based on the mesh texture are considered in addition to the geometric information. Experiments show that including color information improves the predictive performance of the model by a large margin. Moreover, experimental results indicate that segmentation performance could be advanced substantially with the introduction of a high-quality benchmark for semantic segmentation on meshes.
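
The chunking step can be sketched as follows: starting from a seed face, a breadth-first search over face adjacency collects faces until a size budget is reached, yielding pieces small enough for a mesh-based network. The adjacency representation and size budget are illustrative assumptions, not the authors' implementation.

```python
# Sketch of BFS-based chunking of a large mesh into computable pieces.
from collections import deque

def bfs_chunk(adjacency, seed, available, max_faces=10000):
    """adjacency: dict mapping a face index to the indices of its neighbours."""
    chunk, seen, queue = [], {seed}, deque([seed])
    while queue and len(chunk) < max_faces:
        face = queue.popleft()
        chunk.append(face)
        for nb in adjacency[face]:
            if nb in available and nb not in seen:
                seen.add(nb)
                queue.append(nb)
    return chunk  # face indices forming one computable piece

def chunk_mesh(adjacency, max_faces=10000):
    remaining, chunks = set(adjacency), []
    while remaining:
        chunk = bfs_chunk(adjacency, next(iter(remaining)), remaining, max_faces)
        remaining -= set(chunk)
        chunks.append(chunk)
    return chunks
```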


Diagnostics ◽  
2020 ◽  
Vol 10 (6) ◽  
pp. 430 ◽  
Author(s):  
Michael G. Endres ◽  
Florian Hillen ◽  
Marios Salloumis ◽  
Ahmad R. Sedaghat ◽  
Stefan M. Niehues ◽  
...  

Periapical radiolucencies, which can be detected on panoramic radiographs, are one of the most common radiographic findings in dentistry and have a differential diagnosis including infections, granuloma, cysts, and tumors. In this study, we investigate how well 24 oral and maxillofacial (OMF) surgeons assess the presence of periapical lucencies on panoramic radiographs, and we compare these findings to the performance of a predictive deep learning algorithm that we developed using a curated data set of 2902 de-identified panoramic radiographs. The mean diagnostic positive predictive value (PPV) of OMF surgeons based on their assessment of panoramic radiographic images was 0.69 (±0.13), indicating that dentists on average falsely diagnose 31% of cases as radiolucencies. However, the mean diagnostic true positive rate (TPR) was 0.51 (±0.14), indicating that on average 49% of all radiolucencies were missed. We demonstrate that the deep learning algorithm achieves better performance than 14 of the 24 OMF surgeons in the cohort, exhibiting an average precision of 0.60 (±0.04) and an F1 score of 0.58 (±0.04), corresponding to a PPV of 0.67 (±0.05) and a TPR of 0.51 (±0.05). The algorithm, trained on limited data and evaluated on clinically validated ground truth, has potential to assist OMF surgeons in detecting periapical lucencies on panoramic radiographs.
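
For clarity, the reported detection metrics relate as follows: PPV (precision) and TPR (recall) are computed from true/false positives and false negatives, and F1 is their harmonic mean. The counts in this sketch are illustrative placeholders, not the study's data.

```python
# Sketch of PPV, TPR, and F1 from detection counts. Counts are illustrative.
def detection_metrics(tp, fp, fn):
    ppv = tp / (tp + fp) if tp + fp else 0.0   # positive predictive value (precision)
    tpr = tp / (tp + fn) if tp + fn else 0.0   # true positive rate (recall)
    f1 = 2 * ppv * tpr / (ppv + tpr) if ppv + tpr else 0.0
    return ppv, tpr, f1

ppv, tpr, f1 = detection_metrics(tp=51, fp=25, fn=49)
print(f"PPV={ppv:.2f}, TPR={tpr:.2f}, F1={f1:.2f}")
```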

