Malaria parasite segmentation using U-Net: Comparative study of loss functions

2019, Vol 4 (2), pp. 57-62
Author(s): Julisa Bana Abraham

The convolutional neural network is commonly used for classification, but convolutional networks can also perform semantic segmentation through the fully convolutional network approach. U-Net is one example of a fully convolutional architecture capable of producing accurate segmentations of biomedical images. This paper proposes using U-Net for Plasmodium segmentation on thin blood smear images. The evaluation shows that U-Net can accurately segment Plasmodium on thin blood smear images. In addition, this study compares three loss functions: mean-squared error, binary cross-entropy, and Huber loss. The results show that Huber loss yields the best testing metrics: 0.9297, 0.9715, 0.8957, and 0.9096 for F1 score, positive predictive value (PPV), sensitivity (SE), and relative segmentation accuracy (RSA), respectively.
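
To make the comparison concrete, here is a minimal sketch (not the paper's code) of how the three compared losses could be swapped into a U-Net training step in PyTorch; `model`, the Huber `delta`, and the tensor shapes are assumptions, since the abstract does not specify them.

```python
import torch
import torch.nn as nn

# The three losses compared in the study; the Huber delta is an assumption,
# as the abstract does not state the value used.
losses = {
    "mse": nn.MSELoss(),              # mean-squared error
    "bce": nn.BCELoss(),              # binary cross-entropy
    "huber": nn.HuberLoss(delta=1.0), # Huber loss
}

def training_step(model, optimizer, images, masks, loss_name):
    """One optimization step with the selected loss.

    images: (N, C, H, W) float tensor of thin blood smear patches
    masks:  (N, 1, H, W) float tensor of binary Plasmodium masks
    model is assumed to end in a sigmoid, so predictions lie in [0, 1].
    """
    optimizer.zero_grad()
    preds = model(images)
    loss = losses[loss_name](preds, masks)
    loss.backward()
    optimizer.step()
    return loss.item()
```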

IEEE Access, 2021, Vol 9, pp. 673-682
Author(s): Jian Ji, Xiaocong Lu, Mai Luo, Minghui Yin, Qiguang Miao, ...

2021, Vol 2099 (1), pp. 012021
Author(s): A V Dobshik, A A Tulupov, V B Berikov

This paper presents an automatic algorithm for segmenting areas affected by acute stroke in non-contrast computed tomography brain images. The proposed algorithm is designed for learning in a weakly supervised scenario in which some images are labeled accurately and some are labeled inaccurately. Wrong labels result from inaccuracies introduced by radiologists during manual annotation of the computed tomography images. We propose methods for solving the segmentation problem when training data are inaccurately labeled, using the U-Net neural network architecture with several modifications. Experiments on real computed tomography scans show that the proposed methods increase segmentation accuracy.
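
The abstract does not specify the paper's exact mechanism for handling inaccurate labels, so the sketch below shows one common approach under that caveat: down-weighting the loss contribution of samples flagged as unreliably labeled. The `noisy_weight` value and the per-sample reliability flags are purely illustrative assumptions.

```python
import torch
import torch.nn as nn

# Per-pixel loss without reduction, so samples can be weighted individually.
bce = nn.BCEWithLogitsLoss(reduction="none")

def weighted_loss(logits, masks, reliable, noisy_weight=0.3):
    """logits, masks: (N, 1, H, W); reliable: (N,) bool, True = accurate label.

    Samples with unreliable (inaccurate) masks contribute with a reduced
    weight; noisy_weight is an illustrative value, not from the paper.
    """
    per_pixel = bce(logits, masks)              # (N, 1, H, W)
    per_sample = per_pixel.mean(dim=(1, 2, 3))  # (N,)
    weights = torch.where(reliable,
                          torch.ones_like(per_sample),
                          torch.full_like(per_sample, noisy_weight))
    return (weights * per_sample).mean()
```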


Sensors, 2019, Vol 19 (18), pp. 3873
Author(s): Jong Taek Lee, Eunhee Park, Tae-Du Jung

Videofluoroscopic swallowing study (VFSS) is a standard diagnostic tool for dysphagia. To detect the presence of aspiration during a swallow, a manual search is commonly used to mark the time intervals of the pharyngeal phase on the corresponding VFSS image. In this study, we present a novel approach that uses 3D convolutional networks to detect the pharyngeal phase in raw VFSS videos without manual annotations. For efficient collection of training data, we propose a cascade framework that requires neither annotated time intervals of the swallowing process nor manual marking of anatomical positions. For video classification, we applied the inflated 3D convolutional network (I3D), one of the state-of-the-art networks for action classification, as a baseline architecture, and we present a modified 3D convolutional network architecture derived from it. The classification and detection performance of the two architectures were evaluated for comparison. The experimental results show that the proposed model outperforms the baseline I3D model when both models are trained from randomly initialized weights. We conclude that the proposed method greatly reduces the examination time of VFSS images with a low miss rate.
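
As an illustration of clip-level phase detection (a sketch, not the authors' pipeline), the following slides a fixed-length window over a VFSS video, classifies each clip with a 3D convolutional model, and merges positive windows into time intervals. `clip_model`, the clip length, stride, and threshold are all assumptions; the model is assumed to output a single pharyngeal-phase logit per clip.

```python
import torch

def detect_phase(clip_model, video, clip_len=16, stride=8, threshold=0.5):
    """Slide a window over the video and merge positive clips into intervals.

    video: (T, C, H, W) float tensor of VFSS frames.
    clip_model is assumed to return one pharyngeal-phase logit per clip.
    Returns a list of (start_frame, end_frame) intervals.
    """
    scores = []
    for start in range(0, video.shape[0] - clip_len + 1, stride):
        # 3D conv nets expect input shaped (N, C, T, H, W)
        clip = video[start:start + clip_len].permute(1, 0, 2, 3).unsqueeze(0)
        with torch.no_grad():
            scores.append((start, torch.sigmoid(clip_model(clip)).item()))

    intervals, current = [], None
    for start, p in scores:
        if p >= threshold:
            if current is None:
                current = [start, start + clip_len]
            else:
                current[1] = start + clip_len  # extend the open interval
        elif current is not None:
            intervals.append(tuple(current))
            current = None
    if current is not None:
        intervals.append(tuple(current))
    return intervals
```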


2021, Vol 13 (16), pp. 3211
Author(s): Tian Tian, Zhengquan Chu, Qian Hu, Li Ma

Semantic segmentation is a fundamental task in remote sensing image interpretation that aims to assign a semantic label to every pixel in a given image. Accurate semantic segmentation remains challenging due to the complex distributions of various ground objects. With the development of deep learning, a series of segmentation networks represented by the fully convolutional network (FCN) have made remarkable progress on this problem, but segmentation accuracy is still far from expectations. This paper focuses on the importance of class-specific features of different land cover objects and presents a novel end-to-end class-wise processing framework for segmentation. The proposed class-wise FCN (C-FCN) takes the form of an encoder-decoder structure with skip connections, in which the encoder is shared to produce general features for all categories and the decoder is class-wise to process class-specific features. Specifically, class-wise transition (CT), class-wise up-sampling (CU), class-wise supervision (CS), and class-wise classification (CC) modules are designed to achieve the class-wise transfer, recover the resolution of class-wise feature maps, bridge the encoder and modified decoder, and implement class-wise classifications, respectively. Class-wise and group convolutions are adopted in the architecture to control the number of parameters. The method is tested on the public ISPRS 2D semantic labeling benchmark datasets. Experimental results show that the proposed C-FCN significantly improves segmentation performance compared with many state-of-the-art FCN-based networks, revealing its potential for accurate segmentation of complex remote sensing images.
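
The class-wise idea can be illustrated with group convolutions, which split channels into per-class groups so each category's features are processed independently. The sketch below is an assumption-laden illustration of that principle, not the authors' C-FCN modules; channel counts and layer choices are invented for the example.

```python
import torch
import torch.nn as nn

class ClassWiseDecoderBlock(nn.Module):
    """Illustration of class-wise processing via group convolutions.

    With groups=num_classes, each class owns feats_per_class channels that
    are up-sampled, refined, and classified independently of other classes.
    """
    def __init__(self, num_classes=6, feats_per_class=16):
        super().__init__()
        c = num_classes * feats_per_class
        self.up = nn.ConvTranspose2d(c, c, kernel_size=2, stride=2,
                                     groups=num_classes)  # class-wise up-sampling
        self.conv = nn.Conv2d(c, c, kernel_size=3, padding=1,
                              groups=num_classes)         # class-wise refinement
        self.classify = nn.Conv2d(c, num_classes, kernel_size=1,
                                  groups=num_classes)     # one score map per class

    def forward(self, x):
        x = torch.relu(self.up(x))
        x = torch.relu(self.conv(x))
        return self.classify(x)
```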


2021, Vol 14 (1), pp. 75
Author(s): Stefan Reder, Jan-Peter Mund, Nicole Albert, Lilli Waßermann, Luis Miranda

The increasing number of severe storm events is threatening European forests. Besides the primary damage caused directly by storms, there are secondary damages such as bark beetle outbreaks and tertiary damages due to negative effects on the market. These subsequent damages can be minimized if a detailed overview of the affected area and the amount of damaged wood can be obtained quickly and included in the planning of clearance measures. The present work uses UAV orthophotos and an adaptation of the U-Net architecture for the semantic segmentation and localization of windthrown stems. The network was pre-trained with generic datasets, randomly combining stems and background samples in a copy–paste augmentation, and afterwards trained with a specific dataset of a particular windthrow. The models pre-trained with generic datasets containing 10, 50 and 100 augmentations per annotated windthrown stem achieved F1-scores of 73.9% (S1Mod10), 74.3% (S1Mod50) and 75.6% (S1Mod100), outperforming the baseline model (F1-score 72.6%), which was not pre-trained. These results demonstrate that the method correctly identifies windthrown trees and suggest collecting training samples from other tree species and windthrow areas to improve generalization. Further enhancements of the network architecture are being considered to improve classification performance and minimize computational cost.
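
The copy–paste augmentation can be sketched as pasting annotated stem crops at random positions and rotations onto background tiles, producing image-mask pairs for pre-training. The sketch below is a minimal illustration under assumed data formats (RGBA stem crops with mode-"L" binary masks), not the authors' implementation.

```python
import random
from PIL import Image

def copy_paste(background, stems, n_stems=10):
    """Paste stem crops onto a background tile and build the matching mask.

    background: RGB PIL image.
    stems: list of (rgba_crop, binary_mask) pairs, masks in mode "L" (0/255).
    Returns an augmented image and its segmentation mask.
    """
    canvas = background.copy()
    mask = Image.new("L", background.size, 0)
    for _ in range(n_stems):
        crop, crop_mask = random.choice(stems)
        angle = random.uniform(0, 360)
        crop = crop.rotate(angle, expand=True)
        crop_mask = crop_mask.rotate(angle, expand=True)
        x = random.randint(0, max(0, canvas.width - crop.width))
        y = random.randint(0, max(0, canvas.height - crop.height))
        canvas.paste(crop, (x, y), crop_mask)     # paste stem pixels only
        mask.paste(crop_mask, (x, y), crop_mask)  # update the label mask
    return canvas, mask
```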


Author(s): Shuqiang Wang, Wei Liang, Hongfei Wang, Zhuo Chen, Yiqian Lu

2019, Vol 55 (20), pp. 1088-1090
Author(s): Jian Lu, Tong Liu, Maoxin Luo, Haozhe Cheng, Kaibing Zhang

Author(s): Ping-Rong Chen, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin

Road scene understanding is a critical component of an autonomous driving system. Although deep learning-based road scene segmentation can achieve very high accuracy, its complexity is also very high for developing real-time applications, and it is challenging to design a neural network with both high accuracy and low computational complexity. To address this issue, we investigate the advantages and disadvantages of several popular convolutional neural network (CNN) architectures in terms of speed, storage, and segmentation accuracy. We start from the fully convolutional network with VGG and then study ResNet and DenseNet. Through detailed experiments, we select the favorable components from the existing architectures and, in the end, construct a lightweight network architecture based on DenseNet. Our proposed network, called DSNet, demonstrates real-time inference ability on a popular GPU platform while maintaining accuracy comparable with most previous systems. We test our system on several datasets, including the challenging Cityscapes dataset (resolution of 1024 × 512), achieving a mean Intersection over Union (mIoU) of about 69.1% and a runtime of 0.0147 s/image on a single GTX 1080Ti. We also design a more accurate model at the price of slower speed, which achieves an mIoU of about 72.6% on the CamVid dataset.
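
For reference, the mIoU figures quoted above follow the standard definition: per-class intersection over union, averaged across classes. A short sketch of that metric:

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """pred, target: integer label maps of the same shape."""
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, target == c).sum()
        union = np.logical_or(pred == c, target == c).sum()
        if union > 0:  # skip classes absent from both maps
            ious.append(inter / union)
    return float(np.mean(ious))
```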

