Addressing the Class Imbalance Problem in Medical Image Segmentation via Accelerated Tversky Loss Function

The segmentation of power lines (PLs) from aerial images is a crucial task for the safe navigation of unmanned aerial vehicles (UAVs) operating at low altitudes. Despite the advances in deep learning-based approaches for PL segmentation, these models are still vulnerable to the class imbalance present in the data. The PLs occupy only a minimal portion (1–5%) of the aerial images as compared to the background region (95–99%). Generally, this class imbalance problem is addressed via the use of PL-specific detectors in conjunction with the popular class balanced cross entropy (BBCE) loss function. However, these PL-specific detectors do not work outside their application areas and a BBCE loss requires hyperparameter tuning for class-wise weights, which is not trivial. Moreover, the BBCE loss results in low dice scores and precision values and thus, fails to achieve an optimal trade-off between dice scores, model accuracy, and precision–recall values. In this work, we propose a generalized focal loss function based on the Matthews correlation coefficient (MCC) or the Phi coefficient to address the class imbalance problem in PL segmentation while utilizing a generic deep segmentation architecture. We evaluate our loss function by improving the vanilla U-Net model with an additional convolutional auxiliary classifier head (ACU-Net) for better learning and faster model convergence. The evaluation of two PL datasets, namely the Mendeley Power Line Dataset and the Power Line Dataset of Urban Scenes (PLDU), where PLs occupy around 1% and 2% of the aerial images area, respectively, reveal that our proposed loss function outperforms the popular BBCE loss by 16% in PL dice scores on both the datasets, 19% in precision and false detection rate (FDR) values for the Mendeley PL dataset and 15% in precision and FDR values for the PLDU with a minor degradation in the accuracy and recall values. Moreover, our proposed ACU-Net outperforms the baseline vanilla U-Net for the characteristic evaluation parameters in the range of 1–10% for both the PL datasets. Thus, our proposed loss function with ACU-Net achieves an optimal trade-off for the characteristic evaluation parameters without any bells and whistles. Our code is available at Github.

Download Full-text

A Densely Connected Network Based on U-Net for Medical Image Segmentation

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3446618 ◽

2021 ◽

Vol 17 (3) ◽

pp. 1-14

Author(s):

Zhenzhen Yang ◽

Pengfei Xu ◽

Yongpeng Yang ◽

Bing-Kun Bao

Keyword(s):

Feature Extraction ◽

Image Segmentation ◽

Loss Function ◽

Network Architecture ◽

Medical Image ◽

Medical Image Segmentation ◽

Cross Entropy ◽

Loss Functions ◽

Feature Maps ◽

Different Levels

The U-Net has become the most popular structure in medical image segmentation in recent years. Although its performance for medical image segmentation is outstanding, a large number of experiments demonstrate that the classical U-Net network architecture seems to be insufficient when the size of segmentation targets changes and the imbalance happens between target and background in different forms of segmentation. To improve the U-Net network architecture, we develop a new architecture named densely connected U-Net (DenseUNet) network in this article. The proposed DenseUNet network adopts a dense block to improve the feature extraction capability and employs a multi-feature fuse block fusing feature maps of different levels to increase the accuracy of feature extraction. In addition, in view of the advantages of the cross entropy and the dice loss functions, a new loss function for the DenseUNet network is proposed to deal with the imbalance between target and background. Finally, we test the proposed DenseUNet network and compared it with the multi-resolutional U-Net (MultiResUNet) and the classic U-Net networks on three different datasets. The experimental results show that the DenseUNet network has significantly performances compared with the MultiResUNet and the classic U-Net networks.

Download Full-text

HPS-Net: Multi-Task Network for Medical Image Segmentation with Predictable Performance

Symmetry ◽

10.3390/sym13112107 ◽

2021 ◽

Vol 13 (11) ◽

pp. 2107

Author(s):

Xin Wei ◽

Huan Wan ◽

Fanghua Ye ◽

Weidong Min

Keyword(s):

Image Segmentation ◽

Deep Learning ◽

Loss Function ◽

Network Structure ◽

Medical Diagnosis ◽

Medical Image ◽

Medical Image Segmentation ◽

Training Mode ◽

Cooperative Training

In recent years, medical image segmentation (MIS) has made a huge breakthrough due to the success of deep learning. However, the existing MIS algorithms still suffer from two types of uncertainties: (1) the uncertainty of the plausible segmentation hypotheses and (2) the uncertainty of segmentation performance. These two types of uncertainties affect the effectiveness of the MIS algorithm and then affect the reliability of medical diagnosis. Many studies have been done on the former but ignore the latter. Therefore, we proposed the hierarchical predictable segmentation network (HPS-Net), which consists of a new network structure, a new loss function, and a cooperative training mode. According to our knowledge, HPS-Net is the first network in the MIS area that can generate both the diverse segmentation hypotheses to avoid the uncertainty of the plausible segmentation hypotheses and the measure predictions about these hypotheses to avoid the uncertainty of segmentation performance. Extensive experiments were conducted on the LIDC-IDRI dataset and the ISIC2018 dataset. The results show that HPS-Net has the highest Dice score compared with the benchmark methods, which means it has the best segmentation performance. The results also confirmed that the proposed HPS-Net can effectively predict TNR and TPR.

Download Full-text

Generative synthetic adversarial network for internal bias correction and handling class imbalance problem in medical image diagnosis

Medical Imaging 2020: Computer-Aided Diagnosis ◽

10.1117/12.2551166 ◽

2020 ◽

Cited By ~ 1

Author(s):

Mina Rezaei ◽

Tomoki Uemura ◽

Janne Nappi ◽

Hiroyuki Yoshida ◽

Christoph Lippert ◽

...

Keyword(s):

Bias Correction ◽

Medical Image ◽

Class Imbalance ◽

Class Imbalance Problem ◽

Adversarial Network ◽

Imbalance Problem ◽

Medical Image Diagnosis ◽

Image Diagnosis ◽

Internal Bias

Download Full-text

Improved U-Net Based on Mixed Loss Function for Liver Medical Image Segmentation

Laser & Optoelectronics Progress ◽

10.3788/lop57.221003 ◽

2020 ◽

Vol 57 (22) ◽

pp. 221003

Author(s):

黄泳嘉 Huang Yongjia ◽

史再峰 Shi Zaifeng ◽

王仲琦 Wang Zhongqi ◽

王哲 Wang Zhe

Keyword(s):

Image Segmentation ◽

Loss Function ◽

Medical Image ◽

Medical Image Segmentation

Download Full-text

A Separate 3D Convolutional Neural Network Architecture for 3D Medical Image Semantic Segmentation

Journal of Medical Imaging and Health Informatics ◽

10.1166/jmihi.2019.2797 ◽

2019 ◽

Vol 9 (8) ◽

pp. 1705-1716

Author(s):

Shidu Dong ◽

Zhi Liu ◽

Huaqiu Wang ◽

Yihao Zhang ◽

Shaoguo Cui

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Brain Tumor ◽

Network Architecture ◽

Medical Image ◽

Negative Impact ◽

Class Imbalance ◽

Semantic Segmentation ◽

Medical Image Segmentation ◽

Memory Space

To exploit three-dimensional (3D) context information and improve 3D medical image semantic segmentation, we propose a separate 3D (S3D) convolution neural network (CNN) architecture. First, a two-dimensional (2D) CNN is used to extract the 2D features of each slice in the xy-plane of 3D medical images. Second, one-dimensional (1D) features reassembled from the 2D features in the z-axis are input into a 1D-CNN and are then classified feature-wise. Analysis shows that S3D-CNN has lower time complexity, fewer parameters and less memory space requirements than other 3D-CNNs with a similar structure. As an example, we extend the deep convolutional encoder–decoder architecture (SegNet) to S3D-SegNet for brain tumor image segmentation. We also propose a method based on priority queues and the dice loss function to address the class imbalance for medical image segmentation. The experimental results show the following: (1) S3D-SegNet extended from SegNet can improve brain tumor image segmentation. (2) The proposed imbalance accommodation method can increase the speed of training convergence and reduce the negative impact of the imbalance. (3) S3D-SegNet with the proposed imbalance accommodation method offers performance comparable to that of some state-of-the-art 3D-CNNs and experts in brain tumor image segmentation.

Download Full-text