SEEK: A Framework of Superpixel Learning with CNN Features for Unsupervised Segmentation

Electronics ◽  
2020 ◽  
Vol 9 (3) ◽  
pp. 383 ◽  
Author(s):  
Talha Ilyas ◽  
Abbas Khan ◽  
Muhammad Umraiz ◽  
Hyongsuk Kim

Supervised semantic segmentation algorithms have been a hot area of exploration recently, but attention is now being drawn towards completely unsupervised semantic segmentation. In an unsupervised framework, neither the targets nor the ground truth labels are provided to the network. In other words, the network is unaware of any class instance or object present in the given data sample. We therefore propose a convolutional neural network (CNN) based architecture for unsupervised segmentation. We use the squeeze-and-excitation network, due to its peculiar ability to capture feature interdependencies, which increases the network’s sensitivity to more salient features. We iteratively enable our CNN architecture to learn the target generated by a graph-based segmentation method, while simultaneously preventing the network from falling into the pit of over-segmentation. Along with this CNN architecture, image enhancement and refinement techniques are exploited to improve the segmentation results. Our proposed algorithm produces improved segmented regions that approach human-level segmentation results. In addition, we evaluate our approach using different metrics to demonstrate its quantitative advantage.
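A minimal PyTorch sketch of a squeeze-and-excitation block of the kind the abstract describes; the channel count and reduction ratio are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal squeeze-and-excitation (SE) block sketch in PyTorch.
# Channel count and reduction ratio are illustrative assumptions.
import torch
import torch.nn as nn

class SEBlock(nn.Module):
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)          # squeeze: global spatial average
        self.fc = nn.Sequential(                     # excitation: channel-wise gating
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.pool(x).view(b, c)                  # (B, C)
        w = self.fc(w).view(b, c, 1, 1)              # per-channel weights in [0, 1]
        return x * w                                 # reweight the feature maps

if __name__ == "__main__":
    feats = torch.randn(2, 64, 32, 32)
    print(SEBlock(64)(feats).shape)                  # torch.Size([2, 64, 32, 32])
```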

2020 ◽  
Vol 47 (4) ◽  
pp. 0410001
Author(s):  
张爱武 Zhang Aiwu ◽  
刘路路 Liu Lulu ◽  
张希珍 Zhang Xizhen

2020 ◽  
Vol 23 (1) ◽  
Author(s):  
Alejandra Márquez Herrera ◽  
Alex J. Cuadros-Vargas ◽  
Helio Pedrini

A neural network is a mathematical model that is able to perform a task automatically or semi-automatically after learning from the human knowledge we provide. A Convolutional Neural Network (CNN) is a type of neural network that has been shown to learn efficiently in tasks related to image analysis, such as image segmentation, whose main purpose is to find regions or separable objects within an image. A more specific type of segmentation, called semantic segmentation, guarantees that each region has a semantic meaning by giving it a label or class. Since CNNs can automate the task of image semantic segmentation, they have been very useful in the medical area, where they are applied to the segmentation of organs or abnormalities (tumors). This work aims to improve the task of binary semantic segmentation of volumetric medical images acquired by Magnetic Resonance Imaging (MRI) using a pre-existing Three-Dimensional Convolutional Neural Network (3D CNN) architecture. We propose a formulation of a loss function for training this 3D CNN that improves pixel-wise segmentation results. This loss function is formulated by adapting a similarity coefficient, used for measuring the spatial overlap between the prediction and ground truth, and then using it to train the network. As a contribution, the developed approach achieved good performance in a context where the pixel classes are imbalanced. We show how the choice of the loss function for training can affect the final quality of the segmentation. We validate our proposal on two medical image semantic segmentation datasets and compare the proposed loss function with other pre-existing loss functions used for binary semantic segmentation.
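A sketch of a soft Dice loss adapted from an overlap-based similarity coefficient, in the spirit of the loss the abstract describes; the exact formulation used by the authors may differ, and the tensor shapes here are assumptions.

```python
# Sketch of a soft Dice loss built from the Dice similarity coefficient,
# suitable for imbalanced binary segmentation. Shapes are illustrative.
import torch

def soft_dice_loss(pred: torch.Tensor, target: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """pred: sigmoid probabilities, target: binary mask; both shaped (B, 1, D, H, W)."""
    dims = tuple(range(1, pred.dim()))               # sum over all non-batch dims
    intersection = (pred * target).sum(dim=dims)
    denom = pred.sum(dim=dims) + target.sum(dim=dims)
    dice = (2.0 * intersection + eps) / (denom + eps)
    return 1.0 - dice.mean()                         # minimize (1 - Dice)

if __name__ == "__main__":
    p = torch.rand(2, 1, 8, 64, 64)                  # predicted probabilities
    t = (torch.rand(2, 1, 8, 64, 64) > 0.5).float()  # binary ground truth
    print(soft_dice_loss(p, t).item())
```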


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Qiang Zuo ◽  
Songyu Chen ◽  
Zhifang Wang

In recent years, semantic segmentation methods based on deep learning have provided advanced performance in medical image segmentation. As one of the typical segmentation networks, U-Net has been successfully applied to multimodal medical image segmentation. A recurrent residual convolutional neural network with attention gate connections (R2AU-Net), based on U-Net, is proposed in this paper. It enhances the capability of integrating contextual information by replacing the basic convolutional units in U-Net with recurrent residual convolutional units. Furthermore, R2AU-Net adopts attention gates instead of the original skip connections. In this paper, the experiments are performed on three multimodal datasets: ISIC 2018, DRIVE, and the public dataset used in LUNA and the Kaggle Data Science Bowl 2017. Experimental results show that R2AU-Net achieves much better performance than other improved U-Net algorithms for multimodal medical image segmentation.
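A minimal sketch of a recurrent residual convolutional unit of the kind R2AU-Net substitutes for U-Net's basic convolutions; channel sizes and the number of recurrent steps are illustrative assumptions, not the authors' exact design.

```python
# Sketch of a recurrent residual convolutional block (R2 block) in PyTorch.
# The recurrence depth t and channel widths are illustrative assumptions.
import torch
import torch.nn as nn

class RecurrentConv(nn.Module):
    """Apply the same convolution t times, each time re-adding the block input."""
    def __init__(self, channels: int, t: int = 2):
        super().__init__()
        self.t = t
        self.conv = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        out = self.conv(x)
        for _ in range(self.t):
            out = self.conv(x + out)                 # recurrent refinement
        return out

class R2Block(nn.Module):
    """Two stacked recurrent convolutions with a 1x1 residual shortcut."""
    def __init__(self, in_ch: int, out_ch: int, t: int = 2):
        super().__init__()
        self.shortcut = nn.Conv2d(in_ch, out_ch, 1)
        self.body = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 1),
            RecurrentConv(out_ch, t),
            RecurrentConv(out_ch, t),
        )

    def forward(self, x):
        return self.body(x) + self.shortcut(x)       # residual connection

if __name__ == "__main__":
    print(R2Block(32, 64)(torch.randn(1, 32, 128, 128)).shape)
```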


Author(s):  
Liang Kim Meng ◽  
Azira Khalil ◽  
Muhamad Hanif Ahmad Nizar ◽  
Maryam Kamarun Nisham ◽  
Belinda Pingguan-Murphy ◽  
...  

Background: Bone Age Assessment (BAA) refers to a clinical procedure that aims to identify a discrepancy between the biological and chronological age of an individual by assessing bone growth. Currently, there are two main methods of performing BAA, known as the Greulich-Pyle and Tanner-Whitehouse techniques. Both involve a manual and qualitative assessment of hand and wrist radiographs, resulting in intra- and inter-operator variability and a time-consuming process. Automatic segmentation can be applied to the radiographs, providing the physician with a more accurate delineation of the carpal bones and accurate quantitative analysis. Methods: In this study, we propose an image feature extraction technique based on image segmentation with a fully convolutional network with a stride of eight pixels (FCN-8). A total of 290 radiographic images, including both female and male subjects aged 0 to 18, were manually segmented and used to train the FCN-8. Results and Conclusion: The results exhibit a high training accuracy of 99.68% and a loss of 0.008619 after 50 epochs of training. The experiments compared 58 images against gold-standard ground truth images. The accuracy of our fully automated segmentation technique is 0.78 ± 0.06, 1.56 ± 0.30 mm, and 98.02% in terms of Dice Coefficient, Hausdorff Distance, and overall qualitative carpal recognition accuracy, respectively.
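A sketch of the two evaluation metrics reported above, Dice coefficient and Hausdorff distance, computed between binary segmentation masks; note the Hausdorff value here is in pixel units, whereas the study reports millimetres, so a pixel-spacing conversion would be an additional assumption.

```python
# Sketch of Dice coefficient and symmetric Hausdorff distance for binary masks.
# Uses SciPy's directed Hausdorff on the foreground pixel coordinates.
import numpy as np
from scipy.spatial.distance import directed_hausdorff

def dice_coefficient(pred: np.ndarray, gt: np.ndarray) -> float:
    pred, gt = pred.astype(bool), gt.astype(bool)
    inter = np.logical_and(pred, gt).sum()
    return 2.0 * inter / (pred.sum() + gt.sum() + 1e-8)

def hausdorff_distance(pred: np.ndarray, gt: np.ndarray) -> float:
    """Symmetric Hausdorff distance between foreground pixel sets (in pixels)."""
    p = np.argwhere(pred)                            # foreground coordinates
    g = np.argwhere(gt)
    return max(directed_hausdorff(p, g)[0], directed_hausdorff(g, p)[0])

if __name__ == "__main__":
    a = np.zeros((64, 64), dtype=np.uint8); a[10:30, 10:30] = 1
    b = np.zeros((64, 64), dtype=np.uint8); b[12:32, 12:32] = 1
    print(dice_coefficient(a, b), hausdorff_distance(a, b))
```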


2021 ◽  
Vol 18 (1) ◽  
pp. 172988142199332
Author(s):  
Xintao Ding ◽  
Boquan Li ◽  
Jinbao Wang

Indoor object detection is a demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, can be helpful for detection. In this article, we focus on a region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric properties into Faster R-CNN to improve detection performance. In detail, we first use mesh grids formed by the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, where the 2D constraint of each class is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are conducted on two indoor datasets, SUN2012 and NYUv2. Since depth information is available in NYUv2, we add depth constraints to GP-Faster and propose a 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster improve mean average precision.
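A sketch of the 2D geometric-constraint idea described above: build a convex hull over the (width, height) pairs of a class's ground-truth boxes, then keep only RoIs whose (width, height) falls inside that hull. The helper names and the point-in-hull test via a Delaunay triangulation are illustrative assumptions, not the authors' implementation.

```python
# Sketch: filter region proposals by a convex hull over ground-truth box sizes.
import numpy as np
from scipy.spatial import Delaunay

def build_size_hull(gt_boxes: np.ndarray) -> Delaunay:
    """gt_boxes: (N, 4) array of [x1, y1, x2, y2] for one class."""
    wh = np.stack([gt_boxes[:, 2] - gt_boxes[:, 0],
                   gt_boxes[:, 3] - gt_boxes[:, 1]], axis=1)
    return Delaunay(wh)                              # triangulation used for point-in-hull tests

def filter_rois(rois: np.ndarray, hull: Delaunay) -> np.ndarray:
    wh = np.stack([rois[:, 2] - rois[:, 0], rois[:, 3] - rois[:, 1]], axis=1)
    inside = hull.find_simplex(wh) >= 0              # -1 means outside the hull
    return rois[inside]

if __name__ == "__main__":
    gt = np.array([[0, 0, 50, 40], [0, 0, 80, 60], [0, 0, 30, 70], [0, 0, 90, 20]], float)
    rois = np.array([[5, 5, 60, 50], [0, 0, 300, 300]], float)
    print(filter_rois(rois, build_size_hull(gt)))    # keeps only the plausibly sized RoI
```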


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Peter M. Maloca ◽  
Philipp L. Müller ◽  
Aaron Y. Lee ◽  
Adnan Tufail ◽  
Konstantinos Balaskas ◽  
...  

Machine learning has greatly facilitated the analysis of medical data, while its internal operations usually remain opaque. To better comprehend these opaque procedures, a convolutional neural network for optical coherence tomography image segmentation was enhanced with a Traceable Relevance Explainability (T-REX) technique. The proposed application was based on three components: ground truth generation by multiple graders, calculation of Hamming distances among graders and the machine learning algorithm, and a smart data visualization (‘neural recording’). An overall average variability of 1.75% between the human graders and the algorithm was found, slightly lower than the 2.02% variability among human graders. The ambiguity in the ground truth had a noteworthy impact on machine learning results, which could be visualized. The convolutional neural network balanced between graders and allowed for modifiable predictions dependent on the compartment. Using the proposed T-REX setup, machine learning processes could be rendered more transparent and understandable, possibly leading to optimized applications.
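A sketch of the Hamming-distance comparison used to quantify agreement between graders and the algorithm: the fraction of pixels whose labels disagree between two segmentation masks, computed pairwise. The normalization to a percentage is an assumption made here for readability.

```python
# Sketch: pairwise Hamming distances between segmentation masks as a
# percentage of disagreeing pixels.
import numpy as np

def hamming_distance(mask_a: np.ndarray, mask_b: np.ndarray) -> float:
    """Fraction of differing pixels between two label masks of equal shape."""
    return float(np.mean(mask_a != mask_b))

def pairwise_variability(masks: list) -> np.ndarray:
    """Pairwise Hamming distances among several graders' masks (as %)."""
    n = len(masks)
    d = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            d[i, j] = d[j, i] = 100.0 * hamming_distance(masks[i], masks[j])
    return d

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    graders = [rng.integers(0, 3, size=(128, 128)) for _ in range(3)]
    print(pairwise_variability(graders))
```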


2021 ◽  
Vol 7 (2) ◽  
pp. 37
Author(s):  
Isah Charles Saidu ◽  
Lehel Csató

We present a sample-efficient image segmentation method using active learning, which we call Active Bayesian UNet, or AB-UNet. This is a convolutional neural network using batch normalization and max-pool dropout. The Bayesian setup is achieved by exploiting the probabilistic extension of the dropout mechanism, making it possible to use the uncertainty inherently present in the system. We set up our experiments on various medical image datasets and highlight that, with a smaller annotation effort, our AB-UNet leads to stable training and better generalization. In addition, our approach can efficiently select informative samples from an unlabelled dataset.
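A minimal sketch of the acquisition step in Monte Carlo dropout active learning of the kind AB-UNet builds on: keep dropout sampling at inference, average several stochastic forward passes, and rank unlabelled images by predictive entropy. The toy model, the number of passes T, and the entropy acquisition function are illustrative assumptions, not the AB-UNet configuration.

```python
# Sketch: MC-dropout uncertainty scoring for active-learning sample selection.
import torch
import torch.nn as nn

def enable_mc_dropout(model: nn.Module) -> None:
    """Keep dropout layers sampling at test time."""
    for m in model.modules():
        if isinstance(m, (nn.Dropout, nn.Dropout2d)):
            m.train()

@torch.no_grad()
def predictive_entropy(model: nn.Module, x: torch.Tensor, T: int = 20) -> torch.Tensor:
    """Mean per-image entropy of the MC-averaged foreground probability."""
    model.eval()
    enable_mc_dropout(model)
    probs = torch.stack([torch.sigmoid(model(x)) for _ in range(T)]).mean(0)
    ent = -(probs * probs.clamp_min(1e-8).log()
            + (1 - probs) * (1 - probs).clamp_min(1e-8).log())
    return ent.flatten(1).mean(1)                    # one uncertainty score per image

if __name__ == "__main__":
    toy = nn.Sequential(nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(),
                        nn.Dropout2d(0.5), nn.Conv2d(8, 1, 3, padding=1))
    pool = torch.randn(4, 1, 64, 64)                 # unlabelled candidate images
    scores = predictive_entropy(toy, pool)
    print(scores.argsort(descending=True))           # most uncertain images first
```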


2020 ◽  
Vol 1682 ◽  
pp. 012077
Author(s):  
Tingting Li ◽  
Chunshan Jiang ◽  
Zhenqi Bian ◽  
Mingchang Wang ◽  
Xuefeng Niu
