Fully convolutional networks semantic segmentation based on conditional random field optimization

Each pixel can be classified in the image by the semantic segmentation. The segmentation detection results of pixel level can be got which are similar to the contour of the target object. However, the results of semantic segmentation trained by Fully convolutional networks often lead to loss of detail information. This paper proposes a CRF-FCN model based on CRF optimization. Firstly, the original image is detected based on feature pyramid networks, and the target area information is extracted, which is used to train the high-order potential function of CRF. Then, the high-order CRF is used as the back-end of the complete convolution network to optimize the semantic image segmentation. The algorithm comparison experiment shows that our algorithm makes the target details more obvious, and improves the accuracy and efficiency of semantic segmentation.

Download Full-text

BUILDING SEGMENTATION FROM AIRBORNE VHR IMAGES USING MASK R-CNN

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-2-w13-155-2019 ◽

2019 ◽

Vol XLII-2/W13 ◽

pp. 155-161 ◽

Cited By ~ 1

Author(s):

K. Zhou ◽

Y. Chen ◽

I. Smal ◽

R. Lindenbergh

Keyword(s):

Model Building ◽

Critical Factor ◽

Semantic Segmentation ◽

Convolutional Networks ◽

Fully Convolutional Networks ◽

Building Models ◽

Single Feature ◽

Complex Scenes ◽

Feature Pyramid

<p><strong>Abstract.</strong> Up-to-date 3D building models are important for many applications. Airborne very high resolution (VHR) images often acquired annually give an opportunity to create an up-to-date 3D model. Building segmentation is often the first and utmost step. Convolutional neural networks (CNNs) draw lots of attention in interpreting VHR images as they can learn very effective features for very complex scenes. This paper employs Mask R-CNN to address two problems in building segmentation: detecting different scales of building and segmenting buildings to have accurately segmented edges. Mask R-CNN starts from feature pyramid network (FPN) to create different scales of semantically rich features. FPN is integrated with region proposal network (RPN) to generate objects with various scales with the corresponding optimal scale of features. The features with high and low levels of information are further used for better object classification of small objects and for mask prediction of edges. The method is tested on ISPRS benchmark dataset by comparing results with the fully convolutional networks (FCN), which merge high and low level features by a skip-layer to create a single feature for semantic segmentation. The results show that Mask R-CNN outperforms FCN with around 15% in detecting objects, especially in detecting small objects. Moreover, Mask R-CNN has much better results in edge region than FCN. The results also show that choosing the range of anchor scales in Mask R-CNN is a critical factor in segmenting different scale of objects. This paper provides an insight into how a good anchor scale for different dataset should be chosen.</p>

Download Full-text

Automatic Deep Learning Semantic Segmentation of Ultrasound Thyroid Cineclips using Recurrent Fully Convolutional Networks

IEEE Access ◽

10.1109/access.2020.3045906 ◽

2020 ◽

pp. 1-1

Author(s):

Jeremy M. Webb ◽

Duane D. Meixner ◽

Shaheeda A. Adusei ◽

Eric C. Polley ◽

Mostafa Fatemi ◽

...

Keyword(s):

Deep Learning ◽

Semantic Segmentation ◽

Convolutional Networks ◽

Fully Convolutional Networks

Download Full-text

A Hybrid Semantic Segmentation Based on Level-Set Evolution Driven by Fully Convolutional Networks

IEEE Access ◽

10.1109/access.2021.3066515 ◽

2021 ◽

Vol 9 ◽

pp. 42556-42567

Author(s):

Meng Wang ◽

Yi Ma ◽

Fan Li ◽

Zhengbing Guo

Keyword(s):

Level Set ◽

Semantic Segmentation ◽

Convolutional Networks ◽

Fully Convolutional Networks ◽

Level Set Evolution

Download Full-text

Domain Adaptation for Semantic Segmentation of Historical Panchromatic Orthomosaics in Central Africa

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10080523 ◽

2021 ◽

Vol 10 (8) ◽

pp. 523

Author(s):

Nicholus Mboga ◽

Stefano D’Aronco ◽

Tais Grippa ◽

Charlotte Pelletier ◽

Stefanos Georganos ◽

...

Keyword(s):

Land Cover ◽

Domain Adaptation ◽

Central Africa ◽

Semantic Segmentation ◽

Target Domain ◽

Convolutional Networks ◽

Fully Convolutional Networks ◽

The Cost ◽

Performance Gains

Multitemporal environmental and urban studies are essential to guide policy making to ultimately improve human wellbeing in the Global South. Land-cover products derived from historical aerial orthomosaics acquired decades ago can provide important evidence to inform long-term studies. To reduce the manual labelling effort by human experts and to scale to large, meaningful regions, we investigate in this study how domain adaptation techniques and deep learning can help to efficiently map land cover in Central Africa. We propose and evaluate a methodology that is based on unsupervised adaptation to reduce the cost of generating reference data for several cities and across different dates. We present the first application of domain adaptation based on fully convolutional networks for semantic segmentation of a dataset of historical panchromatic orthomosaics for land-cover generation for two focus cities Goma-Gisenyi and Bukavu. Our experimental evaluation shows that the domain adaptation methods can reach an overall accuracy between 60% and 70% for different regions. If we add a small amount of labelled data from the target domain, too, further performance gains can be achieved.

Download Full-text

Region-Based CNN Method with Deformable Modules for Visually Classifying Concrete Cracks

Applied Sciences ◽

10.3390/app10072528 ◽

2020 ◽

Vol 10 (7) ◽

pp. 2528 ◽

Cited By ~ 5

Author(s):

Lu Deng ◽

Hong-Hu Chu ◽

Peng Shi ◽

Wei Wang ◽

Xuan Kong

Keyword(s):

Crack Detection ◽

Detection Methods ◽

Convolutional Networks ◽

Fully Convolutional Networks ◽

Out Of Plane ◽

Intelligent Detection ◽

New Type ◽

Feature Pyramid ◽

Crack Detector ◽

Core Idea

Cracks are often the most intuitive indicators for assessing the condition of in-service structures. Intelligent detection methods based on regular convolutional neural networks (CNNs) have been widely applied to the field of crack detection in recently years; however, these methods exhibit unsatisfying performance on the detection of out-of-plane cracks. To overcome this drawback, a new type of region-based CNN (R-CNN) crack detector with deformable modules is proposed in the present study. The core idea of the method is to replace the traditional regular convolution and pooling operation with a deformable convolution operation and a deformable pooling operation. The idea is implemented on three different regular detectors, namely the Faster R-CNN, region-based fully convolutional networks (R-FCN), and feature pyramid network (FPN)-based Faster R-CNN. To examine the advantages of the proposed method, the results obtained from the proposed detector and corresponding regular detectors are compared. The results show that the addition of deformable modules improves the mean average precisions (mAPs) achieved by the Faster R-CNN, R-FCN, and FPN-based Faster R-CNN for crack detection. More importantly, adding deformable modules enables these detectors to detect the out-of-plane cracks that are difficult for regular detectors to detect.

Download Full-text

Dual Path Attention Net for Remote Sensing Semantic Image Segmentation

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9100571 ◽

2020 ◽

Vol 9 (10) ◽

pp. 571

Author(s):

Jinglun Li ◽

Jiapeng Xiu ◽

Zhengqiu Yang ◽

Chen Liu

Keyword(s):

Remote Sensing ◽

Spatial Information ◽

Semantic Segmentation ◽

Remote Sensing Images ◽

Convolutional Networks ◽

Fully Convolutional Networks ◽

Augmentation Strategies ◽

Rich Information ◽

The Mean ◽

The Rich

Semantic segmentation plays an important role in being able to understand the content of remote sensing images. In recent years, deep learning methods based on Fully Convolutional Networks (FCNs) have proved to be effective for the sematic segmentation of remote sensing images. However, the rich information and complex content makes the training of networks for segmentation challenging, and the datasets are necessarily constrained. In this paper, we propose a Convolutional Neural Network (CNN) model called Dual Path Attention Network (DPA-Net) that has a simple modular structure and can be added to any segmentation model to enhance its ability to learn features. Two types of attention module are appended to the segmentation model, one focusing on spatial information the other focusing upon the channel. Then, the outputs of these two attention modules are fused to further improve the network’s ability to extract features, thus contributing to more precise segmentation results. Finally, data pre-processing and augmentation strategies are used to compensate for the small number of datasets and uneven distribution. The proposed network was tested on the Gaofen Image Dataset (GID). The results show that the network outperformed U-Net, PSP-Net, and DeepLab V3+ in terms of the mean IoU by 0.84%, 2.54%, and 1.32%, respectively.

Download Full-text

Pooling-Free Fully Convolutional Networks with Dense Skip Connections for Semantic Segmentation, with Application to Brain Tumor Segmentation

Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries - Lecture Notes in Computer Science ◽

10.1007/978-3-319-75238-9_15 ◽

2018 ◽

pp. 169-177 ◽

Cited By ~ 3

Author(s):

Richard McKinley ◽

Alain Jungo ◽

Roland Wiest ◽

Mauricio Reyes

Keyword(s):

Brain Tumor ◽

Semantic Segmentation ◽

Tumor Segmentation ◽

Brain Tumor Segmentation ◽

Convolutional Networks ◽

Fully Convolutional Networks

Download Full-text

Fully Convolutional Networks for Semantic Segmentation of Very High Resolution Remotely Sensed Images Combined With DSM

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2018.2795531 ◽

2018 ◽

Vol 15 (3) ◽

pp. 474-478 ◽

Cited By ~ 51

Author(s):

Weiwei Sun ◽

Ruisheng Wang

Keyword(s):

High Resolution ◽

Semantic Segmentation ◽

Remotely Sensed ◽

Convolutional Networks ◽

Fully Convolutional Networks ◽

Remotely Sensed Images ◽

Very High

Download Full-text

A Novel Object-Based Deep Learning Framework for Semantic Segmentation of Very High-Resolution Remote Sensing Data: Comparison with Convolutional and Fully Convolutional Networks

Remote Sensing ◽

10.3390/rs11060684 ◽

2019 ◽

Vol 11 (6) ◽

pp. 684 ◽

Cited By ~ 17

Author(s):

Maria Papadomanolaki ◽

Maria Vakalopoulou ◽

Konstantinos Karantzalos

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Semantic Segmentation ◽

Novel Object ◽

Convolutional Networks ◽

Learning Framework ◽

Fully Convolutional Networks ◽

Object Based ◽

Deep Networks ◽

Very High

Deep learning architectures have received much attention in recent years demonstrating state-of-the-art performance in several segmentation, classification and other computer vision tasks. Most of these deep networks are based on either convolutional or fully convolutional architectures. In this paper, we propose a novel object-based deep-learning framework for semantic segmentation in very high-resolution satellite data. In particular, we exploit object-based priors integrated into a fully convolutional neural network by incorporating an anisotropic diffusion data preprocessing step and an additional loss term during the training process. Under this constrained framework, the goal is to enforce pixels that belong to the same object to be classified at the same semantic category. We compared thoroughly the novel object-based framework with the currently dominating convolutional and fully convolutional deep networks. In particular, numerous experiments were conducted on the publicly available ISPRS WGII/4 benchmark datasets, namely Vaihingen and Potsdam, for validation and inter-comparison based on a variety of metrics. Quantitatively, experimental results indicate that, overall, the proposed object-based framework slightly outperformed the current state-of-the-art fully convolutional networks by more than 1% in terms of overall accuracy, while intersection over union results are improved for all semantic categories. Qualitatively, man-made classes with more strict geometry such as buildings were the ones that benefit most from our method, especially along object boundaries, highlighting the great potential of the developed approach.

Download Full-text

A Benchmark for Endoluminal Scene Segmentation of Colonoscopy Images

Journal of Healthcare Engineering ◽

10.1155/2017/4037190 ◽

2017 ◽

Vol 2017 ◽

pp. 1-9 ◽

Cited By ~ 32

Author(s):

David Vázquez ◽

Jorge Bernal ◽

F. Javier Sánchez ◽

Gloria Fernández-Esparrach ◽

Antonio M. López ◽

...

Keyword(s):

Screening Tool ◽

Semantic Segmentation ◽

Visual Assessment ◽

Scene Segmentation ◽

Convolutional Networks ◽

The Third ◽

Fully Convolutional Networks ◽

Regular Screening ◽

Training Standard ◽

Related Mortality

Colorectal cancer (CRC) is the third cause of cancer death worldwide. Currently, the standard approach to reduce CRC-related mortality is to perform regular screening in search for polyps and colonoscopy is the screening tool of choice. The main limitations of this screening procedure are polyp miss rate and the inability to perform visual assessment of polyp malignancy. These drawbacks can be reduced by designing decision support systems (DSS) aiming to help clinicians in the different stages of the procedure by providing endoluminal scene segmentation. Thus, in this paper, we introduce an extended benchmark of colonoscopy image segmentation, with the hope of establishing a new strong benchmark for colonoscopy image analysis research. The proposed dataset consists of 4 relevant classes to inspect the endoluminal scene, targeting different clinical needs. Together with the dataset and taking advantage of advances in semantic segmentation literature, we provide new baselines by training standard fully convolutional networks (FCNs). We perform a comparative study to show that FCNs significantly outperform, without any further postprocessing, prior results in endoluminal scene segmentation, especially with respect to polyp segmentation and localization.

Download Full-text