Semantic Segmentation of Surgical Instruments based on Enhanced Multi-scale Receptive Field

Semantic segmentation of surgical instruments plays a critical role in computer-assisted surgery. However, specular reflection and scale variation of instruments are common in the surgical environment and alter the visual features of instruments, such as color and shape, which makes semantic segmentation more challenging. In this paper, a novel network, the Pyramid Attention Aggregation Network, is proposed to aggregate multi-scale attentive features for surgical instruments. It contains two critical modules: the Double Attention Module and the Pyramid Upsampling Module. Specifically, the Double Attention Module includes two attention blocks (i.e., a position attention block and a channel attention block), which model semantic dependencies between positions and channels by capturing joint semantic information and global contexts, respectively. The attentive features generated by the Double Attention Module can distinguish target regions, which helps address the specular reflection issue. Moreover, the Pyramid Upsampling Module extracts local details and global contexts by aggregating multi-scale attentive features. It learns the shape and size features of surgical instruments in different receptive fields and thus addresses the scale variation issue. The proposed network achieves state-of-the-art performance on various datasets. It achieves a new record of 97.10% mean IoU on Cata7 and ranks first in the MICCAI EndoVis Challenge 2017 with a 9.90% increase in mean IoU.
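The abstract reports its results as mean IoU (intersection over union). As a minimal sketch of how that metric is commonly computed, assuming flat lists of per-pixel class labels, the following averages the per-class IoU. The function name `mean_iou` and the choice to skip classes absent from both prediction and ground truth are illustrative assumptions, not the paper's evaluation code.

```python
def mean_iou(pred, target, num_classes):
    """Average per-class intersection-over-union for two flat label lists.

    Assumption: classes that appear in neither list are skipped rather
    than counted as IoU 0 or 1.
    """
    ious = []
    for c in range(num_classes):
        # Pixels where both prediction and ground truth are class c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either prediction or ground truth is class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: 6 pixels, 3 classes (0 = background, 1 and 2 = instruments).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
```

Here the per-class IoUs are 1/3, 2/3 and 1/2, so `score` is 0.5.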

An Inverse Node Graph-Based Method for the Urban Scene Segmentation of 3D Point Clouds

Remote Sensing ◽ 10.3390/rs13153021 ◽ 2021 ◽ Vol 13 (15) ◽ pp. 3021

Author(s): Bufan Zhao ◽ Xianghong Hua ◽ Kegen Yu ◽ Xiaoxing He ◽ Weixing Xue ◽ ...

Keyword(s): Semantic Segmentation ◽ Point Clouds ◽ Intelligent Vehicles ◽ Critical Data ◽ Multi Scale ◽ 3D Point Clouds ◽ Cluster Optimization ◽ Urban Scene ◽ Processing Steps

Urban object segmentation and classification tasks are critical data processing steps in scene understanding, intelligent vehicles and 3D high-precision maps. Semantic segmentation of 3D point clouds is the foundational step in object recognition. To identify intersecting objects and improve classification accuracy, this paper proposes a segment-based classification method for 3D point clouds. The method first divides points into multi-scale supervoxels and groups them using the proposed inverse node graph (IN-Graph) construction, which does not require prior information about the nodes; instead, it divides supervoxels by judging the connection state of the edges between them. The method reaches minimum global energy by graph cutting, obtains structural segments that are as complete as possible, and retains boundaries at the same time. Then, a random forest classifier is used for supervised classification. To deal with the mislabeling of scattered fragments, a higher-order CRF with small-label cluster optimization is proposed to refine the classification results. Experiments were carried out on a mobile laser scan (MLS) point dataset and a terrestrial laser scan (TLS) point dataset, and the results show that overall accuracies of 97.57% and 96.39% were obtained on the two datasets. The boundaries of objects were retained well, and the method achieved a good result in the classification of cars and motorcycles. Further experimental analyses verified the advantages of the proposed method and demonstrated its practicability and versatility.
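The idea of grouping supervoxels by the connection state of the edges between them can be illustrated with a much simpler stand-in. The sketch below assumes each edge already carries a boolean "connected" decision and merges supervoxels with union-find; it does not implement the IN-Graph construction or the graph-cut energy minimization described in the abstract, and the name `group_supervoxels` and the edge format are hypothetical.

```python
def group_supervoxels(num_nodes, edges):
    """Group supervoxel ids into segments from per-edge connection states.

    edges: list of (i, j, connected) tuples; only edges with connected=True
    merge their endpoints. This is a union-find simplification, not the
    paper's graph-cut optimization.
    """
    parent = list(range(num_nodes))

    def find(x):
        # Follow parent links to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j, connected in edges:
        if connected:
            root_i, root_j = find(i), find(j)
            if root_i != root_j:
                parent[root_j] = root_i

    groups = {}
    for node in range(num_nodes):
        groups.setdefault(find(node), []).append(node)
    return sorted(groups.values())


# Five supervoxels in a chain; the edge between 1 and 2 is judged disconnected.
edges = [(0, 1, True), (1, 2, False), (2, 3, True), (3, 4, True)]
segments = group_supervoxels(5, edges)
```

With these toy edges, `segments` is `[[0, 1], [2, 3, 4]]`: the disconnected edge acts as the boundary between two objects.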

Adaptive Effective Receptive Field Convolution for Semantic Segmentation of VHR Remote Sensing Images

IEEE Transactions on Geoscience and Remote Sensing ◽ 10.1109/tgrs.2020.3009143 ◽ 2020 ◽ pp. 1-15

Author(s): Xi Chen ◽ Zhiqiang Li ◽ Jie Jiang ◽ Zhen Han ◽ Shiyi Deng ◽ ...

Keyword(s): Remote Sensing ◽ Receptive Field ◽ Semantic Segmentation ◽ Remote Sensing Images

MLFNet-Point Cloud Semantic Segmentation Convolution Network Based on Multi-scale Feature Fusion

IEEE Access ◽ 10.1109/access.2021.3057612 ◽ 2021 ◽ pp. 1-1

Author(s): Jingfang Yang ◽ Bochang Zou ◽ Huadong Qiu ◽ Zhi Li

Keyword(s): Point Cloud ◽ Feature Fusion ◽ Semantic Segmentation ◽ Scale Feature ◽ Multi Scale

Real-time Semantic Segmentation Based on Multi-scale Feature Map Joint Pyramid Upsamping

10.1109/aeeca52519.2021.9574190 ◽ 2021

Author(s): Liang Chao ◽ Wang Xiaoyu ◽ Song Yu ◽ Jiang Changhong

Keyword(s): Real Time ◽ Semantic Segmentation ◽ Feature Map ◽ Scale Feature ◽ Multi Scale

Simultaneous Segmentation of Fetal Hearts and Lungs for Medical Ultrasound Images via an Efficient Multi-scale Model Integrated With Attention Mechanism

Ultrasonic Imaging ◽ 10.1177/01617346211042526 ◽ 2021 ◽ pp. 016173462110425

Author(s): Jianing Xi ◽ Jiangang Chen ◽ Zhao Wang ◽ Dean Ta ◽ Bing Lu ◽ ...

Keyword(s): Congenital Anomaly ◽ Large Scale ◽ Automatic Segmentation ◽ Receptive Fields ◽ Semantic Segmentation ◽ Attention Mechanism ◽ Scale Model ◽ Ultrasound Images ◽ Multi Scale ◽ Task Irrelevant

Large-scale early scanning of fetuses via ultrasound imaging is widely used to reduce the morbidity or mortality caused by congenital anomalies in fetal hearts and lungs. To reduce the intensive cost of manually recognizing organ regions, many automatic segmentation methods have been proposed. However, existing methods still face three problems that hinder accurate segmentation: a multi-scale problem arising from the wide range of receptive fields of organs in images, a resolution problem in the segmentation mask, and interference from task-irrelevant features. To achieve semantic segmentation that (1) extracts multi-scale features from images, (2) compensates for high-resolution information, and (3) eliminates task-irrelevant features, we propose a multi-scale model that integrates a skip-connection framework and an attention mechanism. The multi-scale feature extraction modules are combined with additive attention gate units for irrelevant feature elimination, within a U-Net framework with skip connections for information compensation. Results on fetal heart and lung segmentation indicate the superiority of our method over existing deep learning-based approaches. Our method also shows competitive performance stability in semantic segmentation, suggesting a promising contribution to ultrasound-based prognosis of congenital anomalies for early intervention and to alleviating the negative effects of congenital anomalies.
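To make the additive attention gate concrete, here is a minimal sketch for a single spatial position, assuming the commonly used form alpha = sigmoid(psi · relu(W_x x + W_g g)) with the skip feature x scaled by alpha. The weight names `W_x`, `W_g` and `psi`, the omission of bias terms, and the toy values are assumptions for illustration; the abstract does not specify the authors' exact formulation.

```python
import math


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def attention_gate(x, g, W_x, W_g, psi):
    """Additive attention gate at one position (biases omitted).

    x: skip-connection feature vector, g: gating feature vector.
    Returns the attention coefficient alpha in (0, 1) and the gated feature.
    """
    # Project both inputs, add them, and apply ReLU.
    q = [max(0.0, a + b) for a, b in zip(matvec(W_x, x), matvec(W_g, g))]
    # Collapse to a scalar score and squash with a sigmoid.
    score = sum(p * qi for p, qi in zip(psi, q))
    alpha = 1.0 / (1.0 + math.exp(-score))
    return alpha, [alpha * xi for xi in x]


identity = [[1.0, 0.0], [0.0, 1.0]]
alpha, gated = attention_gate(
    x=[1.0, 2.0], g=[1.0, -5.0], W_x=identity, W_g=identity, psi=[1.0, 1.0]
)
```

A low `alpha` suppresses the skip feature at that position, which is how a gate of this kind can down-weight task-irrelevant regions before they reach the decoder.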

Multi-scale Semantic Segmentation Enriched Features for Pedestrian Detection

2018 24th International Conference on Pattern Recognition (ICPR) ◽ 10.1109/icpr.2018.8545414 ◽ 2018

Author(s): Xiaolu Xie ◽ Zengfu Wang

Keyword(s): Pedestrian Detection ◽ Semantic Segmentation ◽ Multi Scale

Multi-scale Context Intertwining for Semantic Segmentation

Computer Vision – ECCV 2018 - Lecture Notes in Computer Science ◽ 10.1007/978-3-030-01219-9_37 ◽ 2018 ◽ pp. 622-638 ◽ Cited By ~ 19

Author(s): Di Lin ◽ Yuanfeng Ji ◽ Dani Lischinski ◽ Daniel Cohen-Or ◽ Hui Huang

Keyword(s): Semantic Segmentation ◽ Multi Scale

Multi-scale Spatial Location Preference for Semantic Segmentation

MultiMedia Modeling - Lecture Notes in Computer Science ◽ 10.1007/978-3-030-37731-1_48 ◽ 2019 ◽ pp. 593-604

Author(s): Qiuyuan Han ◽ Jin Zheng

Keyword(s): Semantic Segmentation ◽ Spatial Location ◽ Multi Scale

Semantic Segmentation Network for Surface Defect Detection of Automobile Wheel Hub Fusing High-Resolution Feature and Multi-Scale Feature

Applied Sciences ◽ 10.3390/app112210508 ◽ 2021 ◽ Vol 11 (22) ◽ pp. 10508

Author(s): Chaowei Tang ◽ Xinxin Feng ◽ Haotian Wen ◽ Xu Zhou ◽ Yanqing Shao ◽ ...

Keyword(s): High Resolution ◽ Defect Detection ◽ Automobile Industry ◽ Surface Defect ◽ Semantic Segmentation ◽ The Body ◽ Multi Scale ◽ Surface Defect Detection ◽ Edge Features ◽ Automobile Wheel

Surface defect detection of an automobile wheel hub is important to the automobile industry because these defects directly affect the safety and appearance of automobiles. At present, surface defect detection networks based on convolutional neural networks use many pooling layers when extracting features, which reduces the spatial resolution of the features and prevents accurate detection of defect boundaries. On the basis of DeepLab v3+, we propose a semantic segmentation network for the surface defect detection of an automobile wheel hub. To solve the gridding effect of atrous convolution, the high-resolution network (HRNet) is used as the backbone network to extract high-resolution features, and the multi-scale features extracted by the Atrous Spatial Pyramid Pooling (ASPP) of DeepLab v3+ are superimposed. On the basis of optical flow, we decouple the body and edge features of the defects to accurately detect defect boundaries. Furthermore, in the upsampling process, a decoder can accurately obtain detection results by fusing the body, edge, and multi-scale features. We use supervised training to optimize these features. Experimental results on four defect datasets (i.e., wheels, magnetic tiles, fabrics, and welds) show that the proposed network achieves a better F1 score, average precision, and intersection over union than SegNet, Unet, and DeepLab v3+, indicating that the proposed network is effective for different defect detection scenarios.
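Atrous (dilated) convolution and the gridding effect mentioned above can be seen in a one-dimensional toy case. The sketch below assumes a "valid" convolution without padding; `atrous_conv1d` is an illustrative helper, not part of DeepLab v3+ or HRNet. With a dilation rate of 2, each output reads every other input sample, so neighbouring inputs are skipped within a single output, which is the source of the gridding pattern.

```python
def atrous_conv1d(signal, kernel, rate):
    """1D atrous convolution (cross-correlation form), no padding.

    Kernel taps are spaced `rate` samples apart, so a kernel of size k
    covers (k - 1) * rate + 1 input samples.
    """
    k = len(kernel)
    span = (k - 1) * rate + 1
    out = []
    for start in range(len(signal) - span + 1):
        # Tap i reads the input at offset i * rate from the window start.
        out.append(sum(kernel[i] * signal[start + i * rate] for i in range(k)))
    return out


signal = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 1, 1]

dense = atrous_conv1d(signal, kernel, rate=1)   # each output sees 3 adjacent samples
sparse = atrous_conv1d(signal, kernel, rate=2)  # each output sees samples 2 apart
```

Here `dense` is `[6, 9, 12, 15, 18]` and `sparse` is `[9, 12, 15]`: the rate-2 kernel spans five samples with the same three weights. ASPP-style modules run several such rates in parallel to collect multi-scale context.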
