Positive-Aware Lesion Detection Network with Cross-scale Feature Pyramid for OCT Images

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Sensors ◽

10.3390/s21051820 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1820

Author(s):

Xiaotao Shao ◽

Qing Wang ◽

Wei Yang ◽

Yun Chen ◽

Yi Xie ◽

...

Keyword(s):

Semantic Information ◽

Detection System ◽

Pedestrian Detection ◽

Detection Accuracy ◽

The Public ◽

Scale Feature ◽

Detection Algorithms ◽

Multi Scale ◽

Art Works ◽

Feature Pyramid

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.

Download Full-text

MM-FPN: Multi-path and Multi-scale Feature Pyramid Network for Object Detection

10.1109/isceic53685.2021.00072 ◽

2021 ◽

Author(s):

Sheng Dong ◽

Jiaxin Zhang ◽

Zehui Qu

Keyword(s):

Object Detection ◽

Scale Feature ◽

Multi Scale ◽

Feature Pyramid

Download Full-text

Image segmentation of nasopharyngeal carcinoma using 3D CNN with long-range skip connection and multi-scale feature pyramid

Soft Computing ◽

10.1007/s00500-020-04708-y ◽

2020 ◽

Vol 24 (16) ◽

pp. 12671-12680

Author(s):

Feng Guo ◽

Canghong Shi ◽

Xiaojie Li ◽

Xi Wu ◽

Jiliu Zhou ◽

...

Keyword(s):

Image Segmentation ◽

Nasopharyngeal Carcinoma ◽

Long Range ◽

Scale Feature ◽

Multi Scale ◽

Feature Pyramid ◽

3D Cnn

Download Full-text

Automated prostate lesion detection and PI-RADS assessment with deep learning.

Journal of Clinical Oncology ◽

10.1200/jco.2019.37.15_suppl.e16605 ◽

2019 ◽

Vol 37 (15_suppl) ◽

pp. e16605-e16605

Author(s):

Choongheon Yoon ◽

Jasper Van ◽

Michelle Bardis ◽

Param Bhatter ◽

Alexander Ushinsky ◽

...

Keyword(s):

Neural Network ◽

Prostate Cancer ◽

Neural Networks ◽

Deep Learning ◽

Lesion Detection ◽

Dice Coefficient ◽

Cohen’S Kappa ◽

Transrectal Biopsy ◽

Cohen's Kappa ◽

Feature Pyramid

e16605 Background: Prostate Cancer is the most commonly diagnosed male cancer in the U.S. Multiparametric magnetic resonance imaging (mpMRI) is increasingly used for both prostate cancer evaluation and biopsy guidance. The PI-RADS v2 scoring paradigm was developed to stratify prostate lesions on MRI and to predict lesion grade. Prostate organ and lesion segmentation is an essential step in pre-biopsy surgical planning. Deep learning convolutional neural networks (CNN) for image recognition are becoming a more common method of machine learning. In this study, we develop a comprehensive deep learning pipeline of 3D/2D CNN based on U-Net architecture for automatic localization and segmentation of prostates, detection of prostate lesions and PI-RADS v2 lesion scoring of mpMRIs. Methods: This IRB approved retrospective review included a total of 303 prostate nodules from 217 patients who had a prostate mpMRI between September 2014 and December 2016 and an MR-guided transrectal biopsy. For each T2 weighted image, a board-certified abdominal radiologist manually segmented the prostate and each prostate lesion. The T2 weighted and ADC series were co-registered and each lesion was assigned an overall PI-RADS score, T2 weighted PI-RADS score, and ADC PI-RADS score. After a U-Net neural network segmented the prostate organ, a mask regional convolutional neural network (R-CNN) was applied. The mask R-CNN is composed of three neural networks: feature pyramid network, region proposal network, and head network. The mask R-CNN detected the prostate lesion, segmented it, and estimated its PI-RADS score. Instead, the mask R-CNN was implemented to regress along dimensions of the PI-RADS criteria. The mask R-CNN performance was assessed with AUC, Sørensen–Dice coefficient, and Cohen’s Kappa for PI-RADS scoring agreement. Results: The AUC for prostate nodule detection was 0.79. By varying detection thresholds, sensitivity/PPV were 0.94/.54 and 0.60/0.87 at either ends of the spectrum. For detected nodules, the segmentation Sørensen–Dice coefficient was 0.76 (0.72 – 0.80). Weighted Cohen’s Kappa for PI-RADS scoring agreement was 0.63, 0.71, and 0.51 for composite, T2 weighted, and ADC respectively. Conclusions: These results demonstrate the feasibility of implementing a comprehensive 3D/2D CNN-based deep learning pipeline for evaluation of prostate mpMRI. This method is highly accurate for organ segmentation. The results for lesion detection and categorization are modest; however, the PI-RADS v2 score accuracy is comparable to previously published human interobserver agreement.

Download Full-text

CSFF-Net: Scene Text Detection Based on Cross-Scale Feature Fusion

Information ◽

10.3390/info12120524 ◽

2021 ◽

Vol 12 (12) ◽

pp. 524

Author(s):

Yuan Li ◽

Mayire Ibrayim ◽

Askar Hamdulla

Keyword(s):

Information Flow ◽

Feature Fusion ◽

Feature Learning ◽

Text Detection ◽

Cross Layer ◽

Scale Feature ◽

Scene Text Detection ◽

Scene Text ◽

Feature Pyramid ◽

Feature Information

In the last years, methods for detecting text in real scenes have made significant progress with an increase in neural networks. However, due to the limitation of the receptive field of the central nervous system and the simple representation of text by using rectangular bounding boxes, the previous methods may be insufficient for working with more challenging instances of text. To solve this problem, this paper proposes a scene text detection network based on cross-scale feature fusion (CSFF-Net). The framework is based on the lightweight backbone network Resnet, and the feature learning is enhanced by embedding the depth weighted convolution module (DWCM) while retaining the original feature information extracted by CNN. At the same time, the 3D-Attention module is also introduced to merge the context information of adjacent areas, so as to refine the features in each spatial size. In addition, because the Feature Pyramid Network (FPN) cannot completely solve the interdependence problem by simple element-wise addition to process cross-layer information flow, this paper introduces a Cross-Level Feature Fusion Module (CLFFM) based on FPN, which is called Cross-Level Feature Pyramid Network (Cross-Level FPN). The proposed CLFFM can better handle cross-layer information flow and output detailed feature information, thus improving the accuracy of text region detection. Compared to the original network framework, the framework provides a more advanced performance in detecting text images of complex scenes, and extensive experiments on three challenging datasets validate the realizability of our approach.

Download Full-text

Geospatial Object Detection on High Resolution Remote Sensing Imagery Based on Double Multi-Scale Feature Pyramid Network

Remote Sensing ◽

10.3390/rs11070755 ◽

2019 ◽

Vol 11 (7) ◽

pp. 755 ◽

Cited By ~ 20

Author(s):

Xiaodong Zhang ◽

Kun Zhu ◽

Guanzhou Chen ◽

Xiaoliang Tan ◽

Lifei Zhang ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Object Detection ◽

Large Scale ◽

Training Data ◽

Validation Dataset ◽

Remote Sensing Imagery ◽

Scale Feature ◽

Multi Scale ◽

Feature Pyramid

Object detection on very-high-resolution (VHR) remote sensing imagery has attracted a lot of attention in the field of image automatic interpretation. Region-based convolutional neural networks (CNNs) have been vastly promoted in this domain, which first generate candidate regions and then accurately classify and locate the objects existing in these regions. However, the overlarge images, the complex image backgrounds and the uneven size and quantity distribution of training samples make the detection tasks more challenging, especially for small and dense objects. To solve these problems, an effective region-based VHR remote sensing imagery object detection framework named Double Multi-scale Feature Pyramid Network (DM-FPN) was proposed in this paper, which utilizes inherent multi-scale pyramidal features and combines the strong-semantic, low-resolution features and the weak-semantic, high-resolution features simultaneously. DM-FPN consists of a multi-scale region proposal network and a multi-scale object detection network, these two modules share convolutional layers and can be trained end-to-end. We proposed several multi-scale training strategies to increase the diversity of training data and overcome the size restrictions of the input images. We also proposed multi-scale inference and adaptive categorical non-maximum suppression (ACNMS) strategies to promote detection performance, especially for small and dense objects. Extensive experiments and comprehensive evaluations on large-scale DOTA dataset demonstrate the effectiveness of the proposed framework, which achieves mean average precision (mAP) value of 0.7927 on validation dataset and the best mAP value of 0.793 on testing dataset.

Download Full-text

Automatic CT Lesion Detection Based on Feature Pyramid Inference with Multi-scale Response

Lecture Notes in Computer Science - Artificial Intelligence and Security ◽

10.1007/978-3-030-78609-0_15 ◽

2021 ◽

pp. 167-179

Author(s):

Yangyang Tang ◽

Zhe Liu ◽

Yuqing Song ◽

Kai Han ◽

Jun Su ◽

...

Keyword(s):

Lesion Detection ◽

Multi Scale ◽

Scale Response ◽

Feature Pyramid

Download Full-text

FPB: Improving Multi-Scale Feature Representation Inside Convolutional Layer Via Feature Pyramid Block

2020 IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip40778.2020.9190755 ◽

2020 ◽

Author(s):

Zheng Cao ◽

Kailai Zhang ◽

Ji Wu

Keyword(s):

Feature Representation ◽

Scale Feature ◽

Multi Scale ◽

Feature Pyramid

Download Full-text

MGFPN: Enhancing multi-scale feature for object detection

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-202372 ◽

2021 ◽

pp. 1-11

Author(s):

Weiming He ◽

You Wu ◽

Jing Xiao ◽

Yang Cao

Keyword(s):

Object Detection ◽

Spatial Information ◽

Mixed Group ◽

Scale Feature ◽

Multi Scale ◽

Pyramid Structure ◽

Feature Pyramid ◽

New Feature ◽

Design Defects ◽

Feature Utilization

Feature pyramids are commonly applied to solve the scale variation problem for object detection. One of the most representative works of feature pyramid is Feature Pyramid Network (FPN), which is simple and efficient. However, the fully power of multi-scale features might not be completely exploited in FPN due to its design defects. In this paper, we first analyze the structure problems of FPN which prevent the multi-scale feature from being fully exploited, then propose a new feature pyramid structure named Mixed Group FPN (MGFPN), to mitigate these design defects of FPN. Concretely, MGFPN strengthens the feature utilization by two modules named Mixed Group Convolution(MGConv) and Contextual Attention(CA). MGConv reduces the spatial information loss of FPN in feature generation stage. And CA narrows the semantic gaps between features of different receptive field before lateral summation. By replacing FPN with MGFPN in FCOS, our method can improve the performance of detectors in many major backbones by 0.7 to 1.2 Average Precision(AP) on MS-COCO benchmark without adding too much parameters and it is easy to be extended to other FPN-based models. The proposed MGFPN can serve as a simple and strong alternative for many other FPN based models.

Download Full-text