spatial pyramid pooling
Recently Published Documents


TOTAL DOCUMENTS

134
(FIVE YEARS 106)

H-INDEX

10
(FIVE YEARS 5)

Entropy ◽  
2022 ◽  
Vol 24 (1) ◽  
pp. 118
Author(s):  
Yu Sun ◽  
Rongrong Ni ◽  
Yao Zhao

Up to now, most of the forensics methods have attached more attention to natural content images. To expand the application of image forensics technology, forgery detection for certificate images that can directly represent people’s rights and interests is investigated in this paper. Variable tampered region scales and diverse manipulation types are two typical characteristics in fake certificate images. To tackle this task, a novel method called Multi-level Feature Attention Network (MFAN) is proposed. MFAN is built following the encoder–decoder network structure. In order to extract features with rich scale information in the encoder, on the one hand, we employ Atrous Spatial Pyramid Pooling (ASPP) on the final layer of a pre-trained residual network to capture the contextual information at different scales; on the other hand, low-level features are concatenated to ensure the sensibility to small targets. Furthermore, the resulting multi-level features are recalibrated on channels for irrelevant information suppression and enhancing the tampered regions, guiding the MFAN to adapt to diverse manipulation traces. In the decoder module, the attentive feature maps are convoluted and unsampled to effectively generate the prediction mask. Experimental results indicate that the proposed method outperforms some state-of-the-art forensics methods.


2022 ◽  
pp. 1-1
Author(s):  
Yu Qiu ◽  
Yun Liu ◽  
Yanan Chen ◽  
Jianwen Zhang ◽  
Jinchao Zhu ◽  
...  

2022 ◽  
Vol 2148 (1) ◽  
pp. 012013
Author(s):  
Zhong Xiang ◽  
Yujia Shen ◽  
Zhitao Cheng ◽  
Miao Ma ◽  
Feng Lin

Abstract Printed fabric patterns contain multiple repeat pattern primitives, which have a significant impact on fabric pattern design in the textile industry. The pattern primitive is often composed of multiple elements, such as color, form, and texture structure. Therefore, the more pattern elements it contains, the more complex the primitive is. In order to segment fabric primitives, this paper proposes a novel convolutional neural network (CNN) method with spatial pyramid pooling module as a feature extractor, which enables to learn the pattern feature information and determine whether the printed fabric has periodic pattern primitives. Furthermore, by choosing pair of activation peaks in a filter, a set of displacement vectors can be calculated. The activation peaks that are most accordant with the optimum displacement vector contribute to pick out the final size of primitives. The results show that the method with the powerful feature extraction capabilities of the CNN can segment the periodic pattern primitives of complex printed fabrics. Compared with the traditional algorithm, the proposed method has higher segmentation accuracy and adaptability.


Agronomy ◽  
2021 ◽  
Vol 11 (12) ◽  
pp. 2440
Author(s):  
Faris A. Kateb ◽  
Muhammad Mostafa Monowar ◽  
Md. Abdul Hamid ◽  
Abu Quwsar Ohi ◽  
Muhammad Firoz Mridha

Computer vision is currently experiencing success in various domains due to the harnessing of deep learning strategies. In the case of precision agriculture, computer vision is being investigated for detecting fruits from orchards. However, such strategies limit too-high complexity computation that is impossible to embed in an automated device. Nevertheless, most investigation of fruit detection is limited to a single fruit, resulting in the necessity of a one-to-many object detection system. This paper introduces a generic detection mechanism named FruitDet, designed to be prominent for detecting fruits. The FruitDet architecture is designed on the YOLO pipeline and achieves better performance in detecting fruits than any other detection model. The backbone of the detection model is implemented using DenseNet architecture. Further, the FruitDet is packed with newer concepts: attentive pooling, bottleneck spatial pyramid pooling, and blackout mechanism. The detection mechanism is benchmarked using five datasets, which combines a total of eight different fruit classes. The FruitDet architecture acquires better performance than any other recognized detection methods in fruit detection.


Author(s):  
Leijian Yu ◽  
Erfu Yang ◽  
Cai Luo ◽  
Peng Ren

AbstractCorrosion has been concerned as a serious safety issue for metallic facilities. Visual inspection carried out by an engineer is expensive, subjective and time-consuming. Micro Aerial Vehicles (MAVs) equipped with detection algorithms have the potential to perform safer and much more efficient visual inspection tasks than engineers. Towards corrosion detection algorithms, convolution neural networks (CNNs) have enabled the power for high accuracy metallic corrosion detection. However, these detectors are restricted by MAVs on-board capabilities. In this study, based on You Only Look Once v3-tiny (Yolov3-tiny), an accurate deep learning-based metallic corrosion detector (AMCD) is proposed for MAVs on-board metallic corrosion detection. Specifically, a backbone with depthwise separable convolution (DSConv) layers is designed to realise efficient corrosion detection. The convolutional block attention module (CBAM), three-scale object detection and focal loss are incorporated to improve the detection accuracy. Moreover, the spatial pyramid pooling (SPP) module is improved to fuse local features for further improvement of detection accuracy. A field inspection image dataset labelled with four types of corrosions (the nubby corrosion, bar corrosion, exfoliation and fastener corrosion) is utilised for training and testing the AMCD. Test results show that the AMCD achieves 84.96% mean average precision (mAP), which outperforms other state-of-the-art detectors. Meanwhile, 20.18 frames per second (FPS) is achieved leveraging NVIDIA Jetson TX2, the most popular MAVs on-board computer, and the model size is only 6.1 MB.


Sign in / Sign up

Export Citation Format

Share Document