Flower Species Recognition System Combining Object Detection and Attention Mechanism

For unmanned aerial vehicle (UAV), object detection at different scales is an important component for the visual recognition. Recent advances in convolutional neural networks (CNNs) have demonstrated that attention mechanism remarkably enhances multiscale representation of CNNs. However, most existing multiscale feature representation methods simply employ several attention blocks in the attention mechanism to adaptively recalibrate the feature response, which overlooks the context information at a multiscale level. To solve this problem, a multiscale feature filtering network (MFFNet) is proposed in this paper for image recognition system in the UAV. A novel building block, namely, multiscale feature filtering (MFF) module, is proposed for ResNet-like backbones and it allows feature-selective learning for multiscale context information across multiparallel branches. These branches employ multiple atrous convolutions at different scales, respectively, and further adaptively generate channel-wise feature responses by emphasizing channel-wise dependencies. Experimental results on CIFAR100 and Tiny ImageNet datasets reflect that the MFFNet achieves very competitive results in comparison with previous baseline models. Further ablation experiments verify that the MFFNet can achieve consistent performance gains in image classification and object detection tasks.

Download Full-text

Design of Desktop Audiovisual Entertainment System with Deep Learning and Haptic Sensations

Symmetry ◽

10.3390/sym12101718 ◽

2020 ◽

Vol 12 (10) ◽

pp. 1718

Author(s):

Chien-Hsing Chou ◽

Yu-Sheng Su ◽

Che-Ju Hsu ◽

Kong-Chang Lee ◽

Ping-Hsuan Han

Keyword(s):

Deep Learning ◽

Object Detection ◽

User Experience ◽

Recognition System ◽

Scene Recognition ◽

Single Shot ◽

Auditory Signals ◽

Hot Weather ◽

Viewing Experience ◽

At Home

In this study, we designed a four-dimensional (4D) audiovisual entertainment system called Sense. This system comprises a scene recognition system and hardware modules that provide haptic sensations for users when they watch movies and animations at home. In the scene recognition system, we used Google Cloud Vision to detect common scene elements in a video, such as fire, explosions, wind, and rain, and further determine whether the scene depicts hot weather, rain, or snow. Additionally, for animated videos, we applied deep learning with a single shot multibox detector to detect whether the animated video contained scenes of fire-related objects. The hardware module was designed to provide six types of haptic sensations set as line-symmetry to provide a better user experience. After the system considers the results of object detection via the scene recognition system, the system generates corresponding haptic sensations. The system integrates deep learning, auditory signals, and haptic sensations to provide an enhanced viewing experience.

Download Full-text

YOLOv4 Object Detection Algorithm with Efficient Channel Attention Mechanism

2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE) ◽

10.1109/icmcce51767.2020.00387 ◽

2020 ◽

Author(s):

Cui Gao ◽

Qiang Cai ◽

Shaofeng Ming

Keyword(s):

Object Detection ◽

Detection Algorithm ◽

Attention Mechanism

Download Full-text

An Object Detection Algorithm Based on Attention Mechanism and Lightweight Network (AMLN)

Proceedings of the 2020 the 4th International Conference on Innovation in Artificial Intelligence ◽

10.1145/3390557.3394297 ◽

2020 ◽

Author(s):

Xuemei Yuan ◽

Hanming Huang ◽

Zhengfeng Jiang ◽

Simin Xue

Keyword(s):

Object Detection ◽

Detection Algorithm ◽

Attention Mechanism

Download Full-text

Research on 3D Object Detection Method Based on Convolutional Attention Mechanism

Journal of Physics Conference Series ◽

10.1088/1742-6596/1848/1/012097 ◽

2021 ◽

Vol 1848 (1) ◽

pp. 012097

Author(s):

Zhang Yong ◽

Zhang Xiaoxia ◽

Da Nana

Keyword(s):

Object Detection ◽

Detection Method ◽

Attention Mechanism ◽

3D Object ◽

3D Object Detection

Download Full-text

Research on CNN for Anti-missile Object Detection Algorithm Based on Improved Attention Mechanism

10.23919/ccc52363.2021.9550491 ◽

2021 ◽

Author(s):

Tainian Song ◽

Weiwei Qin ◽

Zhuo Liang ◽

Qingqiang Qin ◽

Gang Liu

Keyword(s):

Object Detection ◽

Detection Algorithm ◽

Attention Mechanism

Download Full-text

The Design of Single Moving Object Detection and Recognition System Based on OpenCV

2018 IEEE International Conference on Mechatronics and Automation (ICMA) ◽

10.1109/icma.2018.8484437 ◽

2018 ◽

Cited By ~ 1

Author(s):

Lijun Yu ◽

Weijie Sun ◽

Hui Wang ◽

Qiang Wang ◽

Chaoda Liu

Keyword(s):

Object Detection ◽

Recognition System ◽

Moving Object Detection ◽

Moving Object ◽

Detection And Recognition

Download Full-text

A Novel Idea for Designing a Speech Recognition System Using Computer Vision Object Detection Techniques

Computational Methods and Data Engineering - Advances in Intelligent Systems and Computing ◽

10.1007/978-981-15-7907-3_28 ◽

2020 ◽

pp. 375-381

Author(s):

Sukrobjon Toshpulotov ◽

Sarvar Saidov ◽

Selvanayaki Kolandapalayam Shanmugam ◽

J. Shyamala Devi ◽

K. Ramkumar

Keyword(s):

Computer Vision ◽

Speech Recognition ◽

Object Detection ◽

Recognition System ◽

Speech Recognition System ◽

Detection Techniques

Download Full-text

Ensemble Based Plant Species Recognition System Using Fusion of Hog and Kaze Approach

Communications in Computer and Information Science - Futuristic Trends in Network and Communication Technologies ◽

10.1007/978-981-16-1480-4_48 ◽

2021 ◽

pp. 536-545

Author(s):

Sandeep Rathor

Keyword(s):

Plant Species ◽

Species Recognition ◽

Recognition System

Download Full-text

Object Detection Algorithm Based on Multiheaded Attention

Applied Sciences ◽

10.3390/app9091829 ◽

2019 ◽

Vol 9 (9) ◽

pp. 1829 ◽

Cited By ~ 1

Author(s):

Jie Jiang ◽

Hui Xu ◽

Shichang Zhang ◽

Yujie Fang

Keyword(s):

Object Detection ◽

Linear Interpolation ◽

Detection Algorithm ◽

Attention Mechanism ◽

Visual Object ◽

Single Shot ◽

Object Class ◽

Feature Information ◽

Base Network ◽

Detector Model

This study proposes a multiheaded object detection algorithm referred to as MANet. The main purpose of the study is to integrate feature layers of different scales based on the attention mechanism and to enhance contextual connections. To achieve this, we first replaced the feed-forward base network of the single-shot detector with the ResNet–101 (inspired by the Deconvolutional Single-Shot Detector) and then applied linear interpolation and the attention mechanism. The information of the feature layers at different scales was fused to improve the accuracy of target detection. The primary contributions of this study are the propositions of (a) a fusion attention mechanism, and (b) a multiheaded attention fusion method. Our final MANet detector model effectively unifies the feature information among the feature layers at different scales, thus enabling it to detect objects with different sizes and with higher precision. We used the 512 × 512 input MANet (the backbone is ResNet–101) to obtain a mean accuracy of 82.7% based on the PASCAL visual object class 2007 test. These results demonstrated that our proposed method yielded better accuracy than those provided by the conventional Single-shot detector (SSD) and other advanced detectors.

Download Full-text