Flower End-to-End Detection Based on YOLOv4 Using a Mobile Device

In this paper, a novel flower detection application anchor-based method is proposed, which is combined with an attention mechanism to detect the flowers in a smart garden in AIoT more accurately and fast. While many researchers have paid much attention to the flower classification in existing studies, the issue of flower detection has been largely overlooked. The problem we have outlined deals largely with the study of a new design and application of flower detection. Firstly, a new end-to-end flower detection anchor-based method is inserted into the architecture of the network to make it more precious and fast and the loss function and attention mechanism are introduced into our model to suppress unimportant features. Secondly, our flower detection algorithms can be integrated into the mobile device. It is revealed that our flower detection method is very considerable through a series of investigations carried out. The detection accuracy of our method is similar to that of the state-of-the-art, and the detection speed is faster at the same time. It makes a major contribution to flower detection in computer vision.

Download Full-text

A Fast Orientation Invariant Detector Based on the One-stage Method

MATEC Web of Conferences ◽

10.1051/matecconf/201823204036 ◽

2018 ◽

Vol 232 ◽

pp. 04036

Author(s):

Jun Yin ◽

Huadong Pan ◽

Hui Su ◽

Zhonggeng Liu ◽

Zhirong Peng

Keyword(s):

Object Detection ◽

Loss Function ◽

High Efficiency ◽

Detection Method ◽

State Of The Art ◽

Orientation Angle ◽

Detection Methods ◽

Detection Algorithms ◽

Bounding Boxes ◽

The One

We propose an object detection method that predicts the orientation bounding boxes (OBB) to estimate objects locations, scales and orientations based on YOLO (You Only Look Once), which is one of the top detection algorithms performing well both in accuracy and speed. Horizontal bounding boxes(HBB), which are not robust to orientation variances, are used in the existing object detection methods to detect targets. The proposed orientation invariant YOLO (OIYOLO) detector can effectively deal with the bird’s eye viewpoint images where the orientation angles of the objects are arbitrary. In order to estimate the rotated angle of objects, we design a new angle loss function. Therefore, the training of OIYOLO forces the network to learn the annotated orientation angle of objects, making OIYOLO orientation invariances. The proposed approach that predicts OBB can be applied in other detection frameworks. In additional, to evaluate the proposed OIYOLO detector, we create an UAV-DAHUA datasets that annotated with objects locations, scales and orientation angles accurately. Extensive experiments conducted on UAV-DAHUA and DOTA datasets demonstrate that OIYOLO achieves state-of-the-art detection performance with high efficiency comparing with the baseline YOLO algorithms.

Download Full-text

Examination of Abnormal Behavior Detection Based on Improved YOLOv3

Electronics ◽

10.3390/electronics10020197 ◽

2021 ◽

Vol 10 (2) ◽

pp. 197

Author(s):

Meng-ting Fang ◽

Zhong-ju Chen ◽

Krzysztof Przystupa ◽

Tao Li ◽

Michal Majka ◽

...

Keyword(s):

Detection Method ◽

Abnormal Behavior ◽

Detection Accuracy ◽

Average Precision ◽

Research Results ◽

Behavior Detection ◽

Abnormal Behavior Detection ◽

The Mean ◽

Examination Room ◽

Detection Speed

Examination is a way to select talents, and a perfect invigilation strategy can improve the fairness of the examination. To realize the automatic detection of abnormal behavior in the examination room, the method based on the improved YOLOv3 (The third version of the You Only Look Once algorithm) algorithm is proposed. The YOLOv3 algorithm is improved by using the K-Means algorithm, GIoUloss, focal loss, and Darknet32. In addition, the frame-alternate dual-thread method is used to optimize the detection process. The research results show that the improved YOLOv3 algorithm can improve both the detection accuracy and detection speed. The frame-alternate dual-thread method can greatly increase the detection speed. The mean Average Precision (mAP) of the improved YOLOv3 algorithm on the test set reached 88.53%, and the detection speed reached 42 Frames Per Second (FPS) in the frame-alternate dual-thread detection method. The research results provide a certain reference for automated invigilation.

Download Full-text

Random Forest with Adaptive Local Template for Pedestrian Detection

Mathematical Problems in Engineering ◽

10.1155/2015/767423 ◽

2015 ◽

Vol 2015 ◽

pp. 1-11 ◽

Cited By ~ 2

Author(s):

Tao Xiang ◽

Tao Li ◽

Mao Ye ◽

Zijian Liu

Keyword(s):

Computer Vision ◽

Random Forest ◽

Classification Accuracy ◽

Template Matching ◽

Detection Method ◽

State Of The Art ◽

Pedestrian Detection ◽

Sliding Window ◽

Experimental Results ◽

Training Samples

Pedestrian detection with large intraclass variations is still a challenging task in computer vision. In this paper, we propose a novel pedestrian detection method based on Random Forest. Firstly, we generate a few local templates with different sizes and different locations in positive exemplars. Then, the Random Forest is built whose splitting functions are optimized by maximizing class purity of matching the local templates to the training samples, respectively. To improve the classification accuracy, we adopt a boosting-like algorithm to update the weights of the training samples in a layer-wise fashion. During detection, the trained Random Forest will vote the category when a sliding window is input. Our contributions are the splitting functions based on local template matching with adaptive size and location and iteratively weight updating method. We evaluate the proposed method on 2 well-known challenging datasets: TUD pedestrians and INRIA pedestrians. The experimental results demonstrate that our method achieves state-of-the-art or competitive performance.

Download Full-text

A Hard Example Mining Approach for Concealed Multi-Object Detection of Active Terahertz Image

Applied Sciences ◽

10.3390/app112311241 ◽

2021 ◽

Vol 11 (23) ◽

pp. 11241

Author(s):

Ling Li ◽

Fei Xue ◽

Dong Liang ◽

Xiaofei Chen

Keyword(s):

Computer Vision ◽

Object Detection ◽

State Of The Art ◽

Terahertz Imaging ◽

Public Security ◽

Counter Terrorism ◽

Detection Algorithms ◽

Public Dataset ◽

The One ◽

Objects Detection

Concealed objects detection in terahertz imaging is an urgent need for public security and counter-terrorism. So far, there is no public terahertz imaging dataset for the evaluation of objects detection algorithms. This paper provides a public dataset for evaluating multi-object detection algorithms in active terahertz imaging. Due to high sample similarity and poor imaging quality, object detection on this dataset is much more difficult than on those commonly used public object detection datasets in the computer vision field. Since the traditional hard example mining approach is designed based on the two-stage detector and cannot be directly applied to the one-stage detector, this paper designs an image-based Hard Example Mining (HEM) scheme based on RetinaNet. Several state-of-the-art detectors, including YOLOv3, YOLOv4, FRCN-OHEM, and RetinaNet, are evaluated on this dataset. Experimental results show that the RetinaNet achieves the best mAP and HEM further enhances the performance of the model. The parameters affecting the detection metrics of individual images are summarized and analyzed in the experiments.

Download Full-text

A new joint CTC-attention-based speech recognition model with multi-level multi-head attention

EURASIP Journal on Audio Speech and Music Processing ◽

10.1186/s13636-019-0161-0 ◽

2019 ◽

Vol 2019 (1) ◽

Cited By ~ 2

Author(s):

Chu-Xiong Qin ◽

Wen-Lin Zhang ◽

Dan Qu

Keyword(s):

Speech Recognition ◽

Nonnegative Matrix Factorization ◽

State Of The Art ◽

Nonnegative Matrix ◽

Attention Mechanism ◽

Word Error Rate ◽

Absolute Value ◽

Multi Level ◽

End To End ◽

High Level

Abstract A method called joint connectionist temporal classification (CTC)-attention-based speech recognition has recently received increasing focus and has achieved impressive performance. A hybrid end-to-end architecture that adds an extra CTC loss to the attention-based model could force extra restrictions on alignments. To explore better the end-to-end models, we propose improvements to the feature extraction and attention mechanism. First, we introduce a joint model trained with nonnegative matrix factorization (NMF)-based high-level features. Then, we put forward a hybrid attention mechanism by incorporating multi-head attentions and calculating attention scores over multi-level outputs. Experiments on TIMIT indicate that the new method achieves state-of-the-art performance with our best model. Experiments on WSJ show that our method exhibits a word error rate (WER) that is only 0.2% worse in absolute value than the best referenced method, which is trained on a much larger dataset, and it beats all present end-to-end methods. Further experiments on LibriSpeech show that our method is also comparable to the state-of-the-art end-to-end system in WER.

Download Full-text

Fast Face Detection Based on AdaBoost and Canny Operators

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.716-717.936 ◽

2014 ◽

Vol 716-717 ◽

pp. 936-939

Author(s):

Lin Zhang

Keyword(s):

Face Detection ◽

Detection Method ◽

Face Image ◽

Experimental Results ◽

Detection Accuracy ◽

Human Face ◽

Adaboost Algorithm ◽

Amount Of Information ◽

Detection Speed

Detection speed of traditional face detection method based on AdaBoost algorithm is slow since AdaBoost asks a large number of features. Therefore, to address this shortcoming, we proposed a fast face detection method based on AdaBoost and canny operators in this paper. Firstly, we use canny operators to detect edge of face image which separates the region of the possible human face from image, and then do face detection in the separated region using Modest AdaBoost algorithm (MAB). Before using MAB to achieve face detection, utilizing canny operators to detect edge can make this algorithm effectively filter information, retain useful information, reduce the amount of information and improve detection speed. Experimental results show that the algorithm can obtain higher detection accuracy and detection speed has been significantly improved at the same time.

Download Full-text

A Fast Positioning Technology for Compensation Hole of Automobile Brake Master Cylinder

E3S Web of Conferences ◽

10.1051/e3sconf/202125201018 ◽

2021 ◽

Vol 252 ◽

pp. 01018

Author(s):

Changfu Zhao ◽

Hongchang Ding ◽

Guohua Cao ◽

Han Hou

Keyword(s):

Traditional Method ◽

Detection Method ◽

The Self ◽

Fine Tuning ◽

Process Time ◽

Detection Accuracy ◽

Positioning Accuracy ◽

Master Cylinder ◽

Structural Part ◽

Detection Speed

The compensation hole of the automobile brake master cylinder is an important structural part for adjusting the reservoir and pressure chamber of the brake master cylinder. Its detection accuracy is strictly controlled. However, because the compensation hole is located on the inner wall of the blind hole, the existing detection method cannot meet the testing needs. Therefore, this paper introduces the SSD model into the detection of the compensation hole of the brake master cylinder, and realizes the rapid positioning of the compensation hole by means of network fine-tuning. The compensation hole positioning detection is carried out on the self-developed automobile brake master cylinder compensation hole detector. The entire detection process time is about 5s, and the positioning accuracy is high. We apply the fine-tuning SSD model to the detection of the compensation hole of automobile brake master cylinder, which replaces the traditional method based on human-computer interaction to determine the position of the compensation hole. It has better detection accuracy and faster detection speed, and lays the foundation for the subsequent detection of the size of the compensation hole.

Download Full-text

Optimized loss functions for object detection and application on nighttime vehicle detection

Proceedings of the Institution of Mechanical Engineers Part D Journal of Automobile Engineering ◽

10.1177/09544070211036366 ◽

2021 ◽

pp. 095440702110363

Author(s):

Shang Jiang ◽

Haoran Qin ◽

Bingli Zhang ◽

Jieyu Zheng

Keyword(s):

Object Detection ◽

Loss Function ◽

Mahalanobis Distance ◽

Vehicle Detection ◽

Detection Task ◽

Loss Functions ◽

Detection Accuracy ◽

Localization Accuracy ◽

Detection Speed

The loss function is a crucial factor that affects the detection precision in the object detection task. In this paper, we optimize both two loss functions for classification and localization simultaneously. Firstly, we reconstruct the classification loss function by combining the prediction results of localization, aiming to establish the correlation between localization and classification subnetworks. Compared to the existing studies, in which the correlation is only established among the positive samples and applied to improve the localization accuracy of predicted boxes, this paper utilizes the correlation to define the hard negative samples and then puts emphasis on the classification of them. Thus the whole misclassified rate for negative samples can be reduced. Besides, a novel localization loss named MIoU is proposed by incorporating a Mahalanobis distance between the predicted box and target box, eliminating the gradients inconsistency problem in the DIoU loss, further improving the localization accuracy. Finally, the proposed methods are applied to train the networks for nighttime vehicle detection. Experimental results show that the detection accuracy can be outstandingly improved with our proposed loss functions without hurting the detection speed.

Download Full-text

An Improved Fingerprinting Algorithm for Detection of Video Frame Duplication Forgery

International Journal of Digital Crime and Forensics ◽

10.4018/jdcf.2012070102 ◽

2012 ◽

Vol 4 (3) ◽

pp. 20-32 ◽

Cited By ~ 3

Author(s):

Yongjian Hu ◽

Chang-Tsun Li ◽

Yufei Wang ◽

Bei-bei Liu

Keyword(s):

Computational Efficiency ◽

State Of The Art ◽

Experimental Results ◽

Video Frame ◽

Detection Accuracy ◽

Forgery Detection ◽

Detection Algorithms ◽

Dct Coefficients ◽

Duplication Detection ◽

Frame Duplication

Frame duplication is a common way of digital video forgeries. State-of-the-art approaches of duplication detection usually suffer from heavy computational load. In this paper, the authors propose a new algorithm to detect duplicated frames based on video sub-sequence fingerprints. The fingerprints employed are extracted from the DCT coefficients of the temporally informative representative images (TIRIs) of the sub-sequences. Compared with other similar algorithms, this study focuses on improving fingerprints representing video sub-sequences and introducing a simple metric for the matching of video sub-sequences. Experimental results show that the proposed algorithm overall outperforms three related duplication forgery detection algorithms in terms of computational efficiency, detection accuracy and robustness against common video operations like compression and brightness change.

Download Full-text

Automatic Pixel-Level Pavement Crack Recognition Using a Deep Feature Aggregation Segmentation Network with a scSE Attention Mechanism Module

Sensors ◽

10.3390/s21092902 ◽

2021 ◽

Vol 21 (9) ◽

pp. 2902

Author(s):

Wenting Qiao ◽

Qiangwei Liu ◽

Xiaoguang Wu ◽

Biao Ma ◽

Gang Li

Keyword(s):

Deep Learning ◽

Crack Detection ◽

Attention Mechanism ◽

Model Parameters ◽

Detection Accuracy ◽

Safe Driving ◽

Deep Feature ◽

Feature Aggregation ◽

Pavement Crack Detection ◽

Detection Speed

Pavement crack detection is essential for safe driving. The traditional manual crack detection method is highly subjective and time-consuming. Hence, an automatic pavement crack detection system is needed to facilitate this progress. However, this is still a challenging task due to the complex topology and large noise interference of crack images. Recently, although deep learning-based technologies have achieved breakthrough progress in crack detection, there are still some challenges, such as large parameters and low detection efficiency. Besides, most deep learning-based crack detection algorithms find it difficult to establish good balance between detection accuracy and detection speed. Inspired by the latest deep learning technology in the field of image processing, this paper proposes a novel crack detection algorithm based on the deep feature aggregation network with the spatial-channel squeeze & excitation (scSE) attention mechanism module, which calls CrackDFANet. Firstly, we cut the collected crack images into 512 × 512 pixel image blocks to establish a crack dataset. Then through iterative optimization on the training and validation sets, we obtained a crack detection model with good robustness. Finally, the CrackDFANet model verified on a total of 3516 images in five datasets with different sizes and containing different noise interferences. Experimental results show that the trained CrackDFANet has strong anti-interference ability, and has better robustness and generalization ability under the interference of light interference, parking line, water stains, plant disturbance, oil stains, and shadow conditions. Furthermore, the CrackDFANet is found to be better than other state-of-the-art algorithms with more accurate detection effect and faster detection speed. Meanwhile, our algorithm model parameters and error rates are significantly reduced.

Download Full-text