Dim Target Detection Method Based on Deep Learning in Complex Traffic Environment

Abstract Although the current vehicle detection and recognition framework based on deep learning has its own characteristics and advantages, it is difficult to effectively combine multi-scale and multi category vehicle features, and there is still room for improvement in vehicle detection and recognition performance. Based on this, an improved fast R-CNN convolutional neural network is proposed to detect dim targets in complex traffic environment. The deep learning model of fast R-CNN convolutional neural network is introduced into the image recognition of complex traffic environment, and a structure optimization method is proposed, which replaces vgg16 in fast RCNN with RESNET to make it suitable for small target recognition in complex background. Max pooling is the down sampling method, and then feature pyramid network is introduced into RPN to generate target candidate box to optimize the structure of convolutional neural network. After training with 1497 images, the complex traffic environment images are identified and tested.

Download Full-text

Hyperspectral Image Classification Based on Multi-Scale Residual Network with Attention Mechanism

Remote Sensing ◽

10.3390/rs13030335 ◽

2021 ◽

Vol 13 (3) ◽

pp. 335

Author(s):

Yuhao Qing ◽

Wenyi Liu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Image Classification ◽

Classification Accuracy ◽

Hyperspectral Image ◽

Principal Component ◽

Hyperspectral Image Classification ◽

Deep Network ◽

Multi Scale

In recent years, image classification on hyperspectral imagery utilizing deep learning algorithms has attained good results. Thus, spurred by that finding and to further improve the deep learning classification accuracy, we propose a multi-scale residual convolutional neural network model fused with an efficient channel attention network (MRA-NET) that is appropriate for hyperspectral image classification. The suggested technique comprises a multi-staged architecture, where initially the spectral information of the hyperspectral image is reduced into a two-dimensional tensor, utilizing a principal component analysis (PCA) scheme. Then, the constructed low-dimensional image is input to our proposed ECA-NET deep network, which exploits the advantages of its core components, i.e., multi-scale residual structure and attention mechanisms. We evaluate the performance of the proposed MRA-NET on three public available hyperspectral datasets and demonstrate that, overall, the classification accuracy of our method is 99.82 %, 99.81%, and 99.37, respectively, which is higher compared to the corresponding accuracy of current networks such as 3D convolutional neural network (CNN), three-dimensional residual convolution structure (RES-3D-CNN), and space–spectrum joint deep network (SSRN).

Download Full-text

CMS R-CNN: An Efficient Cascade Multi-Scale Region-based Convolutional Neural Network for Accurate 2D Small Vehicle Detection

2019 Chinese Automation Congress (CAC) ◽

10.1109/cac48633.2019.8997429 ◽

2019 ◽

Author(s):

Ziyu Li ◽

Chao Wang ◽

Qiang Wang ◽

Wankou Yang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Vehicle Detection ◽

Multi Scale

Download Full-text

Video Vehicle Detection and Recognition Based on MapReduce and Convolutional Neural Network

Lecture Notes in Computer Science - Advances in Swarm Intelligence ◽

10.1007/978-3-319-93818-9_53 ◽

2018 ◽

pp. 552-562 ◽

Cited By ~ 2

Author(s):

Mingsong Chen ◽

Weiguang Wang ◽

Shi Dong ◽

Xinling Zhou

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Vehicle Detection ◽

Detection And Recognition

Download Full-text

Deep Learning With TensorFlow: A Review

Journal of Educational and Behavioral Statistics ◽

10.3102/1076998619872761 ◽

2019 ◽

Vol 45 (2) ◽

pp. 227-248 ◽

Cited By ~ 4

Author(s):

Bo Pang ◽

Erik Nijkamp ◽

Ying Nian Wu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Network Models ◽

Optimization Method ◽

Stochastic Gradient Descent ◽

Processing Unit ◽

Neural Network Models ◽

Core Concepts ◽

High Level

This review covers the core concepts and design decisions of TensorFlow. TensorFlow, originally created by researchers at Google, is the most popular one among the plethora of deep learning libraries. In the field of deep learning, neural networks have achieved tremendous success and gained wide popularity in various areas. This family of models also has tremendous potential to promote data analysis and modeling for various problems in educational and behavioral sciences given its flexibility and scalability. We give the reader an overview of the basics of neural network models such as the multilayer perceptron, the convolutional neural network, and stochastic gradient descent, the most commonly used optimization method for neural network models. However, the implementation of these models and optimization algorithms is time-consuming and error-prone. Fortunately, TensorFlow greatly eases and accelerates the research and application of neural network models. We review several core concepts of TensorFlow such as graph construction functions, graph execution tools, and TensorFlow’s visualization tool, TensorBoard. Then, we apply these concepts to build and train a convolutional neural network model to classify handwritten digits. This review is concluded by a comparison of low- and high-level application programming interfaces and a discussion of graphical processing unit support, distributed training, and probabilistic modeling with TensorFlow Probability library.

Download Full-text

Deep Convolutional Neural Network Based Traffic Vehicle Detection and Recognition

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - IoT as a Service ◽

10.1007/978-3-030-44751-9_36 ◽

2020 ◽

pp. 427-438

Author(s):

Yukun Rao ◽

Guanwen Zhang ◽

Wei Zhou ◽

Changhao Wang ◽

Yu Lv

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Vehicle Detection ◽

Deep Convolutional Neural Network ◽

Detection And Recognition

Download Full-text

Bearing Fault Diagnosis Based on Shallow Multi-Scale Convolutional Neural Network with Attention

Energies ◽

10.3390/en12203937 ◽

2019 ◽

Vol 12 (20) ◽

pp. 3937 ◽

Cited By ~ 4

Author(s):

Tengda Huang ◽

Sheng Fu ◽

Haonan Feng ◽

Jiafeng Kuang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Fault Diagnosis ◽

Convolutional Neural Network ◽

Time Domain ◽

Attention Mechanism ◽

Intelligent Fault Diagnosis ◽

Bearing Fault ◽

Bearing Fault Diagnosis ◽

Multi Scale

Recently, deep learning technology was successfully applied to mechanical fault diagnosis. The convolutional neural network (CNN), as a prevalent deep learning model, occupies a place in intelligent fault diagnosis, which reduces the need for human feature extraction and prior knowledge, thereby achieving an end-to-end intelligent fault diagnosis model. However, the data for mechanical fault diagnosis in practical application are limited, the CNN model is too deep and too complex, making it prone to overfitting, and a model with too simple a structure and shallow layers cannot fully learn the effective features of the data. Convolutional filters with fixed window sizes are widely used in existing CNN models, which cannot flexibly select variable pivotal features. The model may be interfered with by redundant information in feature maps during training. Therefore, in this paper, a novel shallow multi-scale convolutional neural network with attention is proposed for bearing fault diagnosis. The shallow multi-scale convolutional neural network structure can fully learn the feature information of input data without overfitting. For the first time, a feature attention mechanism is developed for fault diagnosis to adaptively select features for classification more effectively, where the pivotal feature was emphasized, and the redundant feature was weakened through an attention mechanism. The time frequency representations as the input of the model were obtained from the vibration time domain signals, which contain the complete time domain and frequency domain information of the vibration signals. Compared with the current popular diagnostic methods, the results show that the proposed diagnostic method has fairly high accuracy, and its performance is superior to the existing methods. The average recognition accuracy was 99.86%, and the weak recognition rate of I-07 and I-14 labels was improved.

Download Full-text

Rolling Bearing Fault Diagnosis Algorithm Based on Overlapping Group Sparse Model-Deep Complex Convolutional Neural Network

10.21203/rs.3.rs-888210/v1 ◽

2021 ◽

Author(s):

FENGPING AN ◽

Jianrong Wang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Fault Diagnosis ◽

Convolutional Neural Network ◽

Rolling Bearing ◽

Operating Conditions ◽

Sparse Model ◽

Bearing Fault ◽

Bearing Fault Diagnosis ◽

Multi Scale

Abstract As the key component of a mechanical system, rolling bearings will cause paralysis of the entire mechanical system once they fail. In recent years, considering the high generalization ability and nonlinear modeling ability of deep learning, a rolling bearing fault diagnosis method based on deep learning has been formed, and good results have been achieved. However, because this kind of method is still in the initial development stage, its main problems are as follows. First, it is difficult to extract the composite fault signal feature of rolling bearing. Second, the existing deep learning rolling bearing fault diagnosis methods cannot well consider the problem of multi-scale information of rolling bearing signals. Therefore, this paper first proposes the overlapping group sparse model. It constructs weight coefficients by analyzing the salient features of the signal. It uses convex optimization techniques to solve the sparse optimization model, and applies the method to the feature extraction of rolling bearing composite faults. For the problem of multi-scale feature information extraction of rolling bearing composite fault signals, this paper proposes a new deep complex convolutional neural network model. This model fully considers the multi-scale information of rolling bearing signals. The complex information in this model not only contains rich representation ability, but also can extract more scale information. Finally, the classifier of this model is used to identify rolling bearing faults. Based on this, this paper proposes a new rolling bearing fault diagnosis algorithm based on overlapping group sparse model-deep complex convolutional neural network. The experimental results show that the method proposed in this paper can not only effectively identify rolling bearing faults under constant operating conditions, but also accurately identify rolling bearing fault signals under changing operating conditions. Additionally, the classification accuracy of the method proposed in this paper is greatly improved compared with traditional machine learning methods. It also has certain advantages over other deep learning methods.

Download Full-text

Multi-scale 3D-convolutional neural network for hyperspectral image classification

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v25.i1.pp307-316 ◽

2022 ◽

Vol 25 (1) ◽

pp. 307

Author(s):

Murali Kanthi ◽

Thogarcheti Hitendra Sarma ◽

Chigarapalle Shoba Bindu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Classification Accuracy ◽

Hyperspectral Image ◽

State Of The Art ◽

Spatial Dimension ◽

Multi Scale ◽

Proposed Model ◽

Spectral Channels

Deep Learning methods are state-of-the-art approaches for pixel-based hyperspectral images (HSI) classification. High classification accuracy has been achieved by extracting deep features from both spatial-spectral channels. However, the efficiency of such spatial-spectral approaches depends on the spatial dimension of each patch and there is no theoretically valid approach to find the optimum spatial dimension to be considered. It is more valid to extract spatial features by considering varying neighborhood scales in spatial dimensions. In this regard, this article proposes a deep convolutional neural network (CNN) model wherein three different multi-scale spatial-spectral patches are used to extract the features in both the spatial and spectral channels. In order to extract these potential features, the proposed deep learning architecture takes three patches various scales in spatial dimension. 3D convolution is performed on each selected patch and the process runs through entire image. The proposed is named as multi-scale three-dimensional convolutional neural network (MS-3DCNN). The efficiency of the proposed model is being verified through the experimental studies on three publicly available benchmark datasets including Pavia University, Indian Pines, and Salinas. It is empirically proved that the classification accuracy of the proposed model is improved when compared with the remaining state-of-the-art methods.

Download Full-text

A 3D multiscale view convolutional neural network with attention for mental disease diagnosis on MRI images

Mathematical Biosciences and Engineering ◽

10.3934/mbe.2021347 ◽

2021 ◽

Vol 18 (5) ◽

pp. 6978-3994

Author(s):

Zijian Wang ◽

◽

Yaqin Zhu ◽

Haibo Shi ◽

Yanting Zhang ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Mental Disease ◽

Computer Assisted ◽

Multi Scale ◽

Mental Diseases ◽

Assisted Diagnosis ◽

3D Cnn

<abstract> <p>Computer Assisted Diagnosis (CAD) based on brain Magnetic Resonance Imaging (MRI) is a popular research field for the computer science and medical engineering. Traditional machine learning and deep learning methods were employed in the classification of brain MRI images in the previous studies. However, the current algorithms rarely take into consideration the influence of multi-scale brain connectivity disorders on some mental diseases. To improve this defect, a deep learning structure was proposed based on MRI images, which was designed to consider the brain's connections at different sizes and the attention of connections. In this work, a Multiscale View (MV) module was proposed, which was designed to detect multi-scale brain network disorders. On the basis of the MV module, the path attention module was also proposed to simulate the attention selection of the parallel paths in the MV module. Based on the two modules, we proposed a 3D Multiscale View Convolutional Neural Network with Attention (3D MVA-CNN) for classification of MRI images for mental disease. The proposed method outperformed the previous 3D CNN structures in the structural MRI data of ADHD-200 and the functional MRI data of schizophrenia. Finally, we also proposed a preliminary framework for clinical application using 3D CNN, and discussed its limitations on data accessing and reliability. This work promoted the assisted diagnosis of mental diseases based on deep learning and provided a novel 3D CNN method based on MRI data.</p> </abstract>

Download Full-text

Improved stereo matching algorithm based on multi-scale fusion

Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University ◽

10.1051/jnwpu/20213940876 ◽

2021 ◽

Vol 39 (4) ◽

pp. 876-882

Author(s):

Xing Chen ◽

Wenhai Zhang ◽

Yu Hou ◽

Lin Yang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Stereo Matching ◽

Dynamic Programming Algorithm ◽

Good Effect ◽

Disparity Map ◽

Data Set ◽

Matching Algorithm ◽

Multi Scale ◽

Feature Pyramid

Aiming at the low matching accuracy of local stereo matching algorithm in weak texture or discontinuous disparity areas, a stereo matching algorithm combining multi-scale fusion of convolutional neural network (CNN) and feature pyramid structure (FPN) is proposed. The feature pyramid is applied on the basis of the convolutional neural network to realize the multi-scale feature extraction and fusion of the image, which improves the matching similarity of the image blocks. The guide graph filter is used to quickly and effectively complete the cost aggregation. The disparity selection stage adapts the improvement dynamic programming algorithm to obtain the initial disparity map. The initial disparity map is refined so as to obtain the final disparity map. The algorithm is trained and tested on the image provided by Middlebury data set, and the result shows that the disparity map obtained by the algorithm has good effect.

Download Full-text