Vehicle Detection in Aerial Images Based on Hyper Feature Map in Deep Convolutional Network

Vehicles in aerial images are generally with small sizes and unbalanced number of samples, which leads to the poor performances of the existing vehicle detection algorithms. Therefore, an oriented vehicle detection framework based on improved Faster RCNN is proposed for aerial images. First of all, we propose an oversampling and stitching data augmentation method to decrease the negative effect of category imbalance in the training dataset and construct a new dataset with balanced number of samples. Then considering that the pooling operation may loss the discriminative ability of features for small objects, we propose to amplify the feature map so that detailed information hidden in the last feature map can be enriched. Finally, we design a joint training loss function including center loss for both horizontal and oriented bounding boxes, and reduce the impact of small inter-class diversity on vehicle detection. The proposed framework is evaluated on the VEDAI dataset that consists of 9 vehicle categories. The experimental results show that the proposed framework outperforms previous approaches with a mean average precision of 60.4% and 60.1% in detecting horizontal and oriented bounding boxes respectively, which is about 8% better than Faster RCNN.

Download Full-text

Vehicle Detection in Aerial Images Based on 3D Depth Maps and Deep Neural Networks

IEEE Access ◽

10.1109/access.2021.3049741 ◽

2021 ◽

pp. 1-1

Author(s):

Saleh Javadi ◽

Mattias Dahl ◽

Mats I. Pettersson

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Vehicle Detection ◽

Aerial Images ◽

Depth Maps

Download Full-text

A Study on Vehicle Detection through Aerial Images: Various Challenges, Issues and Applications

2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS) ◽

10.1109/icccis51004.2021.9397116 ◽

2021 ◽

Author(s):

Sandeep Kumar ◽

E. G. Rajan ◽

Shilpa Rani

Keyword(s):

Vehicle Detection ◽

Aerial Images

Download Full-text

Vehicle Detection in UAV Aerial Images Based on Improved YOLOv3

2020 IEEE International Conference on Networking, Sensing and Control (ICNSC) ◽

10.1109/icnsc48988.2020.9238059 ◽

2020 ◽

Author(s):

Shengli Zhang ◽

Lin Chai ◽

Lizuo Jin

Keyword(s):

Vehicle Detection ◽

Aerial Images

Download Full-text

E-DiCoNet: Extreme learning machine based classifier for diagnosis of COVID-19 using deep convolutional network

Journal of Ambient Intelligence and Humanized Computing ◽

10.1007/s12652-020-02688-3 ◽

2021 ◽

Author(s):

R. Murugan ◽

Tripti Goel

Keyword(s):

Extreme Learning Machine ◽

Convolutional Network ◽

Deep Convolutional Network ◽

Learning Machine

Download Full-text

Image Classification Based On Deep Convolutional Network And Gaussian Aggregate Encoding

2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI) ◽

10.1109/ictai50040.2020.00089 ◽

2020 ◽

Author(s):

Fengge Wang ◽

Xiaolin Tian ◽

Yang Zhang ◽

Nan Jia ◽

Tiantian Lu

Keyword(s):

Image Classification ◽

Convolutional Network ◽

Deep Convolutional Network

Download Full-text

Facial Expression Recognition Based on Multi-Features Cooperative Deep Convolutional Network

Applied Sciences ◽

10.3390/app11041428 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1428

Author(s):

Haopeng Wu ◽

Zhiying Lu ◽

Jianfeng Zhang ◽

Xin Li ◽

Mingyue Zhao ◽

...

Keyword(s):

Facial Expression ◽

Facial Expressions ◽

Facial Expression Recognition ◽

Video Data ◽

Expression Recognition ◽

Convolutional Network ◽

Facial Movements ◽

The Face ◽

Deep Convolutional Network ◽

Selection Of

This paper addresses the problem of Facial Expression Recognition (FER), focusing on unobvious facial movements. Traditional methods often cause overfitting problems or incomplete information due to insufficient data and manual selection of features. Instead, our proposed network, which is called the Multi-features Cooperative Deep Convolutional Network (MC-DCN), maintains focus on the overall feature of the face and the trend of key parts. The processing of video data is the first stage. The method of ensemble of regression trees (ERT) is used to obtain the overall contour of the face. Then, the attention model is used to pick up the parts of face that are more susceptible to expressions. Under the combined effect of these two methods, the image which can be called a local feature map is obtained. After that, the video data are sent to MC-DCN, containing parallel sub-networks. While the overall spatiotemporal characteristics of facial expressions are obtained through the sequence of images, the selection of keys parts can better learn the changes in facial expressions brought about by subtle facial movements. By combining local features and global features, the proposed method can acquire more information, leading to better performance. The experimental results show that MC-DCN can achieve recognition rates of 95%, 78.6% and 78.3% on the three datasets SAVEE, MMI, and edited GEMEP, respectively.

Download Full-text