Multiple Pedestrians and Vehicles Tracking in Aerial Imagery Using a Convolutional Neural Network

Seyed Majid Azimi; Maximilian Kraus; Reza Bahmanyar; Peter Reinartz

doi:10.3390/rs13101953

Remote Sensing ◽

10.3390/rs13101953 ◽

2021 ◽

Vol 13 (10) ◽

pp. 1953

Author(s):

Seyed Majid Azimi ◽

Maximilian Kraus ◽

Reza Bahmanyar ◽

Peter Reinartz

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Object Tracking ◽

Short Term Memory ◽

Aerial Imagery ◽

Future Research ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

In this paper, we address various challenges in multi-pedestrian and vehicle tracking in high-resolution aerial imagery through an intensive evaluation of a number of traditional and Deep Learning based Single- and Multi-Object Tracking methods. We also describe our proposed Deep Learning based Multi-Object Tracking method, AerialMPTNet, which fuses appearance, temporal, and graphical information using a Siamese Neural Network, a Long Short-Term Memory, and a Graph Convolutional Neural Network module for more accurate and stable tracking. Moreover, we investigate the influence of Squeeze-and-Excitation layers and Online Hard Example Mining on the performance of AerialMPTNet. To the best of our knowledge, we are the first to use these two for regression-based Multi-Object Tracking. Additionally, we study and compare the L1 and Huber loss functions. In our experiments, we extensively evaluate AerialMPTNet on three aerial Multi-Object Tracking datasets, namely the AerialMPT and the KIT AIS pedestrian and vehicle datasets. Qualitative and quantitative results show that AerialMPTNet outperforms all previous methods on the pedestrian datasets and achieves competitive results on the vehicle dataset. In addition, the Long Short-Term Memory and Graph Convolutional Neural Network modules enhance the tracking performance. Using Squeeze-and-Excitation and Online Hard Example Mining helps significantly in some cases while degrading the results in others. According to the results, the L1 loss yields better results than the Huber loss in most scenarios. The presented results provide deep insight into the challenges and opportunities of the aerial Multi-Object Tracking domain, paving the way for future research.
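The abstract compares L1 and Huber losses for regression-based tracking. As a minimal sketch of how the two differ (not the authors' implementation; the `delta` threshold and the sample offsets are illustrative assumptions), the following pure-Python functions compute both losses on the same residuals:

```python
def l1_loss(pred, target):
    """Mean absolute error over paired values."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def huber_loss(pred, target, delta=1.0):
    """Mean Huber loss: quadratic for |error| <= delta, linear beyond it."""
    total = 0.0
    for p, t in zip(pred, target):
        err = abs(p - t)
        if err <= delta:
            total += 0.5 * err * err
        else:
            total += delta * (err - 0.5 * delta)
    return total / len(pred)


# Hypothetical predicted vs. ground-truth position offsets (made-up values).
pred = [0.5, 2.0, -1.0, 4.0]
target = [0.0, 0.0, 0.0, 0.0]

print("L1:   ", l1_loss(pred, target))
print("Huber:", huber_loss(pred, target))
```

For small residuals the Huber loss is quadratic, so its gradient shrinks as the error approaches zero, whereas L1 keeps a constant-magnitude gradient. Which behaviour is preferable depends on the data.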


Deep Learning with Convolutional Neural Network and Long Short-Term Memory for Phishing Detection

2019 13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA) ◽

10.1109/skima47702.2019.8982427 ◽

2019 ◽

Author(s):

M. A. Adebowale ◽

K. T. Lwin ◽

M. A. Hossain

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory ◽

Phishing Detection


A Spectral-Spatial Cascaded 3D Convolutional Neural Network with a Convolutional Long Short-Term Memory Network for Hyperspectral Image Classification

Remote Sensing ◽

10.3390/rs11202363 ◽

2019 ◽

Vol 11 (20) ◽

pp. 2363 ◽

Cited By ~ 2

Author(s):

Wenchao Qi ◽

Xia Zhang ◽

Nan Wang ◽

Mao Zhang ◽

Yi Cen

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Hyperspectral Image ◽

Short Term ◽

Dynamic Learning ◽

Learning Methods ◽

Term Memory ◽

Long Short Term Memory

Deep learning methods used for hyperspectral image (HSI) classification often achieve greater accuracy than traditional algorithms but require large numbers of training epochs. To simplify model structures and reduce their training epochs, an end-to-end deep learning framework incorporating a spectral-spatial cascaded 3D convolutional neural network (CNN) with a convolutional long short-term memory (CLSTM) network, called SSCC, is proposed herein for HSI classification. The SSCC framework employs a cascaded 3D CNN to learn the spectral-spatial features of HSIs and uses the CLSTM network to extract sequence features. Residual connections are used in SSCC to accelerate model convergence, with the outputs of previous convolutional layers concatenated as inputs for subsequent layers. Moreover, data augmentation, the parametric rectified linear unit, a dynamic learning rate, batch normalization, and regularization (including dropout and L2) are used to increase classification accuracy and prevent overfitting. These attributes allow the SSCC framework to achieve good performance for HSI classification within 20 epochs. Three well-known datasets, Indian Pines, University of Pavia, and Pavia Center, were employed to evaluate the classification performance of the proposed algorithm. The GF-5 dataset of Anxin County, obtained from China’s recently launched spaceborne Advanced Hyperspectral Imager, was also used for classification experiments. The experimental results demonstrate that the proposed SSCC framework achieves state-of-the-art performance with better training efficiency than other deep learning methods.


Hybrid convolutional neural network (CNN) and long-short term memory (LSTM) based deep learning model for detecting shilling attack in the social-aware network

Journal of Ambient Intelligence and Humanized Computing ◽

10.1007/s12652-020-02164-y ◽

2020 ◽

Cited By ~ 2

Author(s):

K. Vivekanandan ◽

N. Praveena

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Shilling Attack ◽

The Social ◽

Long Short Term Memory ◽

Deep Learning Model


Deep Learning Enhanced Solar Energy Forecasting with AI-Driven IoT

Wireless Communications and Mobile Computing ◽

10.1155/2021/9249387 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Hangxia Zhou ◽

Qian Liu ◽

Ke Yan ◽

Yang Du

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Energy Generation ◽

Attention Mechanism ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Short-term photovoltaic (PV) energy generation forecasting models are important for stabilizing the power integration between PV systems and the smart grid in artificial intelligence (AI)-driven internet of things (IoT) modeling of smart cities. With the recent development of AI and IoT technologies, deep learning techniques can achieve more accurate energy generation forecasts for PV systems. Traditional PV energy generation forecasting methods have difficulty accounting for external feature variables such as seasonality. In this study, we propose a hybrid deep learning method that combines clustering techniques, a convolutional neural network (CNN), long short-term memory (LSTM), and an attention mechanism with a wireless sensor network to overcome the existing difficulties of the PV energy generation forecasting problem. The proposed method is divided into three stages, namely, clustering, training, and forecasting. In the clustering stage, correlation analysis and self-organizing mapping are employed to select the most relevant factors in the historical data. In the training stage, a convolutional neural network, a long short-term memory neural network, and an attention mechanism are combined to construct a hybrid deep learning model that performs the forecasting task. In the forecasting stage, the most appropriate trained model is selected based on the month of the testing data. The experimental results showed significantly higher prediction accuracy for all time intervals compared to existing methods, including traditional artificial neural networks, long short-term memory neural networks, and an algorithm combining a long short-term memory neural network with an attention mechanism.
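The abstract does not specify how its attention mechanism is formulated. As a generic illustration only, the sketch below applies dot-product attention over a sequence of recurrent hidden states: each time step is scored against a query vector, the scores are normalized with a softmax, and the states are averaged with those weights. The names `attention_pool`, `hidden_states`, and `query`, and the sample values, are assumptions made for this example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def attention_pool(hidden_states, query):
    """Weight each time step by its dot-product similarity to the query."""
    scores = [sum(h_d * q_d for h_d, q_d in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    context = [
        sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)
    ]
    return context, weights


# Hypothetical hidden states for three time steps (made-up values).
hidden_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 1.0]
context, weights = attention_pool(hidden_states, query)
print(weights)
print(context)
```

The time step most similar to the query receives the largest weight, so the pooled context vector emphasizes the parts of the history judged most relevant.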


Comparative analysis of short-term demand predicting models using ARIMA and deep learning

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v11i4.pp3319-3328 ◽

2021 ◽

Vol 11 (4) ◽

pp. 3319

Author(s):

Halima Bousqaoui ◽

Ilham Slimani ◽

Said Achchab

Keyword(s):

Neural Network ◽

Deep Learning ◽

Comparative Analysis ◽

Convolutional Neural Network ◽

Demand Uncertainty ◽

Short Term Memory ◽

Real Life ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Forecasting consists of taking historical data as inputs and using them to predict future observations, thus determining future trends. Demand prediction is a crucial component of the supply chain process that allows each member to enhance its performance and profit. Nevertheless, because of demand uncertainty, supply chains usually suffer from many problems, such as the bullwhip effect. As a solution to those logistics issues, this paper presents a comparative analysis of four time series demand forecasting models: the autoregressive integrated moving average (ARIMA), a statistical model; the multi-layer perceptron (MLP), a feedforward neural network; the long short-term memory (LSTM) model, a recurrent neural network; and the convolutional neural network (CNN or ConvNet), a deep learning model. The experiments are carried out using a real-life dataset provided by a supermarket in Morocco. The results clearly show that the convolutional neural network gives slightly better forecasting results than the long short-term memory network.
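Feeding historical observations to a neural forecaster is commonly done by slicing the series into fixed-length input windows, each paired with the next value as its target. The sketch below shows that framing; the window length and the demand values are illustrative assumptions, not taken from the paper.

```python
def make_windows(series, window):
    """Turn a series into (input window, next value) training pairs."""
    inputs, targets = [], []
    for i in range(len(series) - window):
        inputs.append(series[i:i + window])
        targets.append(series[i + window])
    return inputs, targets


# Hypothetical daily demand figures (made-up values).
demand = [12, 15, 14, 18, 20, 19]
X, y = make_windows(demand, window=3)

for window_values, next_value in zip(X, y):
    print(window_values, "->", next_value)
```

The same pairs can be used to train an MLP, LSTM, or CNN, which makes a like-for-like comparison between such models possible.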


Estimation of municipal solid waste amount based on one-dimension convolutional neural network and long short-term memory with attention mechanism model: A case study of Shanghai

The Science of The Total Environment ◽

10.1016/j.scitotenv.2021.148088 ◽

2021 ◽

Vol 791 ◽

pp. 148088

Author(s):

Kunsen Lin ◽

Youcai Zhao ◽

Lu Tian ◽

Chunlong Zhao ◽

Meilan Zhang ◽

...

Keyword(s):

Neural Network ◽

Municipal Solid Waste ◽

Convolutional Neural Network ◽

Short Term Memory ◽

One Dimension ◽

Short Term ◽

Term Memory ◽

Mechanism Model ◽

Long Short Term Memory


Sentiment Analysis on Twitter Data by Using Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM)

Wireless Personal Communications ◽

10.1007/s11277-021-08580-3 ◽

2021 ◽

Author(s):

Usha Devi Gandhi ◽

Priyan Malarvizhi Kumar ◽

Gokulnath Chandra Babu ◽

Gayathri Karthick

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Sentiment Analysis ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Twitter Data ◽

Long Short Term Memory


1D Convolutional Neural Network with Long Short-Term Memory for Human Activity Recognition

10.1109/iicaiet51634.2021.9573979 ◽

2021 ◽

Author(s):

Jia Xin Goh ◽

Kian Ming Lim ◽

Chin Poo Lee

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Activity Recognition ◽

Human Activity ◽

Short Term Memory ◽

Human Activity Recognition ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory


Gait-Based Human Identification by Combining Shallow Convolutional Neural Network-Stacked Long Short-Term Memory and Deep Convolutional Neural Network

IEEE Access ◽

10.1109/access.2018.2876890 ◽

2018 ◽

Vol 6 ◽

pp. 63164-63186 ◽

Cited By ~ 12

Author(s):

Ganbayar Batchuluun ◽

Hyo Sik Yoon ◽

Jin Kyu Kang ◽

Kang Ryoung Park

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Human Identification ◽

Deep Convolutional Neural Network ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory


Pemanfaatan Asynchronous Advantage Actor-Critic Dalam Pembuatan AI Game Bot Pada Game Arcade (Using Asynchronous Advantage Actor-Critic to Build an AI Game Bot for Arcade Games)

Journal of Intelligent System and Computation ◽

10.52985/insyst.v1i2.82 ◽

2019 ◽

Vol 1 (2) ◽

pp. 74-84

Author(s):

Evan Kusuma Susanto ◽

Yosi Kristian

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Reinforcement Learning ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Trial And Error ◽

Short Term ◽

Term Memory ◽

Memory Network ◽

Long Short Term Memory

Asynchronous Advantage Actor-Critic (A3C) is a deep reinforcement learning algorithm developed by Google DeepMind. It can be used to build an artificial intelligence architecture that masters many different kinds of games through trial and error, learning from the game's screen display and the score obtained from its actions, without human intervention. An A3C network consists of a Convolutional Neural Network (CNN) at the front, a Long Short-Term Memory (LSTM) network in the middle, and an Actor-Critic network at the back. The CNN summarizes the screen output image by extracting the important features present on screen. The LSTM remembers previous game states. The Actor-Critic network determines the best action to take when faced with a given condition. In the experiments conducted, the method proved fairly effective and could beat novice players in the 5 games used for testing.
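The "advantage" in A3C is usually the difference between a discounted n-step return and the critic's value estimate. The abstract does not spell out this computation, so the following is a minimal sketch of the standard form; the discount factor, rewards, and value estimates are illustrative assumptions.

```python
def discounted_returns(rewards, bootstrap_value, gamma=0.99):
    """n-step returns, bootstrapped from the critic's estimate of the last state."""
    returns = []
    running = bootstrap_value
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, values):
    """Advantage = observed return minus the critic's predicted value."""
    return [ret - v for ret, v in zip(returns, values)]


# Hypothetical rollout of three steps (made-up numbers).
rewards = [1.0, 0.0, 2.0]
values = [1.0, 1.0, 1.0]  # critic estimates for the visited states
returns = discounted_returns(rewards, bootstrap_value=4.0, gamma=0.5)
adv = advantages(returns, values)
print(returns)
print(adv)
```

A positive advantage means the action did better than the critic expected, so the actor's probability of taking it is increased. A negative advantage pushes the probability down.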
