Development and evaluation of a deep learning model for real-time ground vehicle semantic segmentation from UAV-based thermal infrared imagery

Abstract Urban area mapping is an important application of remote sensing that aims at both estimating land cover and detecting its change within urban areas. A major challenge in analyzing Synthetic Aperture Radar (SAR) based remote sensing data is that highly vegetated urban areas and oriented urban targets closely resemble actual vegetation. This similarity leads to misclassification of urban areas as forest cover. The present work is a precursor study for the dual-frequency L- and S-band NASA-ISRO Synthetic Aperture Radar (NISAR) mission and aims at minimizing the misclassification of such highly vegetated and oriented urban targets into the vegetation class with the help of deep learning. In this study, three machine learning algorithms, Random Forest (RF), K-Nearest Neighbour (KNN), and Support Vector Machine (SVM), have been implemented along with the deep learning model DeepLabv3+ for semantic segmentation of Polarimetric SAR (PolSAR) data. Deep learning models are generally assumed to require large datasets, but in SAR-based remote sensing a major issue is the lack of a large labeled benchmark dataset for training deep learning algorithms from scratch. The current work shows that a pre-trained DeepLabv3+ model, applied through transfer learning, outperforms the machine learning algorithms on the land use and land cover (LULC) classification task even with a small dataset. DeepLabv3+ achieved the highest pixel accuracy of 87.78% and an overall pixel accuracy of 85.65%; Random Forest performed best among the machine learning algorithms with an overall pixel accuracy of 77.91%, while SVM and KNN trailed with overall accuracies of 77.01% and 76.47%, respectively. The highest precision, 0.9228, was recorded for the urban class in the semantic segmentation task with DeepLabv3+, while SVM and RF gave comparable results with precisions of 0.8977 and 0.8958, respectively.
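
The abstract above reports overall pixel accuracy and per-class precision for semantic segmentation. As a rough illustration of how such figures are commonly derived from a confusion matrix, here is a minimal sketch in Python with NumPy. It is not the authors' evaluation code; the function names and the tiny label maps are hypothetical and chosen only for the example.

```python
import numpy as np

def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels; rows are reference classes, columns are predicted classes."""
    idx = y_true.ravel() * num_classes + y_pred.ravel()
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def overall_pixel_accuracy(cm):
    """Fraction of all pixels whose predicted class matches the reference."""
    return np.trace(cm) / cm.sum()

def per_class_precision(cm):
    """For each class: correctly predicted pixels / all pixels predicted as that class."""
    predicted = cm.sum(axis=0)
    return np.divide(np.diag(cm), predicted,
                     out=np.zeros(cm.shape[0]), where=predicted > 0)

# Hypothetical 2x3 label maps with three classes (assumption for illustration).
y_true = np.array([[0, 0, 1], [1, 2, 2]])
y_pred = np.array([[0, 1, 1], [1, 2, 0]])

cm = confusion_matrix(y_true, y_pred, num_classes=3)
accuracy = overall_pixel_accuracy(cm)
precision = per_class_precision(cm)
print(cm)
print(accuracy, precision)
```

Classes that are never predicted get a precision of 0 here instead of a division error; whether that convention is appropriate depends on the evaluation protocol.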


Application of YOLO Deep Learning Model for Real Time Abandoned Baggage Detection

2018 IEEE 7th Global Conference on Consumer Electronics (GCCE) ◽

10.1109/gcce.2018.8574819 ◽

2018 ◽

Cited By ~ 1

Author(s):

Tossaporn Santad ◽

Piyarat Silapasupphakornwong ◽

Worawat Choensawat ◽

Kingkarn Sookhanaphibarn

Keyword(s):

Deep Learning ◽

Real Time ◽

Learning Model ◽

Deep Learning Model


Detecting Damage Building Using Real-Time Crowdsourced Images And Transfer Learning

10.21203/rs.3.rs-964756/v1 ◽

2021 ◽

Author(s):

Gaurav Chachra ◽

Qingkai Kong ◽

Jim Huang ◽

Srujay Korlakunta ◽

Jennifer Grannen ◽

...

Keyword(s):

Social Media ◽

Deep Learning ◽

Real Time ◽

Transfer Learning ◽

Learning Model ◽

Research Community ◽

Rescue Work ◽

The Public ◽

Social Media Platforms ◽

Deep Learning Model

Abstract After significant earthquakes, images are posted on social media platforms by individuals and media agencies, owing to the widespread use of smartphones. These images can be used to provide information about the shaking damage in the earthquake region to both the public and the research community, and potentially to guide rescue work. This paper presents an automated way to extract images of damaged buildings from social media platforms such as Twitter after earthquakes and thus identify the particular user posts containing such images. Using transfer learning and ~6500 manually labelled images, we trained a deep learning model to recognize images with damaged buildings in the scene. The trained model achieved good performance when tested on newly acquired images of earthquakes at different locations and ran in near real-time on the Twitter feed after the 2020 M7.0 earthquake in Turkey. Furthermore, to better understand how the model makes decisions, we also implemented the Grad-CAM method to visualize the image regions that contribute most to the decision.


Insights into deep learning for earthquake magnitude and location estimation

10.5194/egusphere-egu21-4718 ◽

2021 ◽

Author(s):

Jannes Münchmeyer ◽

Dino Bindi ◽

Ulf Leser ◽

Frederik Tilmann

Keyword(s):

Deep Learning ◽

Real Time ◽

Early Warning ◽

Source Parameters ◽

Spectral Composition ◽

Learning Model ◽

Location Estimation ◽

Training Set ◽

Earthquake Source Parameters ◽

Deep Learning Model

The estimation of earthquake source parameters, in particular magnitude and location, in real time is one of the key tasks for earthquake early warning and rapid response. In recent years, several publications introduced deep learning approaches for these fast assessment tasks. Deep learning is well suited for these tasks, as it can work directly on waveforms and can learn features and their relation from data.

A drawback of deep learning models is their lack of interpretability, i.e., it is usually unknown what reasoning the network uses. Due to this issue, it is also hard to estimate how the model will handle new data whose properties differ in some aspects from the training set, for example earthquakes in previously seismically quiet regions. The discussions of previous studies usually focused on the average performance of models and did not consider this point in any detail.

Here we analyze a deep learning model for real time magnitude and location estimation through targeted experiments and a qualitative error analysis. We conduct our analysis on three large scale regional data sets from regions with diverse seismotectonic settings and network properties: Italy and Japan with dense networks (station spacing down to 10 km) of strong motion sensors, and North Chile with a sparser network (station spacing around 40 km) of broadband stations. We obtained several key insights. First, the deep learning model does not seem to follow the classical approaches for magnitude and location estimation. For magnitude, one would classically expect the model to estimate attenuation, but the network rather seems to focus its attention on the spectral composition of the waveforms. For location, one would expect a triangulation approach, but our experiments instead show indications of a fingerprinting approach. Second, we can pinpoint the effect of training data size on model performance. For example, a four times larger training set reduces average errors for both magnitude and location prediction by more than half, and reduces the required time for real time assessment by a factor of four. Third, the model fails for events with few similar training examples. For magnitude, this means that the largest events are systematically underestimated. For location, events in regions with few events in the training set tend to get mislocated to regions with more training events. These characteristics can have severe consequences in downstream tasks like early warning and need to be taken into account for future model development and evaluation.


Real-time automated diagnosis of colorectal cancer invasion depth using a deep learning model with multimodal data (with video)

Gastrointestinal Endoscopy ◽

10.1016/j.gie.2021.11.049 ◽

2021 ◽

Author(s):

Zihua Lu ◽

Youming Xu ◽

Liwen Yao ◽

Wei Zhou ◽

Wei Gong ◽

...

Keyword(s):

Colorectal Cancer ◽

Deep Learning ◽

Real Time ◽

Learning Model ◽

Cancer Invasion ◽

Automated Diagnosis ◽

Invasion Depth ◽

Multimodal Data ◽

Deep Learning Model


Real-Time Estimation of Eye Movement Condition Using a Deep Learning Model

10.1007/978-3-030-90963-5_11 ◽

2021 ◽

pp. 132-143

Author(s):

Akihiro Sugiura ◽

Yoshiki Itazu ◽

Kunihiko Tanaka ◽

Hiroki Takada

Keyword(s):

Deep Learning ◽

Real Time ◽

Eye Movement ◽

Time Estimation ◽

Learning Model ◽

Movement Condition ◽

Real Time Estimation ◽

Deep Learning Model


Efficient 3D Deep Learning Model for Medical Image Semantic Segmentation

Alexandria Engineering Journal ◽

10.1016/j.aej.2020.10.046 ◽

2021 ◽

Vol 60 (1) ◽

pp. 1231-1239

Author(s):

Nasser Alalwan ◽

Amr Abozeid ◽

AbdAllah A. ElHabshy ◽

Ahmed Alzahrani

Keyword(s):

Deep Learning ◽

Medical Image ◽

Semantic Segmentation ◽

Learning Model ◽

Deep Learning Model


A deep learning model for real-time mortality prediction in critically ill children

Critical Care ◽

10.1186/s13054-019-2561-z ◽

2019 ◽

Vol 23 (1) ◽

Cited By ~ 5

Author(s):

Soo Yeon Kim ◽

Saehoon Kim ◽

Joongbum Cho ◽

Young Suh Kim ◽

In Suk Sol ◽

...

Keyword(s):

Deep Learning ◽

Real Time ◽

Critically Ill ◽

Learning Model ◽

Mortality Prediction ◽

Critically Ill Children ◽

Deep Learning Model


IGRNet: A Deep Learning Model for Non-Invasive, Real-Time Diagnosis of Prediabetes through Electrocardiograms

Sensors ◽

10.3390/s20092556 ◽

2020 ◽

Vol 20 (9) ◽

pp. 2556

Author(s):

Liyang Wang ◽

Yao Mu ◽

Jing Zhao ◽

Xiaoya Wang ◽

Huilian Che

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Real Time ◽

Clinical Symptoms ◽

Characteristic Curve ◽

Learning Model ◽

Machine Learning Algorithms ◽

Test Set ◽

Non Invasive ◽

Deep Learning Model

The clinical symptoms of prediabetes are mild and easy to overlook, but prediabetes may develop into diabetes if early intervention is not performed. In this study, a deep learning model, referred to as IGRNet, is developed to detect and diagnose prediabetes in a non-invasive, real-time manner using a 12-lead electrocardiogram (ECG) lasting 5 s. After searching for an appropriate activation function, we compared the model with two mainstream deep neural networks (AlexNet and GoogLeNet) and three traditional machine learning algorithms to verify the superiority of our method. On the independent test set, which includes a mixed group, the diagnostic accuracy of IGRNet is 0.781 and the area under the receiver operating characteristic curve (AUC) is 0.777. Furthermore, the accuracy and AUC are 0.856 and 0.825, respectively, on the normal-weight-range test set. The experimental results indicate that IGRNet diagnoses prediabetes with high accuracy using ECGs, outperforming other existing machine learning methods; this suggests its potential for application in clinical practice as a non-invasive prediabetes diagnosis technology.
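
The abstract above reports diagnostic accuracy and AUC for a binary classifier. As a rough illustration of what these two figures measure, here is a minimal sketch in Python with NumPy. It is not the IGRNet evaluation code; the function names, the 0.5 decision threshold, and the toy labels and scores are assumptions made for the example. AUC is computed through its pairwise-ranking interpretation: the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half.

```python
import numpy as np

def accuracy(y_true, scores, threshold=0.5):
    """Fraction of cases classified correctly at a fixed score threshold."""
    predictions = (scores >= threshold).astype(int)
    return float(np.mean(predictions == y_true))

def roc_auc(y_true, scores):
    """AUC via pairwise comparison of positive and negative scores."""
    pos = scores[y_true == 1]
    neg = scores[y_true == 0]
    diff = pos[:, None] - neg[None, :]      # every positive vs. every negative
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return float(wins / (pos.size * neg.size))

# Hypothetical labels (1 = positive case) and model scores, for illustration only.
y_true = np.array([0, 0, 1, 1, 1, 0])
scores = np.array([0.1, 0.6, 0.8, 0.4, 0.9, 0.3])

acc = accuracy(y_true, scores)
auc = roc_auc(y_true, scores)
print(acc, auc)
```

Unlike accuracy, AUC does not depend on a particular threshold, which is one reason the two figures are usually reported together. The pairwise form is quadratic in the number of cases, so it suits small test sets; larger ones would normally use a rank-based computation.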


Real-time person detection in low-resolution thermal infrared imagery with MSER and CNNs

10.1117/12.2240940 ◽

2016 ◽

Cited By ~ 5

Author(s):

Christian Herrmann ◽

Thomas Müller ◽

Dieter Willersinn ◽

Jürgen Beyerer

Keyword(s):

Real Time ◽

Thermal Infrared ◽

Low Resolution ◽

Infrared Imagery ◽

Person Detection ◽

Thermal Infrared Imagery
