Classifying stable and unstable videos with deep convolutional networks

Author(s):  
Mehmet Sarigul
Levent Karacan

Since the invention of the camera, shooting video has been a passion for many people. However, the quality of videos recorded with devices such as handheld cameras, head-mounted cameras, and vehicle cameras can be degraded by shaking, jitter, and unwanted periodic movements. Although video stabilization has been studied for decades, there is no consensus on how to measure the performance of a stabilization method, and many studies in the literature use different metrics to compare methods. In this study, deep convolutional neural networks are used as decision makers for video stabilization. VGG networks with different numbers of layers are used to determine the stability status of videos. VGG networks achieved a classification accuracy of up to 96.537% using only two consecutive scenes. These results show that deep learning networks can serve as a metric for video stabilization.
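
The abstract does not specify how the frame pair is presented to the network; a minimal PyTorch sketch of such a stability classifier, assuming the two consecutive frames are stacked channel-wise into a six-channel input, might look as follows:

```python
import torch
import torch.nn as nn
from torchvision.models import vgg16

# Standard VGG-16 adapted for a frame pair and a binary stability label.
model = vgg16(weights=None)
model.features[0] = nn.Conv2d(6, 64, kernel_size=3, padding=1)  # two stacked RGB frames
model.classifier[6] = nn.Linear(4096, 2)                        # stable vs. unstable

frame_pair = torch.randn(1, 6, 224, 224)  # two consecutive 224x224 frames, stacked
logits = model(frame_pair)                # shape: (1, 2)
```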

Author(s):  
Hannah Garcia Doherty
Roberto Arnaiz Burgueño
Roeland P. Trommel
Vasileios Papanastasiou
Ronny I. A. Harmanny

Abstract Identification of human individuals within a group of 39 persons using micro-Doppler (μ-D) features has been investigated. Deep convolutional neural networks with two different training procedures have been used to perform classification. Visualization of the inner network layers revealed the sections of the input image most relevant when determining the class label of the target. A convolutional block attention module is added to provide a weighted feature vector in the channel and feature dimension, highlighting the relevant μ-D feature-filled areas in the image and improving classification performance.
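
For reference, a minimal sketch of a convolutional block attention module in PyTorch; the layer sizes are illustrative assumptions, not the authors' exact configuration:

```python
import torch
import torch.nn as nn

class CBAM(nn.Module):
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        # Channel attention: shared MLP over average- and max-pooled descriptors.
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels))
        # Spatial attention: 7x7 conv over pooled channel statistics.
        self.spatial = nn.Conv2d(2, 1, kernel_size=7, padding=3)

    def forward(self, x):
        b, c, _, _ = x.shape
        avg = self.mlp(x.mean(dim=(2, 3)))
        mx = self.mlp(x.amax(dim=(2, 3)))
        x = x * torch.sigmoid(avg + mx).view(b, c, 1, 1)   # channel re-weighting
        stats = torch.cat([x.mean(1, keepdim=True), x.amax(1, keepdim=True)], 1)
        return x * torch.sigmoid(self.spatial(stats))       # spatial re-weighting

attn = CBAM(64)
feats = torch.randn(2, 64, 32, 32)   # e.g. feature maps from a spectrogram
print(attn(feats).shape)             # torch.Size([2, 64, 32, 32])
```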


Author(s):  
Bo Wang
Xiaoting Yu
Chengeng Huang
Qinghong Sheng
Yuanyuan Wang
...  

The excellent feature extraction ability of deep convolutional neural networks (DCNNs) has been demonstrated in many image processing tasks, in which image classification can achieve high accuracy from raw input images alone. However, it is not readily determinable which specific image features influence the classification results, and what lies behind the predictions remains unclear. This study proposes a method combining the Sobel and Canny operators with an Inception module for ship classification. The Sobel and Canny operators extract enhanced edge features from the input images. A convolutional layer is replaced with the Inception module, which can automatically select the proper convolution kernel for ship objects in different image regions. The principle is that the high-level features abstracted by the DCNN and the features obtained by the multi-convolution concatenation of the Inception module must ultimately derive from the edge information of the preprocessed input images. This indicates that the classification results are based on the input edge features, which indirectly explains the classification results to some extent. Experimental results show that the combination of edge features and the Inception module improves DCNN ship classification performance. The original model with the raw dataset has an average accuracy of 88.72%, while the model using enhanced edge features as input achieves the best performance among all models, 90.54%. The model that replaces the fifth convolutional layer with the Inception module achieves 89.50%; it performs close to VGG-16 on the raw dataset and is significantly better than other deep neural networks. The results validate the functionality and feasibility of the proposed idea.
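
A hedged sketch of the edge-enhancement preprocessing step with OpenCV; how the edge maps are merged with the raw image is an assumption here (simple channel stacking):

```python
import cv2
import numpy as np

def enhanced_edges(image_bgr: np.ndarray) -> np.ndarray:
    """Return a 3-channel input: grayscale + Sobel magnitude + Canny edges."""
    gray = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2GRAY)
    # Sobel gradient magnitude (combined x/y derivatives).
    gx = cv2.Sobel(gray, cv2.CV_32F, 1, 0, ksize=3)
    gy = cv2.Sobel(gray, cv2.CV_32F, 0, 1, ksize=3)
    sobel = cv2.convertScaleAbs(cv2.magnitude(gx, gy))
    # Canny binary edge map.
    canny = cv2.Canny(gray, 100, 200)
    # Stack the raw grayscale and edge channels as the network input (assumed).
    return np.dstack([gray, sobel, canny])
```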


2021, Vol. 15
Author(s):  
Xinglong Wu
Yuhang Tao
Guangzhi He
Dun Liu
Meiling Fan
...  

Deep convolutional neural networks (DCNNs) are widely utilized for the semantic segmentation of dense nerve tissues in light and electron microscopy (EM) images, with the goal of achieving efficient and accurate three-dimensional reconstruction of the vasculature and neural networks in the brain. The success of these tasks heavily depends on the amount, and especially the quality, of the human-annotated labels fed into the DCNNs. However, gold-standard human-annotated labels are often difficult to acquire for dense nerve tissues; human annotations inevitably contain discrepancies or even errors, which substantially impact the performance of DCNNs. Thus, a novel boosting framework was proposed to systematically improve the quality of the annotated labels and, ultimately, segmentation performance. It consists of a DCNN for multilabel semantic segmentation with a customized Dice-logarithmic loss function, a fusion module that combines the annotated labels with the corresponding DCNN predictions, and a boosting algorithm that sequentially updates the sample weights during network training iterations. The micro-optical sectioning tomography (MOST) dataset was employed to assess the effectiveness of the proposed framework. The results indicated that the framework, even when trained with a dataset including some poor-quality human-annotated labels, achieved state-of-the-art performance in the segmentation of somata and vessels in the mouse brain. Thus, the proposed artificial intelligence technique could advance neuroscience research.
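
The abstract does not give the exact form of the customized Dice-logarithmic loss; a plausible sketch using the common -log(Dice) variant:

```python
import torch

def dice_log_loss(pred: torch.Tensor, target: torch.Tensor,
                  eps: float = 1e-6) -> torch.Tensor:
    """pred: probabilities in [0, 1]; target: binary mask of the same shape.
    Assumed formulation: negative log of the soft Dice coefficient."""
    inter = (pred * target).sum()
    dice = (2 * inter + eps) / (pred.sum() + target.sum() + eps)
    return -torch.log(dice)
```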


2015, Vol. 2015, pp. 1-12
Author(s):  
Wei Hu
Yangyu Huang
Li Wei
Fan Zhang
Hengchao Li

Recently, convolutional neural networks have demonstrated excellent performance on various visual tasks, including the classification of common two-dimensional images. In this paper, deep convolutional neural networks are employed to classify hyperspectral images directly in the spectral domain. More specifically, the architecture of the proposed classifier contains five layers: the input layer, a convolutional layer, a max pooling layer, a fully connected layer, and the output layer. These layers are applied to each spectral signature to discriminate it from the others. Experimental results on several hyperspectral image data sets demonstrate that the proposed method can achieve better classification performance than traditional methods such as support vector machines, as well as conventional deep learning-based methods.
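
A minimal PyTorch sketch of such a per-pixel spectral classifier; the band count, class count, and layer widths are illustrative assumptions:

```python
import torch
import torch.nn as nn

n_bands, n_classes = 200, 16   # assumed dataset dimensions

model = nn.Sequential(
    nn.Conv1d(1, 20, kernel_size=11),            # convolutional layer
    nn.Tanh(),
    nn.MaxPool1d(3),                             # max pooling layer
    nn.Flatten(),
    nn.Linear(20 * ((n_bands - 10) // 3), 100),  # fully connected layer
    nn.Tanh(),
    nn.Linear(100, n_classes))                   # output layer

spectrum = torch.randn(4, 1, n_bands)   # a batch of per-pixel spectral signatures
print(model(spectrum).shape)            # torch.Size([4, 16])
```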


Author(s):  
Chunlei Liu
Wenrui Ding
Xin Xia
Yuan Hu
Baochang Zhang
...  

Binarized convolutional neural networks (BCNNs) are widely used to improve the memory and computation efficiency of deep convolutional neural networks (DCNNs) for mobile and AI-chip-based applications. However, current BCNNs cannot fully exploit their corresponding full-precision models, causing a significant performance gap between them. In this paper, we propose rectified binary convolutional networks (RBCNs), a step towards optimized BCNNs, which combine full-precision kernels and feature maps to rectify the binarization process in a unified framework. In particular, we use a GAN to train the 1-bit binary network with the guidance of its corresponding full-precision model, which significantly improves the performance of BCNNs. The rectified convolutional layers are generic and flexible and can be easily incorporated into existing DCNNs such as WideResNets and ResNets. Extensive experiments demonstrate the superior performance of the proposed RBCNs over state-of-the-art BCNNs. In particular, our method shows strong generalization on the object tracking task.
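
As background, a minimal sketch of 1-bit weight binarization with a straight-through estimator, the basic mechanism that BCNNs (and the RBCNs above) build on; the GAN-based rectification itself is beyond a short sketch:

```python
import torch

class BinarizeSTE(torch.autograd.Function):
    @staticmethod
    def forward(ctx, w):
        ctx.save_for_backward(w)
        return torch.sign(w)            # 1-bit weights: {-1, +1}

    @staticmethod
    def backward(ctx, grad_out):
        (w,) = ctx.saved_tensors
        # Straight-through: pass gradients where |w| <= 1, zero elsewhere.
        return grad_out * (w.abs() <= 1).float()

w = torch.randn(8, requires_grad=True)
BinarizeSTE.apply(w).sum().backward()
print(w.grad)                           # non-zero only where |w| <= 1
```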


Author(s):  
Peter K. Koo
Matt Ploenzke

Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to build representations in a distributed manner, making it a challenge to extract learned features that are biologically meaningful, such as sequence motifs. Here we perform a comprehensive analysis on synthetic sequences to investigate the role that CNN activations have in model interpretability. We show that employing an exponential activation in first-layer filters consistently leads to interpretable and robust representations of motifs compared to other commonly used activations. Strikingly, we demonstrate that CNNs with better test performance do not necessarily imply more interpretable representations with attribution methods. We find that CNNs with exponential activations significantly improve the efficacy of recovering biologically meaningful representations with attribution methods. We demonstrate that these results generalise to real DNA sequences across several in vivo datasets. Together, this work demonstrates how a small modification to existing CNNs, i.e. using exponential activations in the first layer, can significantly improve the robustness and interpretability of learned representations, directly in convolutional filters and indirectly with attribution methods.
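
A minimal PyTorch sketch of the proposed modification: an exponential activation on the first convolutional layer of a motif-scanning CNN. Filter count, kernel width, and the downstream head are illustrative assumptions:

```python
import torch
import torch.nn as nn

class ExpActivation(nn.Module):
    def forward(self, x):
        return torch.exp(x)

# Input: one-hot DNA sequence, 4 channels x length 200 (sizes are assumed).
model = nn.Sequential(
    nn.Conv1d(4, 32, kernel_size=19, padding=9),  # first-layer motif filters
    ExpActivation(),     # exponential activation, first layer only
    nn.MaxPool1d(25),
    nn.Flatten(),
    nn.Linear(32 * 8, 1))  # downstream head; illustrative only

seq = torch.randn(1, 4, 200)
print(model(seq).shape)    # torch.Size([1, 1])
```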


2020, Vol. 10 (5), pp. 6191-6194
Author(s):  
A. Alsheikhy
Y. Said
M. Barr

Automatic logo recognition is gaining importance due to its increasing number of applications. Unlike other object recognition tasks, logo recognition is more challenging because of the limited amount of available original data. In this paper, the transfer learning technique was applied to a deep convolutional neural network model to achieve logo recognition with a small computational overhead. The proposed method is based on Densely Connected Convolutional Networks (DenseNet). The experimental results show that, on the FlickrLogos-32 logo recognition dataset, the proposed method performs comparably with state-of-the-art methods while using fewer parameters.
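
A minimal transfer-learning sketch in the spirit of this method; DenseNet-121 and the frozen-feature setup are assumptions, as the abstract does not state the exact network depth or fine-tuning regime:

```python
import torch.nn as nn
from torchvision.models import densenet121, DenseNet121_Weights

# Start from ImageNet-pretrained features and retrain only a new head.
model = densenet121(weights=DenseNet121_Weights.IMAGENET1K_V1)
for p in model.features.parameters():
    p.requires_grad = False                 # reuse the pretrained features as-is
model.classifier = nn.Linear(model.classifier.in_features, 32)  # 32 logo classes
```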


2021
Author(s):  
Johannes Janek Daniel Singer
Katja Seeliger
Tim Christian Kietzmann
Martin N Hebart

Line drawings convey meaning with just a few strokes. Despite strong simplifications, humans can recognize objects depicted in such abstracted images without effort. To what degree do deep convolutional neural networks (CNNs) mirror this human ability to generalize to abstracted object images? While CNNs trained on natural images have been shown to exhibit poor classification performance on drawings, other work has demonstrated highly similar latent representations in these networks for abstracted and natural images. Here, we address these seemingly conflicting findings by analyzing the activation patterns of a CNN trained on natural images across a set of photos, drawings, and sketches of the same objects, and comparing them to human behavior. We find a highly similar representational structure across levels of visual abstraction in early and intermediate layers of the network. This similarity, however, does not translate to later stages in the network, resulting in low classification performance for drawings and sketches. We identify texture bias in CNNs as a contributor to the dissimilar representational structure in late layers and the poor performance on drawings. Finally, by fine-tuning late network layers with object drawings, we show that performance can be largely restored, demonstrating the general utility of features learned on natural images in early and intermediate layers for the recognition of drawings. In conclusion, generalization to abstracted images such as drawings seems to be an emergent property of CNNs trained on natural images, which is, however, suppressed by domain-related biases that arise during later processing stages in the network.
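
A minimal sketch of the late-layer fine-tuning experiment; the backbone (VGG-16) and the freeze/retrain split are assumptions for illustration:

```python
import torch
import torch.nn as nn
from torchvision.models import vgg16, VGG16_Weights

model = vgg16(weights=VGG16_Weights.IMAGENET1K_V1)
for p in model.parameters():
    p.requires_grad = False                  # freeze early/intermediate layers
for p in model.classifier.parameters():
    p.requires_grad = True                   # retrain only the late (fc) layers

trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.Adam(trainable, lr=1e-4)  # then train on object drawings
```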

