ACM: Adaptive Cross-Modal Graph Convolutional Neural Networks for RGB-D Scene Recognition

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33019176 ◽

2019 ◽

Vol 33 ◽

pp. 9176-9184 ◽

Cited By ~ 3

Author(s):

Yuan Yuan ◽

Zhitong Xiong ◽

Qi Wang

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Feature Learning ◽

Local Features ◽

Scene Recognition ◽

Scene Classification ◽

Deep Convolutional Neural Networks ◽

Scene Representation ◽

Complex Relationships ◽

RGB Image

RGB image classification has improved significantly with the resurgence of deep convolutional neural networks. However, mono-modal deep models for RGB images still have several limitations when applied to RGB-D scene recognition. 1) Images for scene classification usually contain more than one typical object with flexible spatial distribution, so object-level local features should be considered in addition to the global scene representation. 2) Multi-modal features in RGB-D scene classification are still under-utilized, and simply combining modal-specific features suffers from the semantic gaps between modalities. 3) Most existing methods neglect the complex relationships among features from multiple modalities. To address these limitations, this paper proposes an adaptive cross-modal (ACM) feature learning framework based on graph convolutional neural networks for RGB-D scene recognition. To make better use of modal-specific cues, the approach mines the intra-modality relationships among the selected local features from one modality. To leverage multi-modal knowledge more effectively, it models the inter-modality relationships between the two modalities through a cross-modal graph (CMG). We evaluate the proposed method on two public RGB-D scene classification datasets, SUN-RGBD and NYUD V2, where it achieves state-of-the-art performance.
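The abstract does not spell out the graph convolution itself, so the following is only a minimal sketch of one widely used propagation rule, ReLU(A_norm H W) with a symmetrically normalized adjacency matrix. The toy graph, the node-to-modality mapping, and all function names are hypothetical and are not taken from the paper.

```python
import math

def normalize_adjacency(adj):
    """Return D^(-1/2) (A + I) D^(-1/2) for a square adjacency matrix."""
    n = len(adj)
    # Add self-loops so each node keeps part of its own feature.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def gcn_layer(adj, features, weights):
    """One graph-convolution step: ReLU(A_norm @ H @ W)."""
    propagated = matmul(normalize_adjacency(adj), features)
    projected = matmul(propagated, weights)
    return [[max(0.0, v) for v in row] for row in projected]

# Hypothetical toy graph: nodes 0-1 stand for RGB local features, nodes 2-3 for depth features.
# Edges 0-1 and 2-3 are intra-modality; edges 0-2 and 1-3 are cross-modal.
adj = [
    [0, 1, 1, 0],
    [1, 0, 0, 1],
    [1, 0, 0, 1],
    [0, 1, 1, 0],
]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
weights = [[1.0, 0.0], [0.0, 1.0]]  # identity projection, chosen only for readability
out = gcn_layer(adj, features, weights)
```

With the identity projection, each output row is simply the average of a node's own feature and its neighbours' features, which makes the mixing of intra-modality and cross-modal information easy to inspect by hand.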


Illumination-robust face recognition based on deep convolutional neural networks architectures

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i2.pp1015-1027 ◽

2020 ◽

Vol 18 (2) ◽

pp. 1015

Author(s):

Ridha Ilyas Bendjillali ◽

Mohammed Beladgham ◽

Khaled Merit ◽

Abdelmalik Taleb-Ahmed

Keyword(s):

Neural Networks ◽

Face Recognition ◽

Convolutional Neural Networks ◽

Feature Learning ◽

Histogram Equalization ◽

Detection Algorithm ◽

Deep Convolutional Neural Networks ◽

Biometric Technology ◽

Equalization Algorithm ◽

Robust Face Recognition

Over the last decade, facial recognition has been considered one of the most important research fields in biometric technology. In this paper, we present a Face Recognition (FR) system divided into three steps: face detection with the Viola-Jones algorithm, facial image enhancement using the Modified Contrast Limited Adaptive Histogram Equalization algorithm (M-CLAHE), and feature learning for classification. For feature learning followed by classification, we used the VGG16, ResNet50 and Inception-v3 Convolutional Neural Network (CNN) architectures. Our experiments were performed on the Extended Yale B database and the CMU PIE face database. Finally, a comparison with other methods on both databases shows the robustness and effectiveness of the proposed approach: the Inception-v3 architecture achieved recognition rates of 99.44% and 99.89%, respectively.
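To illustrate the enhancement step, here is a minimal sketch of plain global histogram equalization. It is not M-CLAHE, which additionally clips the histogram and works on image tiles; the function name and the toy pixel values are assumptions made for this example.

```python
def equalize_histogram(pixels, levels=256):
    """Global histogram equalization of a flat list of integer gray levels."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    # Cumulative count of pixels at or below each gray level.
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)
    cdf_min = next(c for c in cdf if c > 0)
    total = len(pixels)
    if total == cdf_min:
        return list(pixels)  # constant image: nothing to stretch
    scale = (levels - 1) / (total - cdf_min)
    # Map each gray level so the darkest used level goes to 0 and the brightest to levels - 1.
    lookup = [round((c - cdf_min) * scale) if c > 0 else 0 for c in cdf]
    return [lookup[p] for p in pixels]

# Hypothetical low-contrast patch with gray levels squeezed into 50..53.
dark = [50, 50, 51, 52, 52, 53, 53, 53]
equalized = equalize_histogram(dark)
```

The narrow input range is stretched over the full 0 to 255 range while the brightness ordering of the pixels is preserved, which is the property that makes equalization useful under poor illumination.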


Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs71114680 ◽

2015 ◽

Vol 7 (11) ◽

pp. 14680-14707 ◽

Cited By ~ 513

Author(s):

Fan Hu ◽

Gui-Song Xia ◽

Jingwen Hu ◽

Liangpei Zhang

Keyword(s):

Remote Sensing ◽

Neural Networks ◽

High Resolution ◽

Convolutional Neural Networks ◽

Scene Classification ◽

Deep Convolutional Neural Networks ◽

Remote Sensing Imagery


PulseNetOne: Fast Unsupervised Pruning of Convolutional Neural Networks for Remote Sensing

Remote Sensing ◽

10.3390/rs12071092 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1092

Author(s):

David Browne ◽

Michael Giering ◽

Steven Prestwich

Keyword(s):

Remote Sensing ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Data Augmentation ◽

Recognition Task ◽

Scene Recognition ◽

Training Data ◽

Learning Approach ◽

Scene Classification

Scene classification is an important aspect of image/video understanding and segmentation. However, remote-sensing scene classification is a challenging image recognition task, partly because limited training data causes deep-learning Convolutional Neural Networks (CNNs) to overfit. Another difficulty is that images often have very different scales and orientations (viewing angles). Yet another is that the resulting networks may be very large, again making them prone to overfitting and unsuitable for deployment on memory- and energy-limited devices. We propose an efficient deep-learning approach to tackle these problems. We use transfer learning to compensate for the lack of data, and data augmentation to tackle varying scale and orientation. To reduce network size, we use a novel unsupervised learning approach based on k-means clustering that is applied to all parts of the network. By contrast, most network reduction methods use computationally expensive supervised learning and apply only to the convolutional or the fully connected layers, but not both. In experiments, we set new standards in classification accuracy on four remote-sensing and two scene-recognition image datasets.
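The abstract gives no details of how k-means drives the pruning, so the sketch below shows only one plausible reading: cluster the filters of a layer and keep the filter nearest to each centroid. This is an assumption for illustration, not the PulseNetOne procedure, and every name and value in it is hypothetical.

```python
def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iterations=20):
    """Plain Lloyd's k-means with a deterministic init (the first k points)."""
    centroids = [list(p) for p in points[:k]]
    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assign every point to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assignment

def select_filters(filters, k):
    """Return the indices of the filters closest to each cluster centroid."""
    centroids, assignment = kmeans(filters, k)
    keep = []
    for c in range(k):
        members = [i for i, a in enumerate(assignment) if a == c]
        if members:
            keep.append(min(members, key=lambda i: squared_distance(filters[i], centroids[c])))
    return sorted(keep)

# Hypothetical flattened filters: two groups of near-duplicates.
filters = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.2, 0.8, 0.0],
    [1.1, 0.0, 0.1],
    [0.1, 0.9, 0.0],
]
kept = select_filters(filters, k=2)
```

No labels are needed at any point, which is what makes a clustering-based reduction unsupervised; the same selection could in principle be applied to the rows of a fully connected layer as well as to convolutional filters.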


NROI based feature learning for automated tumor stage classification of pulmonary lung nodules using deep convolutional neural networks

Journal of King Saud University - Computer and Information Sciences ◽

10.1016/j.jksuci.2019.11.013 ◽

2019 ◽

Cited By ~ 2

Author(s):

Supriya Suresh ◽

Subaji Mohan

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Feature Learning ◽

Tumor Stage ◽

Lung Nodules ◽

Deep Convolutional Neural Networks ◽

Stage Classification


Residential scene classification for gridded population sampling in developing countries using deep convolutional neural networks on satellite imagery

International Journal of Health Geographics ◽

10.1186/s12942-018-0132-1 ◽

2018 ◽

Vol 17 (1) ◽

Cited By ~ 12

Author(s):

Robert F. Chew ◽

Safaa Amer ◽

Kasey Jones ◽

Jennifer Unangst ◽

James Cajka ◽

...

Keyword(s):

Neural Networks ◽

Developing Countries ◽

Convolutional Neural Networks ◽

Satellite Imagery ◽

Scene Classification ◽

Deep Convolutional Neural Networks ◽

Population Sampling


Local features and global shape information in object classification by deep convolutional neural networks

Vision Research ◽

10.1016/j.visres.2020.04.003 ◽

2020 ◽

Vol 172 ◽

pp. 46-61 ◽

Cited By ~ 1

Author(s):

Nicholas Baker ◽

Hongjing Lu ◽

Gennady Erlikhman ◽

Philip J. Kellman

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Object Classification ◽

Local Features ◽

Deep Convolutional Neural Networks ◽

Shape Information ◽

Global Shape


Improving deep convolutional neural networks with unsupervised feature learning

2015 IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2015.7351206 ◽

2015 ◽

Cited By ~ 17

Author(s):

Kien Nguyen ◽

Clinton Fookes ◽

Sridha Sridharan

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Feature Learning ◽

Unsupervised Feature Learning ◽

Deep Convolutional Neural Networks


A novel MapReduce-based deep convolutional neural network algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201790 ◽

2021 ◽

pp. 1-13

Author(s):

Xiang-Min Liu ◽

Jian Hu ◽

Deborah Simon Mwakapesa ◽

Y.A. Nanehkaran ◽

Yi-Min Mao ◽

...

Keyword(s):

Neural Networks ◽

Big Data ◽

Convolutional Neural Networks ◽

Large Scale ◽

Feature Learning ◽

Deep Convolutional Neural Networks ◽

Network Training ◽

Load Rate ◽

Data Environment ◽

Neural Networks Optimization

Deep convolutional neural networks (DCNNs), with their complex network structure and powerful feature learning and feature expression capabilities, have achieved remarkable success in many large-scale recognition tasks. However, given the demands on memory overhead and response time and the increasing scale of data, DCNNs face three non-trivial challenges in a big data environment: excessive network parameters, slow convergence, and inefficient parallelism. To tackle these three problems, this paper develops a deep convolutional neural network optimization algorithm (PDCNNO) in the MapReduce framework. The proposed method first prunes the network to obtain a compressed network and thereby reduce redundant parameters. Next, a conjugate gradient method based on a modified secant equation (CGMSE) is developed in the Map phase to further accelerate the convergence of the network. Finally, a load balancing strategy based on regulating the load rate (LBRLA) is proposed in the Reduce phase to quickly achieve equal grouping of data and thus improve the parallel performance of the system. We compared the PDCNNO algorithm with other algorithms on three datasets: SVHN, EMNIST Digits, and ILSVRC2012. The experimental results show that our algorithm not only reduces the space and time overhead of network training but also obtains a good speed-up ratio in a big data environment.
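The goal of equal grouping in the Reduce phase can be illustrated with a generic greedy heuristic: hand the largest remaining data block to the currently least-loaded group. This is a standard longest-first scheme swapped in for illustration, not the paper's LBRLA strategy; the block sizes and function name are hypothetical.

```python
import heapq

def balance_groups(sizes, n_groups):
    """Greedy longest-first assignment of data blocks to the least-loaded group."""
    heap = [(0, g) for g in range(n_groups)]  # (current load, group id)
    heapq.heapify(heap)
    groups = [[] for _ in range(n_groups)]
    for size in sorted(sizes, reverse=True):
        load, g = heapq.heappop(heap)  # least-loaded group so far
        groups[g].append(size)
        heapq.heappush(heap, (load + size, g))
    return groups

# Hypothetical block sizes to spread over two reducers.
blocks = [7, 5, 4, 3, 3, 2]
groups = balance_groups(blocks, n_groups=2)
loads = [sum(g) for g in groups]
```

Keeping the per-group loads close matters because the slowest reducer determines when the whole Reduce phase finishes.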
