Neural network feature learning based on image self-encoding

With the rapid development of information technology and the arrival of the era of big data, people’s access to information is increasingly relying on information such as images. Today, image data are showing an increasing trend in the form of an index. How to use deep learning models to extract valuable information from massive data is very important. In the face of such a situation, people cannot accurately and timely find out the information they need. Therefore, the research on image retrieval technology is very important. Image retrieval is an important technology in the field of computer vision image processing. It realizes fast and accurate query of similar images in image database. The excellent feature representation not only can represent the category information of the image but also capture the relevant semantic information of the image. If the neural network feature learning expression is combined with the image retrieval field, it will definitely improve the application of image retrieval technology. To solve the above problems, this article studies the problems encountered in deep learning neural network feature learning based on image self-encoding and discusses its feature expression in the field of image retrieval. By adding the spatial relationship information obtained by image self-encoding in the neural network training process, the feature expression ability of the selected neural network is improved, and the neural network feature learning based on image coding is successfully applied to the popular field of image retrieval.

Download Full-text

GENERATING SYNTHETIC TRAINING DATA FOR OBJECT DETECTION USING MULTI-TASK GENERATIVE ADVERSARIAL NETWORKS

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-v-2-2020-443-2020 ◽

2020 ◽

Vol V-2-2020 ◽

pp. 443-449

Author(s):

Y. Lin ◽

K. Suzuki ◽

H. Takeda ◽

K. Nakamura

Keyword(s):

Neural Network ◽

Deep Learning ◽

Rapid Development ◽

Real Data ◽

Training Data ◽

Generative Adversarial Networks ◽

Great Success ◽

Traffic Signs ◽

Adversarial Networks ◽

The Neural Network

Abstract. Nowadays, digitizing roadside objects, for instance traffic signs, is a necessary step for generating High Definition Maps (HD Map) which remains as an open challenge. Rapid development of deep learning technology using Convolutional Neural Networks (CNN) has achieved great success in computer vision field in recent years. However, performance of most deep learning algorithms highly depends on the quality of training data. Collecting the desired training dataset is a difficult task, especially for roadside objects due to their imbalanced numbers along roadside. Although, training the neural network using synthetic data have been proposed. The distribution gap between synthetic and real data still exists and could aggravate the performance. We propose to transfer the style between synthetic and real data using Multi-Task Generative Adversarial Networks (SYN-MTGAN) before training the neural network which conducts the detection of roadside objects. Experiments focusing on traffic signs show that our proposed method can reach mAP of 0.77 and is able to improve detection performance for objects whose training samples are difficult to collect.

Download Full-text

Deep Learning for Plant Classification and Content-Based Image Retrieval

Cybernetics and Information Technologies ◽

10.2478/cait-2019-0005 ◽

2019 ◽

Vol 19 (1) ◽

pp. 88-100

Author(s):

Bálint Pál Gyires-Tóth ◽

Márton Osváth ◽

Dávid Papp ◽

Gábor Szűcs

Keyword(s):

Neural Network ◽

Deep Learning ◽

Image Retrieval ◽

Search Task ◽

Feature Learning ◽

Content Based Image Retrieval ◽

Network Architectures ◽

Average Precision ◽

Test Set ◽

Fully Connected

Abstract The main goal of the present research is to classify images of plants to species with deep learning. We used convolutional neural network architectures for feature learning and fully connected layers with logsoftmax output for classification. Pretrained models on ImageNet were used, and transfer learning was applied. In the current research image sets published in the scope of the PlantCLEF 2015 challenge were used. The proposed system surpasses the results of all top competitors of the challenge by 8% and 7% at observation and image levels, respectively. Our secondary goal was to satisfy the users’ needs in content-based image retrieval to give relevant hits during species search task. We optimized the length of the returned lists in order to maximize MAP (Mean Average Precision), which is critical to the performance of image retrieval. Thus, we achieved more than 50% improvement of MAP in the test set compared to the baseline.

Download Full-text

A Survey on Image Retrieval Techniques

Intelligent Systems and Computer Technology - Advances in Parallel Computing ◽

10.3233/apc200174 ◽

2020 ◽

Author(s):

Lalitha K ◽

Murugavalli S

Keyword(s):

Neural Network ◽

Deep Learning ◽

Image Retrieval ◽

Medical Tourism ◽

Digital Images ◽

Rapid Development ◽

Vital Role ◽

Content Based Image Retrieval ◽

Processing Methods ◽

Exponential Order

In the recent years, with the rapid development of technologies production, digital images tend to grow in exponential order. This makes the challenge in retrieving image. Image retrieval plays a vital role in various fields such as Medical, Tourism promotion, Forensics Lab etc. In order to handle these rapid growth efficient images retrieval methods need to be developed. This paper is presented with the survey of different Image retrieval techniques which used various techniques from visual features to the latest deep learning with Convolutional Neural Network(CNN) which become the best approach for image retrieval with number of layers applicable for large database. This survey concretes the way to understand the processing methods of Content Based Image Retrieval (CBIR).

Download Full-text

An Anatomy of a Hybrid Color Descriptor with a Neural Network Model to Enhance the Retrieval Accuracy of an Image Retrieval System

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666191122113801 ◽

2019 ◽

Vol 13 ◽

Author(s):

Shikha Bhardwaj ◽

Gitanjali Pandove ◽

Pawan Kumar Dahiya

Keyword(s):

Neural Network ◽

Deep Learning ◽

Image Retrieval ◽

Hybrid System ◽

Back Propagation ◽

Back Propagation Neural Network ◽

Retrieval Accuracy ◽

Color Descriptor ◽

Benchmark Datasets ◽

Color Moment

Background: In order to retrieve a particular image from vast repository of images, an efficient system is required and such an eminent system is well-known by the name Content-based image retrieval (CBIR) system. Color is indeed an important attribute of an image and the proposed system consist of a hybrid color descriptor which is used for color feature extraction. Deep learning, has gained a prominent importance in the current era. So, the performance of this fusion based color descriptor is also analyzed in the presence of Deep learning classifiers. Method: This paper describes a comparative experimental analysis on various color descriptors and the best two are chosen to form an efficient color based hybrid system denoted as combined color moment-color autocorrelogram (Co-CMCAC). Then, to increase the retrieval accuracy of the hybrid system, a Cascade forward back propagation neural network (CFBPNN) is used. The classification accuracy obtained by using CFBPNN is also compared to Patternnet neural network. Results: The results of the hybrid color descriptor depict that the proposed system has superior results of the order of 95.4%, 88.2%, 84.4% and 96.05% on Corel-1K, Corel-5K, Corel-10K and Oxford flower benchmark datasets respectively as compared to many state-of-the-art related techniques. Conclusion: This paper depict an experimental and analytical analysis on different color feature descriptors namely, Color moment (CM), Color auto-correlogram (CAC), Color histogram (CH), Color coherence vector (CCV) and Dominant color descriptor (DCD). The proposed hybrid color descriptor (Co-CMCAC) is utilized for the withdrawal of color features with Cascade forward back propagation neural network (CFBPNN) is used as a classifier on four benchmark datasets namely Corel-1K, Corel-5K and Corel-10K and Oxford flower.

Download Full-text

Augmented Reality Maintenance Assistant Using YOLOv5

Applied Sciences ◽

10.3390/app11114758 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4758

Author(s):

Ana Malta ◽

Mateus Mendes ◽

Torres Farinha

Keyword(s):

Neural Network ◽

Deep Learning ◽

Object Recognition ◽

Augmented Reality ◽

Real Time ◽

Recognition System ◽

High Accuracy ◽

Video Streams ◽

The Neural Network ◽

Deep Learning Neural Network

Maintenance professionals and other technical staff regularly need to learn to identify new parts in car engines and other equipment. The present work proposes a model of a task assistant based on a deep learning neural network. A YOLOv5 network is used for recognizing some of the constituent parts of an automobile. A dataset of car engine images was created and eight car parts were marked in the images. Then, the neural network was trained to detect each part. The results show that YOLOv5s is able to successfully detect the parts in real time video streams, with high accuracy, thus being useful as an aid to train professionals learning to deal with new equipment using augmented reality. The architecture of an object recognition system using augmented reality glasses is also designed.

Download Full-text

End-to-End Autonomous Driving Through Dueling Double Deep Q-Network

Automotive Innovation ◽

10.1007/s42154-021-00151-3 ◽

2021 ◽

Author(s):

Baiyu Peng ◽

Qi Sun ◽

Shengbo Eben Li ◽

Dongsuk Kum ◽

Yuming Yin ◽

...

Keyword(s):

Neural Network ◽

State Space ◽

Learning Algorithm ◽

Rapid Development ◽

Autonomous Driving ◽

Saliency Map ◽

Hierarchical Architecture ◽

Link Type ◽

The Neural Network ◽

End To End

AbstractRecent years have seen the rapid development of autonomous driving systems, which are typically designed in a hierarchical architecture or an end-to-end architecture. The hierarchical architecture is always complicated and hard to design, while the end-to-end architecture is more promising due to its simple structure. This paper puts forward an end-to-end autonomous driving method through a deep reinforcement learning algorithm Dueling Double Deep Q-Network, making it possible for the vehicle to learn end-to-end driving by itself. This paper firstly proposes an architecture for the end-to-end lane-keeping task. Unlike the traditional image-only state space, the presented state space is composed of both camera images and vehicle motion information. Then corresponding dueling neural network structure is introduced, which reduces the variance and improves sampling efficiency. Thirdly, the proposed method is applied to The Open Racing Car Simulator (TORCS) to demonstrate its great performance, where it surpasses human drivers. Finally, the saliency map of the neural network is visualized, which indicates the trained network drives by observing the lane lines. A video for the presented work is available online, https://youtu.be/76ciJmIHMD8 or https://v.youku.com/v_show/id_XNDM4ODc0MTM4NA==.html.

Download Full-text

Neutrosophy for physiological data compression: in particular by neural nets using deep learning

10.54216/ijns.010203 ◽

2020 ◽

pp. 74-80

Author(s):

Philippe Schweizer ◽

Keyword(s):

Neural Network ◽

Deep Learning ◽

Boundary Surface ◽

Human Perception ◽

Neural Nets ◽

Physiological Data ◽

Compression Process ◽

The Neural Network ◽

Algorithmic Solution ◽

Algorithmic Techniques

We would like to show the small distance in neutropsophy applications in sciences and humanities, has both finally consider as a terminal user a human. The pace of data production continues to grow, leading to increased needs for efficient storage and transmission. Indeed, the consumption of this information is preferably made on mobile terminals using connections invoiced to the user and having only reduced storage capacities. Deep learning neural networks have recently exceeded the compression rates of algorithmic techniques for text. We believe that they can also significantly challenge classical methods for both audio and visual data (images and videos). To obtain the best physiological compression, i.e. the highest compression ratio because it comes closest to the specificity of human perception, we propose using a neutrosophical representation of the information for the entire compression-decompression cycle. Such a representation consists for each elementary information to add to it a simple neutrosophical number which informs the neural network about its characteristics relative to compression during this treatment. Such a neutrosophical number is in fact a triplet (t,i,f) representing here the belonging of the element to the three constituent components of information in compression; 1° t = the true significant part to be preserved, 2° i = the inderterminated redundant part or noise to be eliminated in compression and 3° f = the false artifacts being produced in the compression process (to be compensated). The complexity of human perception and the subtle niches of its defects that one seeks to exploit requires a detailed and complex mapping that a neural network can produce better than any other algorithmic solution, and networks with deep learning have proven their ability to produce a detailed boundary surface in classifiers.

Download Full-text

Is the Skip Connection Provable to Reform the Neural Network Loss Landscape?

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/387 ◽

2020 ◽

Author(s):

Lifu Wang ◽

Bo Shen ◽

Ning Zhao ◽

Zhiyuan Zhang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Level Sets ◽

Theoretical Explanation ◽

Learning Ability ◽

Local Minima ◽

Global Minima ◽

Residual Network ◽

The Neural Network ◽

Hidden Nodes

The residual network is now one of the most effective structures in deep learning, which utilizes the skip connections to “guarantee" the performance will not get worse. However, the non-convexity of the neural network makes it unclear whether the skip connections do provably improve the learning ability since the nonlinearity may create many local minima. In some previous works [Freeman and Bruna, 2016], it is shown that despite the non-convexity, the loss landscape of the two-layer ReLU network has good properties when the number m of hidden nodes is very large. In this paper, we follow this line to study the topology (sub-level sets) of the loss landscape of deep ReLU neural networks with a skip connection and theoretically prove that the skip connection network inherits the good properties of the two-layer network and skip connections can help to control the connectedness of the sub-level sets, such that any local minima worse than the global minima of some two-layer ReLU network will be very “shallow". The “depth" of these local minima are at most O(m^(η-1)/n), where n is the input dimension, η<1. This provides a theoretical explanation for the effectiveness of the skip connection in deep learning.

Download Full-text

Sub3DNet1.0: a deep-learning model for regional-scale 3D subsurface structure mapping

Geoscientific Model Development ◽

10.5194/gmd-14-3421-2021 ◽

2021 ◽

Vol 14 (6) ◽

pp. 3421-3435

Author(s):

Zhenjiao Jiang ◽

Dirk Mallants ◽

Lei Gao ◽

Tim Munday ◽

Gregoire Mariethoz ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Land Surface ◽

Regional Scale ◽

Learning Model ◽

Generic Structure ◽

Squared Error ◽

The Neural Network ◽

Deep Learning Model ◽

Surface Observations

Abstract. This study introduces an efficient deep-learning model based on convolutional neural networks with joint autoencoder and adversarial structures for 3D subsurface mapping from 2D surface observations. The method was applied to delineate paleovalleys in an Australian desert landscape. The neural network was trained on a 6400 km2 domain by using a land surface topography as 2D input and an airborne electromagnetic (AEM)-derived probability map of paleovalley presence as 3D output. The trained neural network has a squared error <0.10 across 99 % of the training domain and produces a squared error <0.10 across 93 % of the validation domain, demonstrating that it is reliable in reconstructing 3D paleovalley patterns beyond the training area. Due to its generic structure, the neural network structure designed in this study and the training algorithm have broad application potential to construct 3D geological features (e.g., ore bodies, aquifer) from 2D land surface observations.

Download Full-text

Pattern-Based Deep Learning Method to Extract Information from the Log Dataset

Journal of Circuits System and Computers ◽

10.1142/s0218126621502960 ◽

2021 ◽

pp. 2150296

Author(s):

Xi Li ◽

Ting Wang ◽

Shexiong Wang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Data Sets ◽

Learning Method ◽

Statistical Features ◽

Log Data ◽

The Neural Network ◽

Testing Data ◽

Extract Information

It draws researchers’ attentions how to make use of the log data effectively without paying much for storing them. In this paper, we propose pattern-based deep learning method to extract the features from log datasets and to facilitate its further use at the reasonable expense of the storage performances. By taking the advantages of the neural network and thoughts to combine statistical features with experts’ knowledge, there are satisfactory results in the experiments on some specified datasets and on the routine systems that our group maintains. Processed on testing data sets, the model is 5%, at least, more likely to outperform its competitors in accuracy perspective. More importantly, its schema unveils a new way to mingle experts’ experiences with statistical log parser.

Download Full-text