scholarly journals APPLICATION OF A CONVOLUTIONAL NEURAL NETWORK AND A KOHONEN NETWORK FOR ACCELERATED DETECTION AND RECOGNITION OF OBJECTS IN IMAGES

2020 ◽  
Vol 4 ◽  
pp. 11-18
Author(s):  
Victor Skuratov ◽  
Konstantin Kuzmin ◽  
Igor Nelin ◽  
Mikhail Sedankin

One of the most effective ways to improve the accuracy and speed of algorithms for searching and recognizing objects in images is to pre-select areas of interest in which it is likely to detect objects of interest. To determine areas of interest in a pre-processed radar or satellite image of the underlying surface, the Kohonen network was used. The found areas of interest are sent to the convolutional neural network, which provides the final detection and recognition of objects. The combination of the above methods allows to speed up the process of searching and recognizing objects in images, which is becoming more expedient due to the constantly growing volume of data for analysis. The process of preliminary processing of input data is described, the process of searching and recognizing patterns of aircraft against the underlying surface is presented, and the analysis of the results is carried out. The use of the Kohonen neural network makes it possible to reduce the amount of data analyzed by the convolutional network by 18–125 times, which accordingly accelerates the process of detection and recognition of the object of interest. The size of the parts of the input image fed into the convolutional network, into which the zones of interest are divided, is tied to the image scale and is equal to the size of the largest detectable object, which can significantly reduce the training sample. Application of the presented methods and centering of the object on training images allows accelerating the convolutional network training by more than 5 times and increasing the recognition accuracy by at least 10%, as well as minimizing the required minimum number of layers and neurons of the network by at least halving, respectively increasing its speed

2018 ◽  
Vol 4 (1) ◽  
pp. 297-300 ◽  
Author(s):  
Mattias P. Heinrich ◽  
Maik Stille ◽  
Thorsten M. Buzug

AbstractLow-dose CT has received increasing attention in the recent years and is considered a promising method to reduce the risk of cancer in patients. However, the reduction of the dosage leads to quantum noise in the raw data, which is carried on in the reconstructed images. Two different multilayer convolutional neural network (CNN) architectures for the denoising of CT images are investigated. ResFCN is based on a fully-convolutional network that consists of three blocks of 5×5 convolutions filters and a ResUNet that is trained with 10 convolutional blocks that are arranged in a multi-scale fashion. Both architectures feature a residual connection of the input image to ease learning. Training images are based on realistic simulations by using the XCAT phantom. The ResUNet approach shows the most promising results with a peak signal to noise ratio of 44.00 compared to ResFCN with 41.79.


Informatics ◽  
2020 ◽  
Vol 17 (3) ◽  
pp. 36-43
Author(s):  
D. A. Paulenka ◽  
V. A. Kovalev ◽  
E. V. Snezhko ◽  
V. A. Liauchuk ◽  
E. I. Pechkovsky

The results of the development of hardware and software system (micromodule), which detects and classifies underlying surface images of the Earth are presented. The micromodule can be installed on board of a light unmanned aerial vehicle (drone). The device has the size 5.2×7.4×3.1 cm, the weight52 g, runs on a Raspberry Pi Zero Wireless single-board microcomputer and uses a convolutional neural network based on MobileNetV2 architecture for real-time image classification. When developing the micromodule, the authors aimed to achieve a real-time image classification on inexpensive mobile equipment with low computing power so that the classification quality is  comparable  to  popular  deep  convolutional  network  architectures. The provided information could be useful for engineers and researchers who are developing compact budget mobile systems for processing, analyzing and recognition of images.


2020 ◽  
Vol 65 (6) ◽  
pp. 759-773
Author(s):  
Segu Praveena ◽  
Sohan Pal Singh

AbstractLeukaemia detection and diagnosis in advance is the trending topic in the medical applications for reducing the death toll of patients with acute lymphoblastic leukaemia (ALL). For the detection of ALL, it is essential to analyse the white blood cells (WBCs) for which the blood smear images are employed. This paper proposes a new technique for the segmentation and classification of the acute lymphoblastic leukaemia. The proposed method of automatic leukaemia detection is based on the Deep Convolutional Neural Network (Deep CNN) that is trained using an optimization algorithm, named Grey wolf-based Jaya Optimization Algorithm (GreyJOA), which is developed using the Grey Wolf Optimizer (GWO) and Jaya Optimization Algorithm (JOA) that improves the global convergence. Initially, the input image is applied to pre-processing and the segmentation is performed using the Sparse Fuzzy C-Means (Sparse FCM) clustering algorithm. Then, the features, such as Local Directional Patterns (LDP) and colour histogram-based features, are extracted from the segments of the pre-processed input image. Finally, the extracted features are applied to the Deep CNN for the classification. The experimentation evaluation of the method using the images of the ALL IDB2 database reveals that the proposed method acquired a maximal accuracy, sensitivity, and specificity of 0.9350, 0.9528, and 0.9389, respectively.


Biomolecules ◽  
2021 ◽  
Vol 11 (2) ◽  
pp. 264
Author(s):  
Kaisa Liimatainen ◽  
Riku Huttunen ◽  
Leena Latonen ◽  
Pekka Ruusuvuori

Identifying localization of proteins and their specific subpopulations associated with certain cellular compartments is crucial for understanding protein function and interactions with other macromolecules. Fluorescence microscopy is a powerful method to assess protein localizations, with increasing demand of automated high throughput analysis methods to supplement the technical advancements in high throughput imaging. Here, we study the applicability of deep neural network-based artificial intelligence in classification of protein localization in 13 cellular subcompartments. We use deep learning-based on convolutional neural network and fully convolutional network with similar architectures for the classification task, aiming at achieving accurate classification, but importantly, also comparison of the networks. Our results show that both types of convolutional neural networks perform well in protein localization classification tasks for major cellular organelles. Yet, in this study, the fully convolutional network outperforms the convolutional neural network in classification of images with multiple simultaneous protein localizations. We find that the fully convolutional network, using output visualizing the identified localizations, is a very useful tool for systematic protein localization assessment.


2020 ◽  
Vol 10 (6) ◽  
pp. 2104
Author(s):  
Michał Tomaszewski ◽  
Paweł Michalski ◽  
Jakub Osuchowski

This article presents an analysis of the effectiveness of object detection in digital images with the application of a limited quantity of input. The possibility of using a limited set of learning data was achieved by developing a detailed scenario of the task, which strictly defined the conditions of detector operation in the considered case of a convolutional neural network. The described solution utilizes known architectures of deep neural networks in the process of learning and object detection. The article presents comparisons of results from detecting the most popular deep neural networks while maintaining a limited training set composed of a specific number of selected images from diagnostic video. The analyzed input material was recorded during an inspection flight conducted along high-voltage lines. The object detector was built for a power insulator. The main contribution of the presented papier is the evidence that a limited training set (in our case, just 60 training frames) could be used for object detection, assuming an outdoor scenario with low variability of environmental conditions. The decision of which network will generate the best result for such a limited training set is not a trivial task. Conducted research suggests that the deep neural networks will achieve different levels of effectiveness depending on the amount of training data. The most beneficial results were obtained for two convolutional neural networks: the faster region-convolutional neural network (faster R-CNN) and the region-based fully convolutional network (R-FCN). Faster R-CNN reached the highest AP (average precision) at a level of 0.8 for 60 frames. The R-FCN model gained a worse AP result; however, it can be noted that the relationship between the number of input samples and the obtained results has a significantly lower influence than in the case of other CNN models, which, in the authors’ assessment, is a desired feature in the case of a limited training set.


Cells ◽  
2019 ◽  
Vol 8 (9) ◽  
pp. 1012 ◽  
Author(s):  
Xuan ◽  
Pan ◽  
Zhang ◽  
Liu ◽  
Sun

Aberrant expressions of long non-coding RNAs (lncRNAs) are often associated with diseases and identification of disease-related lncRNAs is helpful for elucidating complex pathogenesis. Recent methods for predicting associations between lncRNAs and diseases integrate their pertinent heterogeneous data. However, they failed to deeply integrate topological information of heterogeneous network comprising lncRNAs, diseases, and miRNAs. We proposed a novel method based on the graph convolutional network and convolutional neural network, referred to as GCNLDA, to infer disease-related lncRNA candidates. The heterogeneous network containing the lncRNA, disease, and miRNA nodes, is constructed firstly. The embedding matrix of a lncRNA-disease node pair was constructed according to various biological premises about lncRNAs, diseases, and miRNAs. A new framework based on a graph convolutional network and a convolutional neural network was developed to learn network and local representations of the lncRNA-disease pair. On the left side of the framework, the autoencoder based on graph convolution deeply integrated topological information within the heterogeneous lncRNA-disease-miRNA network. Moreover, as different node features have discriminative contributions to the association prediction, an attention mechanism at node feature level is constructed. The left side learnt the network representation of the lncRNA-disease pair. The convolutional neural networks on the right side of the framework learnt the local representation of the lncRNA-disease pair by focusing on the similarities, associations, and interactions that are only related to the pair. Compared to several state-of-the-art prediction methods, GCNLDA had superior performance. Case studies on stomach cancer, osteosarcoma, and lung cancer confirmed that GCNLDA effectively discovers the potential lncRNA-disease associations.


2021 ◽  
Vol 2083 (3) ◽  
pp. 032015
Author(s):  
Guanru Zou ◽  
Yulin Luo ◽  
Zefeng Feng

Abstract Convolutional neural network is an important neural network model in deep learning and a common algorithm in computer vision problems. From the perspective of practical application scenarios, this paper studies whether padding in convolutional neural network convolution layer weakens the image edge information. In order to eliminate the background factor, this paper select MNIST dataset as the research object, move the 0-9 digital image to the specified image edge by clearing the white area pixels in the specified direction, and use OpenCV to realize bilinear interpolation to scale the image to ensure that the image dimension is 28×28. The convolution neural network is built to train the original dataset and the processed dataset, and the accuracy rates are 0.9892 and 0.1082 respectively. In the comparative experiment, padding cannot solve the problem of weakening the image edge weight well. In the actual digital recognition scene, it is necessary to consider whether the core recognition area in the input image is at the edge of the image.


2019 ◽  
Vol 1 (01) ◽  
pp. 11-19 ◽  
Author(s):  
James Deva Koresh H

The paper puts forward a real time traffic sign sensing (detection and recognition) frame work for enhancing the vehicles capability in order to have a save driving, path planning. The proposed method utilizes the capsules neural network that outperforms the convolutional neural network by eluding the necessities for the manual effort. The capsules network provides a better resistance for the spatial variance and the high reliability in the sensing of the traffic sign compared to the convolutional network. The evaluation of the capsule network with the Indian traffic data set shows a 15% higher accuracy when compared with the CNN and the RNN.


Author(s):  
Erfan Ghadery ◽  
Sajad Movahedi ◽  
Heshaam Faili ◽  
Azadeh Shakery

The advent of the Internet has caused a significant growth in the number of opinions expressed about products or services on e-commerce websites. Aspect category detection, which is one of the challenging subtasks of aspect-based sentiment analysis, deals with categorizing a given review sentence into a set of predefined categories. Most of the research efforts in this field are devoted to English language reviews, while there are a large number of reviews in other languages that are left unexplored. In this paper, we propose a multilingual method to perform aspect category detection on reviews in different languages, which makes use of a deep convolutional neural network with multilingual word embeddings. To the best of our knowledge, our method is the first attempt at performing aspect category detection on multiple languages simultaneously. Empirical results on the multilingual dataset provided by SemEval workshop demonstrate the effectiveness of the proposed method1.


Sign in / Sign up

Export Citation Format

Share Document