Method of determination of the text direction on the image with the use of convolutional neural network

Informatization and communication ◽

10.34219/2078-8320-2020-11-2-96-99 ◽

2020 ◽

pp. 96-99

Author(s):

P.L. Nikolaev

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Deep Neural Network ◽

Binary Classification ◽

Synthetic Data ◽

Real Data ◽

Method Of Determination ◽

Classification Of Images

This article deals with a method of binary classification of images that contain small text. The classification is based on the fact that the text can have two directions: it can be positioned horizontally and read from left to right, or it can be turned 180 degrees, so that the image must be rotated to read the text. This type of text can be found on the covers of a variety of books, so when recognizing covers it is necessary to determine the direction of the text before recognizing the text itself. The article proposes a deep neural network for determining the text position in the context of book cover recognition. The results of training and testing a convolutional neural network on synthetic data, as well as examples of the network operating on real data, are presented.
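The two-class setup described above can be illustrated with a minimal sketch of how upright and rotated training pairs might be built from synthetic images. It assumes NumPy and images stored as 2-D arrays; the function name `make_direction_pairs` and the label convention (0 = upright, 1 = rotated by 180 degrees) are illustrative assumptions, not details taken from the article.

```python
import numpy as np

def make_direction_pairs(images):
    """Return (samples, labels): each image upright (label 0) and rotated 180 degrees (label 1)."""
    samples, labels = [], []
    for img in images:
        samples.append(img)
        labels.append(0)                      # assumed label for upright text
        samples.append(np.rot90(img, 2))      # two 90-degree turns = 180 degrees
        labels.append(1)                      # assumed label for upside-down text
    return np.stack(samples), np.array(labels)

# Toy 2x3 "image" standing in for a rendered text sample.
img = np.arange(6).reshape(2, 3)
x, y = make_direction_pairs([img])
```

Rotating by 180 degrees is equivalent to flipping both axes, so a correctly classified "rotated" sample can be restored by applying the same operation again.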


A Study on Multi Class Classification from Breast Cancer Images using Ensemble Network and Transfer Learning

Recent Patents on Engineering ◽

10.2174/1872212114999201109205421 ◽

2020 ◽

Vol 14 ◽

Author(s):

Lahari Tipirneni ◽

Rizwan Patan

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Convolutional Neural Network ◽

Binary Classification ◽

Disease Diagnosis ◽

Feature Descriptors ◽

Histopathological Images ◽

Viable Approach ◽

Multi Class Classification

Abstract: Breast cancer causes millions of deaths all over the world every year and has become the most common type of cancer in women. Early detection supports a better prognosis and increases the chance of survival. Automating the classification using Computer-Aided Diagnosis (CAD) systems can make the diagnosis less prone to errors. Multi-class and binary classification of breast cancer is a challenging problem. Convolutional neural network architectures extract specific feature descriptors from images, which cannot represent the different types of breast cancer. This leads to false positives in classification, which is undesirable in disease diagnosis. The paper presents an ensemble convolutional neural network for multi-class and binary classification of breast cancer. The feature descriptors from each network are combined to produce the final classification. Histopathological images are taken from the publicly available BreakHis dataset and classified into 8 classes. The proposed ensemble model can perform better than the methods proposed in the literature, and the results show that it could be a viable approach for breast cancer classification.


Detection and Classification of Brain Tumors from MRI Images Using a Deep Convolutional Neural Network Approach

International Journal of Software Innovation ◽

10.4018/ijsi.293269 ◽

2022 ◽

Vol 10 (1) ◽

pp. 0-0

Keyword(s):

Neural Network ◽

Deep Learning ◽

Brain Tumor ◽

Convolutional Neural Network ◽

Binary Classification ◽

Deep Convolutional Neural Network ◽

Neural Network Models ◽

Neural Network Approach ◽

Cancer Disease

A brain tumor is a severe cancer caused by uncontrolled and abnormal division of cells. Timely detection and treatment planning increase the life expectancy of patients. Automated detection and classification of brain tumors is a challenging process that relies on the clinician's knowledge and experience. For this reason, deep learning is one of the most practical and important techniques. Recent progress in the field of deep learning has helped clinicians use medical imaging for the diagnosis of brain tumors. In this paper, we present a comparison of deep convolutional neural network models for automatic binary classification of an MRI image dataset, with the goal of bringing precision tools to health professionals, based on fine-tuned recent versions of DenseNet, Xception, NASNet-A, and VGGNet. The experiments were conducted using an open MRI dataset of 3,762 images. Other performance measures used in the study are the area under precision, recall, and specificity.
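For readers unfamiliar with the performance measures named above, the following is a minimal sketch of how precision, recall, and specificity can be computed for a binary classifier from true and predicted labels. The function name `binary_metrics` and the toy labels are assumptions made for illustration; they are not data or code from the paper.

```python
def binary_metrics(y_true, y_pred):
    """Compute precision, recall and specificity for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    return {
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
    }

# Hypothetical labels: 1 = tumor, 0 = no tumor.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 1, 0, 0, 0, 0]
metrics = binary_metrics(y_true, y_pred)
```

With these toy labels there are 3 true positives, 1 false negative, 2 false positives, and 4 true negatives, so precision is 3/5, recall is 3/4, and specificity is 4/6.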


Binary Classification of Visual Scenes Using Convolutional Neural Network

2019 Novel Intelligent and Leading Emerging Sciences Conference (NILES) ◽

10.1109/niles.2019.8909304 ◽

2019 ◽

Cited By ~ 1

Author(s):

Aya M. Shaaban ◽

Walid Al-Atabany ◽

Nancy M. Salem

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Binary Classification ◽

Visual Scenes


Explainable Deep Learning Reproduces a ‘Professional Eye’ on the Diagnosis of Internal Disorders in Persimmon Fruit

Plant and Cell Physiology ◽

10.1093/pcp/pcaa111 ◽

2020 ◽

Vol 61 (11) ◽

pp. 1967-1973

Author(s):

Takashi Akagi ◽

Masanori Onishi ◽

Kanae Masuda ◽

Ryohei Kuroki ◽

Kohei Baba ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Binary Classification ◽

Network Models ◽

Plant Biology ◽

Neural Network Models ◽

Persimmon Fruit

Abstract: Recent rapid progress in deep neural network techniques has allowed recognition and classification of various objects, often exceeding the performance of the human eye. In plant biology and crop sciences, some deep neural network frameworks have been applied mainly for effective and rapid phenotyping. In this study, beyond simple optimizations of phenotyping, we propose an application of deep neural networks to make an image-based internal disorder diagnosis that is hard even for experts, and to visualize the reasons behind each diagnosis to provide biological interpretations. Here, we exemplified classification of calyx-end cracking in persimmon fruit by using five convolutional neural network models with various layer structures and examined potential analytical options involved in the diagnostic qualities. With 3,173 visible RGB images from the fruit apex side, the neural networks successfully made the binary classification of each degree of disorder, with up to 90% accuracy. Furthermore, feature visualizations, such as Grad-CAM and LRP, visualize the regions of the image that contribute to the diagnosis. They suggest that specific patterns of color unevenness, such as in the fruit peripheral area, can be indexes of calyx-end cracking. These results not only provided novel insights into indexes of fruit internal disorders but also proposed the potential applicability of deep neural networks in plant biology.


Vibration Anomaly Detection using Deep Neural Network and Convolutional Neural Network

International Journal of Modeling and Optimization ◽

10.7763/ijmo.2021.v11.772 ◽

2021 ◽

pp. 19-28

Author(s):

Crina Deac ◽

Gicu Călin Deac ◽

Radu Constantin Parpală ◽

Cicerone Laurentiu Popa ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Deep Neural Network ◽

Time Series Data ◽

Real Data ◽

Series Data ◽

Health State ◽

Optimal Learning ◽

Accuracy And Precision ◽

The Cost

Identifying the “health state” of equipment is the domain of condition monitoring. The paper studies two models, a DNN (Deep Neural Network) and a CNN (Convolutional Neural Network), on an existing dataset provided by Case Western Reserve University for analyzing vibrations in fault diagnosis. Each model is trained on the windowed dataset using an optimal learning rate to minimize the cost function, and is tested by computing the loss, accuracy, and precision of the results; the weights are then saved so that the models can be tested on other real data. The trained model recognizes raw time series data collected by micro electro-mechanical accelerometer sensors and detects anomalies based on earlier time series entries.
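The "windowed dataset" mentioned above can be pictured with a minimal sketch that slices a raw 1-D vibration signal into fixed-length, possibly overlapping windows. It assumes NumPy; the function name `make_windows`, the window size, and the step are illustrative assumptions, since the abstract does not state the values used in the paper.

```python
import numpy as np

def make_windows(signal, window_size, step):
    """Slice a 1-D signal into windows of `window_size` samples taken every `step` samples."""
    signal = np.asarray(signal)
    # Assumes the signal holds at least one full window; otherwise np.stack raises ValueError.
    starts = range(0, len(signal) - window_size + 1, step)
    return np.stack([signal[s:s + window_size] for s in starts])

# Toy signal standing in for accelerometer readings.
signal = np.arange(10)
windows = make_windows(signal, window_size=4, step=2)
```

Each row of `windows` would then serve as one input sample for the network; a step smaller than the window size produces overlapping windows and therefore more samples from the same recording.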


A New Method for Classification of Images Using Convolutional Neural Network Based on Dwt-Svd Perceptual Hash Function

2018 3rd International Conference on Computer Science and Engineering (UBMK) ◽

10.1109/ubmk.2018.8566537 ◽

2018 ◽

Author(s):

Fatih Ozyurt ◽

Huseyin Kutlu ◽

Engin Avci ◽

Derya Avci

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Hash Function ◽

New Method ◽

Perceptual Hash ◽

Classification Of Images


An Accuracy vs. Complexity Comparison of Deep Learning Architectures for the Detection of COVID-19 Disease

Computation ◽

10.3390/computation9010003 ◽

2021 ◽

Vol 9 (1) ◽

pp. 3

Author(s):

Sima Sarv Ahrabi ◽

Michele Scarpiniti ◽

Enzo Baccarelli ◽

Alireza Momenzadeh

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Medical Research ◽

Binary Classification ◽

Lung Infection ◽

X Rays ◽

Training Process ◽

Learning Architectures

In parallel with the vast medical research on clinical treatment of COVID-19, an important step toward bringing the disease fully under control is to carefully monitor patients. Detection of COVID-19 relies mostly on viral tests; however, the study of X-rays is helpful because they are easily available. Various studies employ Deep Learning (DL) paradigms to reinforce radiography-based recognition of lung infection by COVID-19. In this regard, we compare the noteworthy approaches devoted to the binary classification of infected images using DL techniques, and we also propose a variant of a convolutional neural network (CNN) with optimized parameters, which performs very well on a recent COVID-19 dataset. The effectiveness of the proposed model is of considerable importance because of its uncomplicated design, in contrast to the other models presented. In our approach, we randomly set aside several images of the dataset as a hold-out set; the model detects most of the COVID-19 X-rays correctly, with an excellent overall accuracy of 99.8%. In addition, tests on different datasets of diverse characteristics (which, more specifically, are not used in the training process) demonstrate the effectiveness of the proposed approach, with an accuracy of up to 93%.


Classification and Separation of Diffraction Energy on Pre-Migration Seismic Data using Deep Learning

10.5194/egusphere-egu2020-5376 ◽

2020 ◽

Author(s):

Brydon Lowney ◽

Ivan Lokmer ◽

Gareth Shane O'Brien ◽

Christopher Bean

Keyword(s):

Neural Network ◽

Network Architecture ◽

Deep Neural Network ◽

Computational Cost ◽

Synthetic Data ◽

Real Data ◽

Training Data ◽

Separation Technique ◽

Generative Adversarial Network ◽

Analytical Separation

Diffractions are a useful aspect of the seismic wavefield and are often underutilised. By separating the diffractions from the rest of the wavefield they can be used for various applications such as velocity analysis, structural imaging, and wavefront tomography. However, separating the diffractions is a challenging task due to the comparatively low amplitudes of diffractions as well as the overlap between reflection and diffraction energy. Whilst there are existing analytical methods for separation, these act to remove reflections, leaving a volume which contains diffractions and noise. On top of this, analytical separation techniques can be computationally costly and require manual parameterisation. To alleviate these issues, a deep neural network has been trained to automatically identify and separate diffractions from reflections and noise on pre-migration data. Here, a Generative Adversarial Network (GAN) has been trained for the automated separation. This is a type of deep neural network architecture which contains two neural networks that compete against one another. One neural network acts as a generator, creating new data which appears visually similar to the real data, while a second neural network acts as a discriminator, trying to identify whether the given data is real or fake. As the generator improves, so too does the discriminator, giving a deeper understanding of the data. To avoid overfitting to a specific dataset and to improve the cross-data applicability of the network, data from several seismic datasets from geologically distinct locations has been used in training. When a network trained on a single dataset is compared with one trained on several datasets, it is seen that providing additional data improves the separation on both the original and new datasets. The automatic separation technique is then compared with a conventional analytical separation technique, plane-wave destruction (PWD). The computational cost of the GAN separation is far lower than that of PWD: it performs a separation on a 3-D dataset in minutes rather than hours. Although in some complex areas the GAN separation is of a higher quality than the PWD separation, as it does not rely on the dip, there are also areas where PWD outperforms the GAN separation. The GAN may be enhanced by adding more training data as well as by improving the initial separation used to create the training data, which is based around PWD and is thus imperfect and can introduce bias into the network. One possibility is to train the GAN entirely on synthetic data, which allows for a perfect separation because the diffraction points are known; however, the synthetic data must be of sufficient volume for training and of sufficient quality to be applicable to real data.


Convolutional Neural Network for Binary Classification of Chromophobe Renal Cell Carcinoma and Oncocytoma

10.1364/fio.2021.jth5a.80 ◽

2021 ◽

Author(s):

Andrew Cheng ◽

Michael Icaza ◽

Nicholas Judd ◽

Jason Smith ◽

Sushmita Mukherjee ◽

...

Keyword(s):

Neural Network ◽

Renal Cell Carcinoma ◽

Convolutional Neural Network ◽

Cell Carcinoma ◽

Renal Cell ◽

Binary Classification ◽

Chromophobe Renal Cell Carcinoma


Classification of the operating state of rolling bearings with the convolutional neural network with variable dilation factors

Bulletin of the National Technical University «KhPI» Series: Dynamics and Strength of Machines ◽

10.20998/2078-9130.2021.2.249274 ◽

2021 ◽

pp. 106-111

Author(s):

Ruslan Babudzhan ◽

Konstantyn Isaienkov ◽

Oleksii Vodka ◽

Danilo Krasiy ◽

Ivan Zadorozhny ◽

...

Keyword(s):

Neural Network ◽

Mathematical Model ◽

Convolutional Neural Network ◽

Binary Classification ◽

Classification Problems ◽

Rolling Bearings ◽

Data Set ◽

Shaft Rotation ◽

The Mathematical Model

The work describes the processing of rolling bearing operation data and its use in constructing a mathematical model for binary classification of the operating state of bearings, using a convolutional neural network with varying dilation factors in the kernels of its convolutional layers. To classify bearings with defects, we used vibration acceleration data from our own test bench and a publicly available data set. The work also investigated a method for generalizing the classification of bearing signals obtained from fundamentally different experiments and from bearings of different standard sizes. To unify the signals, the following processing method is proposed: select data segments with an offset, move to the frequency domain using the fast Fourier transform, cut off frequencies exceeding 10 times the shaft rotation frequency, restore the signal while retaining 10 shaft rotation periods, scale the resulting signal by dividing it by the orbit diameter of the rolling body, and interpolate the signal at 2048 points. This algorithm also makes it possible to generate a balanced sample for building the mathematical model, which is achieved by varying the step used to split the initial signal. The advantage of this algorithm over the classical methods of oversampling or undersampling is the generation of new objects that refine the statistical parameters of the general population. The signal processing algorithm was used both for binary classification problems within one dataset and for training on one dataset and testing on another. To enlarge the data set for training and testing the mathematical model, the bootstrapping method is used, based on repeated generation of samples using the Monte Carlo method. The quality of the binary classification model was assessed by the proportion of correct answers. The problem is formulated as the minimization of binary cross entropy. The results are presented as graphs of the neural network training process and graphs of the distribution density of the metrics.
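The signal unification steps listed in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: it assumes NumPy, a segment that starts at the chosen offset, truncation to 10 shaft periods before the Fourier transform (the abstract does not fix the exact order), and linear interpolation; the function name `unify_signal` and all parameter names are hypothetical.

```python
import numpy as np

def unify_signal(segment, sample_rate, shaft_freq, orbit_diameter,
                 n_points=2048, max_harmonic=10, n_periods=10):
    """Low-pass a vibration segment via FFT, scale it, and resample to n_points."""
    segment = np.asarray(segment, dtype=float)
    # Keep n_periods shaft rotations (assumed to be done before the FFT).
    n_keep = int(round(n_periods * sample_rate / shaft_freq))
    segment = segment[:n_keep]
    # Move to the frequency domain and zero everything above the cutoff.
    spectrum = np.fft.rfft(segment)
    freqs = np.fft.rfftfreq(len(segment), d=1.0 / sample_rate)
    spectrum[freqs > max_harmonic * shaft_freq] = 0.0
    # Restore the time-domain signal and scale by the orbit diameter.
    filtered = np.fft.irfft(spectrum, n=len(segment))
    scaled = filtered / orbit_diameter
    # Resample onto a fixed grid of n_points samples (linear interpolation).
    old_t = np.linspace(0.0, 1.0, len(scaled))
    new_t = np.linspace(0.0, 1.0, n_points)
    return np.interp(new_t, old_t, scaled)

# Toy signal: a 30 Hz shaft component plus a 3000 Hz component to be removed.
sample_rate, shaft_freq = 12000.0, 30.0
t = np.arange(8000) / sample_rate
raw = np.sin(2 * np.pi * 30.0 * t) + 0.5 * np.sin(2 * np.pi * 3000.0 * t)
unified = unify_signal(raw, sample_rate, shaft_freq, orbit_diameter=2.0)
```

In this toy case the 3000 Hz component lies above the cutoff of 10 times the shaft frequency (300 Hz) and is removed, so the output is close to a pure 30 Hz sine with its amplitude halved by the assumed orbit diameter of 2.0. Shifting the starting offset of `segment` is how different samples would be drawn from the same recording.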
