Rider and Sunflower optimization-driven neural network for image classification

2021 ◽  
Vol 19 (1-2) ◽  
pp. 41-61
Author(s):  
Hanumantha Rao Nadendla ◽  
A. Srikrishna ◽  
K. Gangadhara Rao

Image classification is the classical issue in computer vision, machine learning, and image processing. The image classification is measured by differentiating the image into the prescribed category based on the content of the vision. In this paper, a novel classifier named RideSFO-NN is developed for image classification. The proposed method performs the image classification by undergoing two steps, namely feature extraction and classification. Initially, the images from various sources are provided to the proposed Weighted Shape-Size Pattern Spectra for pattern analysis. From the pattern analysis, the significant features are obtained for the classification. Here, the proposed Weighted Shape-Size Pattern Spectra is designed by modifying the gray-scale decomposition with Weight-Shape decomposition. Then, the classification is done based on Neural Network (NN) classifier, which is trained using an optimization approach. The optimization will be done by the proposed Ride Sunflower optimization (RideSFO) algorithm, which is the integration of Rider optimization algorithm (ROA), and Sunflower optimization algorithm (SFO). Finally, the image classification performance is evaluated using RideSFO-NN based on sensitivity, specificity, and accuracy. The developed RideSFO-NN method achieves the maximal accuracy of 94%, maximal sensitivity of 93.87%, and maximal specificity of 90.52% based on K-Fold.

2022 ◽  
Vol 12 (1) ◽  
Author(s):  
Saritha Balasubramaniyan ◽  
Vijay Jeyakumar ◽  
Deepa Subramaniam Nachimuthu

AbstractDiabetes is a serious metabolic disorder with high rate of prevalence worldwide; the disease has the characteristics of improper secretion of insulin in pancreas that results in high glucose level in blood. The disease is also associated with other complications such as cardiovascular disease, retinopathy, neuropathy and nephropathy. The development of computer aided decision support system is inevitable field of research for disease diagnosis that will assist clinicians for the early prognosis of diabetes and to facilitate necessary treatment at the earliest. In this research study, a Traditional Chinese Medicine based diabetes diagnosis is presented based on analyzing the extracted features of panoramic tongue images such as color, texture, shape, tooth markings and fur. The feature extraction is done by Convolutional Neural Network (CNN)—ResNet 50 architecture, and the classification is performed by the proposed Deep Radial Basis Function Neural Network (RBFNN) algorithm based on auto encoder learning mechanism. The proposed model is simulated in MATLAB environment and evaluated with performance metrics—accuracy, precision, sensitivity, specificity, F1 score, error rate, and receiver operating characteristics (ROC). On comparing with existing models, the proposed CNN based Deep RBFNN machine learning classifier model outperformed with better classification performance and proving its effectiveness.


Sensors ◽  
2019 ◽  
Vol 19 (1) ◽  
pp. 210 ◽  
Author(s):  
Zied Tayeb ◽  
Juri Fedjaev ◽  
Nejla Ghaboosi ◽  
Christoph Richter ◽  
Lukas Everding ◽  
...  

Non-invasive, electroencephalography (EEG)-based brain-computer interfaces (BCIs) on motor imagery movements translate the subject’s motor intention into control signals through classifying the EEG patterns caused by different imagination tasks, e.g., hand movements. This type of BCI has been widely studied and used as an alternative mode of communication and environmental control for disabled patients, such as those suffering from a brainstem stroke or a spinal cord injury (SCI). Notwithstanding the success of traditional machine learning methods in classifying EEG signals, these methods still rely on hand-crafted features. The extraction of such features is a difficult task due to the high non-stationarity of EEG signals, which is a major cause by the stagnating progress in classification performance. Remarkable advances in deep learning methods allow end-to-end learning without any feature engineering, which could benefit BCI motor imagery applications. We developed three deep learning models: (1) A long short-term memory (LSTM); (2) a spectrogram-based convolutional neural network model (CNN); and (3) a recurrent convolutional neural network (RCNN), for decoding motor imagery movements directly from raw EEG signals without (any manual) feature engineering. Results were evaluated on our own publicly available, EEG data collected from 20 subjects and on an existing dataset known as 2b EEG dataset from “BCI Competition IV”. Overall, better classification performance was achieved with deep learning models compared to state-of-the art machine learning techniques, which could chart a route ahead for developing new robust techniques for EEG signal decoding. We underpin this point by demonstrating the successful real-time control of a robotic arm using our CNN based BCI.


2021 ◽  
Vol 2021 ◽  
pp. 1-6
Author(s):  
Qiang Cai ◽  
Fenghai Li ◽  
Yifan Chen ◽  
Haisheng Li ◽  
Jian Cao ◽  
...  

Along with the strong representation of the convolutional neural network (CNN), image classification tasks have achieved considerable progress. However, majority of works focus on designing complicated and redundant architectures for extracting informative features to improve classification performance. In this study, we concentrate on rectifying the incomplete outputs of CNN. To be concrete, we propose an innovative image classification method based on Label Rectification Learning (LRL) through kernel extreme learning machine (KELM). It mainly consists of two steps: (1) preclassification, extracting incomplete labels through a pretrained CNN, and (2) label rectification, rectifying the generated incomplete labels by the KELM to obtain the rectified labels. Experiments conducted on publicly available datasets demonstrate the effectiveness of our method. Notably, our method is extensible which can be easily integrated with off-the-shelf networks for improving performance.


Geophysics ◽  
2020 ◽  
Vol 85 (6) ◽  
pp. R477-R492 ◽  
Author(s):  
Bingbing Sun ◽  
Tariq Alkhalifah

Full-waveform inversion (FWI) is a nonlinear optimization problem, and a typical optimization algorithm such as the nonlinear conjugate gradient or limited-memory Broyden-Fletcher-Goldfarb-Shanno (LBFGS) would iteratively update the model mainly along the gradient-descent direction of the misfit function or a slight modification of it. Based on the concept of meta-learning, rather than using a hand-designed optimization algorithm, we have trained the machine (represented by a neural network) to learn an optimization algorithm, entitled the “ML-descent,” and apply it in FWI. Using a recurrent neural network (RNN), we use the gradient of the misfit function as the input, and the hidden states in the RNN incorporate the history information of the gradient similar to an LBFGS algorithm. However, unlike the fixed form of the LBFGS algorithm, the machine-learning (ML) version evolves in response to the gradient. The loss function for training is formulated as a weighted summation of the L2 norm of the data residuals in the original inverse problem. As with any well-defined nonlinear inverse problem, the optimization can be locally approximated by a linear convex problem; thus, to accelerate the training, we train the neural network by minimizing randomly generated quadratic functions instead of performing time-consuming FWIs. To further improve the accuracy and robustness, we use a variational autoencoder that projects and represents the model in latent space. We use the Marmousi and the overthrust examples to demonstrate that the ML-descent method shows faster convergence and outperforms conventional optimization algorithms. The energy in the deeper part of the models can be recovered by the ML-descent even when the pseudoinverse of the Hessian is not incorporated in the FWI update.


Mekatronika ◽  
2020 ◽  
Vol 2 (2) ◽  
pp. 23-27
Author(s):  
Amirul Asyraf Abdul Manan ◽  
Mohd Azraai Mohd Razman ◽  
Ismail Mohd Khairuddin ◽  
Muhammad Nur Aiman Shapiee

This study presents an application of using a Convolutional Neural Network (CNN) based detector to detect chili and its leaves in the chili plant image. Detecting chili on its plant is essential for the development of robotic vision and monitoring. Thus, helps us supervise the plant growth, furthermore, analyses their productivity and quality. This paper aims to develop a system that can monitor and identify bird’s eye chili plants by implementing machine learning. First, the development of methodology for efficient detection of bird’s eye chili and its leaf was made. A dataset of a total of 1866 images after augmentation of bird’s eye chili and its leaf was used in this experiment. YOLO Darknet was implemented to train the dataset. After a series of experiments were conducted, the model is compared with other transfer learning models like YOLO Tiny, Faster R-CNN, and EfficientDet. The classification performance of these transfer learning models has been calculated and compared with each other. The experimental result shows that the Yolov4 Darknet model achieves mAP of 75.69%, followed by EfficientDet at 71.85% for augmented dataset.


Images are the fastest growing content, they contribute significantly to the amount of data generated on the internet every day. Image classification is a challenging problem that social media companies work on vigorously to enhance the user’s experience with the interface. The recent advances in the field of machine learning and computer vision enables personalized suggestions and automatic tagging of images. Convolutional neural network is a hot research topic these days in the field of machine learning. With the help of immensely dense labelled data available on the internet the networks can be trained to recognize the differentiating features among images under the same label. New neural network algorithms are developed frequently that outperform the state-of-art machine learning algorithms. Recent algorithms have managed to produce error rates as low as 3.1%. In this paper the architecture of important CNN algorithms that have gained attention are discussed, analyzed and compared and the concept of transfer learning is used to classify different breeds of dogs..


2020 ◽  
Vol 9 (4) ◽  
pp. 1
Author(s):  
Arman I. Mohammed ◽  
Ahmed AK. Tahir

A new optimization algorithm called Adam Meged with AMSgrad (AMAMSgrad) is modified and used for training a convolutional neural network type Wide Residual Neural Network, Wide ResNet (WRN), for image classification purpose. The modification includes the use of the second moment as in AMSgrad and the use of Adam updating rule but with and (2) as the power of the denominator. The main aim is to improve the performance of the AMAMSgrad optimizer by a proper selection of and the power of the denominator. The implementation of AMAMSgrad and the two known methods (Adam and AMSgrad) on the Wide ResNet using CIFAR-10 dataset for image classification reveals that WRN performs better with AMAMSgrad optimizer compared to its performance with Adam and AMSgrad optimizers. The accuracies of training, validation and testing are improved with AMAMSgrad over Adam and AMSgrad. AMAMSgrad needs less number of epochs to reach maximum performance compared to Adam and AMSgrad. With AMAMSgrad, the training accuracies are (90.45%, 97.79%, 99.98%, 99.99%) respectively at epoch (60, 120, 160, 200), while validation accuracy for the same epoch numbers are (84.89%, 91.53%, 95.05%, 95.23). For testing, the WRN with AMAMSgrad provided an overall accuracy of 94.8%. All these accuracies outrages those provided by WRN with Adam and AMSgrad. The classification metric measures indicate that the given architecture of WRN with the three optimizers performs significantly well and with high confidentiality, especially with AMAMSgrad optimizer.


2020 ◽  
Vol 23 (6) ◽  
pp. 1172-1191
Author(s):  
Artem Aleksandrovich Elizarov ◽  
Evgenii Viktorovich Razinkov

Recently, such a direction of machine learning as reinforcement learning has been actively developing. As a consequence, attempts are being made to use reinforcement learning for solving computer vision problems, in particular for solving the problem of image classification. The tasks of computer vision are currently one of the most urgent tasks of artificial intelligence. The article proposes a method for image classification in the form of a deep neural network using reinforcement learning. The idea of ​​the developed method comes down to solving the problem of a contextual multi-armed bandit using various strategies for achieving a compromise between exploitation and research and reinforcement learning algorithms. Strategies such as -greedy, -softmax, -decay-softmax, and the UCB1 method, and reinforcement learning algorithms such as DQN, REINFORCE, and A2C are considered. The analysis of the influence of various parameters on the efficiency of the method is carried out, and options for further development of the method are proposed.


2020 ◽  
Author(s):  
Yasuhiro Date ◽  
Feifei Wei ◽  
Yuuri Tsuboi ◽  
Kengo Ito ◽  
Kenji Sakata ◽  
...  

Abstract Nuclear magnetic resonance (NMR)-based relaxometry is widely used in various fields of research because of its advantages such as simple sample preparation, easy handling, and relatively low cost compared with metabolomics approaches. However, there have been no reports on the application of the T2 relaxation curves in metabolomics studies involving the evaluation of metabolic mixtures, such as geographical origin determination and feature extraction by pattern recognition and data mining. In this study, we describe a data mining method for relaxometric data (i.e., relaxometric learning). This method is based on a machine learning algorithm supported by the analytical framework optimized for the relaxation curve analyses. In the analytical framework, we incorporated a variable optimization approach and bootstrap resampling-based matrixing to enhance the classification performance and balance the sample size between groups, respectively. The relaxometric learning enabled the extraction of features related to the physical properties of fish muscle and the determination of the geographical origin of the fish by improving the classification performance. Our results suggest that relaxometric learning is a powerful and versatile alternative to conventional metabolomics approaches for evaluating fleshiness of chemical mixtures in food and for other biological and chemical research requiring a nondestructive, cost-effective, and time-saving method.


2019 ◽  
Vol 490 (4) ◽  
pp. 4770-4777 ◽  
Author(s):  
M Kovačević ◽  
G Chiaro ◽  
S Cutini ◽  
G Tosti

ABSTRACT Machine learning is an automatic technique that is revolutionizing scientific research, with innovative applications and wide use in astrophysics. The aim of this study was to develop an optimized version of an Artificial Neural Network machine learning method for classifying blazar candidates of uncertain type detected by the Fermi Large Area Telescope γ-ray instrument. The final result of this study increased the classification performance by about 80 ${{\ \rm per\ cent}}$ with respect to previous method, leaving only 15 unclassified blazars out of 573 blazar candidates of uncertain type listed in the LAT 4-year Source Catalog.


Sign in / Sign up

Export Citation Format

Share Document