Recognition of mango leaf disease using convolutional neural network models: a transfer learning approach

Aditya Rajbongshi; Thaharim Khan; Md. Mahbubur Rahman; Anik Pramanik; Shah Md Tanvir Siddiquee; Narayan Ranjan Chakraborty

doi:10.11591/ijeecs.v23.i3.pp1681-1688

Recognition of mango leaf disease using convolutional neural network models: a transfer learning approach

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v23.i3.pp1681-1688 ◽

2021 ◽

Vol 23 (3) ◽

pp. 1681

Author(s):

Aditya Rajbongshi ◽

Thaharim Khan ◽

Md. Mahbubur Rahman ◽

Anik Pramanik ◽

Shah Md Tanvir Siddiquee ◽

...

Keyword(s):

Transfer Learning ◽

Image Data ◽

Network Models ◽

Plant Diseases ◽

Prevention Measures ◽

Neural Network Models ◽

Data Set ◽

Learning Techniques ◽

Overall Performance ◽

Mango Leaves

<p>The acknowledgment of plant diseases assumes an indispensable part in taking infectious prevention measures to improve the quality and amount of harvest yield. Mechanization of plant diseases is a lot advantageous as it decreases the checking work in an enormous cultivated area where mango is planted to a huge extend. Leaves being the food hotspot for plants, the early and precise recognition of leaf diseases is significant. This work focused on grouping and distinguishing the diseases of mango leaves through the process of CNN. DenseNet201, InceptionResNetV2, InceptionV3, ResNet50, ResNet152V2, and Xception all these models of CNN with transfer learning techniques are used here for getting better accuracy from the targeted data set. Image acquisition, image segmentation, and features extraction are the steps involved in disease detection. Different kinds of leaf diseases which are considered as the class for this work such as anthracnose, gall machi, powdery mildew, red rust are used in the dataset consisting of 1500 images of diseased and also healthy mango leaves image data another class is also added in the dataset. We have also evaluated the overall performance matrices and found that the DenseNet201 outperforms by obtaining the highest accuracy as 98.00% than other models.</p>

Download Full-text

Arabic text summarization using deep learning approach

Journal Of Big Data ◽

10.1186/s40537-020-00386-7 ◽

2020 ◽

Vol 7 (1) ◽

Author(s):

Molham Al-Maleh ◽

Said Desouki

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Language Processing ◽

Network Models ◽

Arabic Language ◽

Text Summarization ◽

Neural Network Models ◽

Data Set ◽

Learning Techniques ◽

Arabic Text Summarization

AbstractNatural language processing has witnessed remarkable progress with the advent of deep learning techniques. Text summarization, along other tasks like text translation and sentiment analysis, used deep neural network models to enhance results. The new methods of text summarization are subject to a sequence-to-sequence framework of encoder–decoder model, which is composed of neural networks trained jointly on both input and output. Deep neural networks take advantage of big datasets to improve their results. These networks are supported by the attention mechanism, which can deal with long texts more efficiently by identifying focus points in the text. They are also supported by the copy mechanism that allows the model to copy words from the source to the summary directly. In this research, we are re-implementing the basic summarization model that applies the sequence-to-sequence framework on the Arabic language, which has not witnessed the employment of this model in the text summarization before. Initially, we build an Arabic data set of summarized article headlines. This data set consists of approximately 300 thousand entries, each consisting of an article introduction and the headline corresponding to this introduction. We then apply baseline summarization models to the previous data set and compare the results using the ROUGE scale.

Download Full-text

Evaluation of transfer learning in deep convolutional neural network models for cardiac short axis slice classification

Scientific Reports ◽

10.1038/s41598-021-81525-9 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Namgyu Ho ◽

Yoon-Chul Kim

Keyword(s):

Transfer Learning ◽

Image Data ◽

Network Models ◽

Short Axis Slice ◽

Training Data ◽

Correct Identification ◽

Short Axis ◽

Deep Convolutional Neural Networks ◽

Neural Network Models ◽

Axis Slice

AbstractIn computer-aided analysis of cardiac MRI data, segmentations of the left ventricle (LV) and myocardium are performed to quantify LV ejection fraction and LV mass, and they are performed after the identification of a short axis slice coverage, where automatic classification of the slice range of interest is preferable. Standard cardiac image post-processing guidelines indicate the importance of the correct identification of a short axis slice range for accurate quantification. We investigated the feasibility of applying transfer learning of deep convolutional neural networks (CNNs) as a means to automatically classify the short axis slice range, as transfer learning is well suited to medical image data where labeled data is scarce and expensive to obtain. The short axis slice images were classified into out-of-apical, apical-to-basal, and out-of-basal, on the basis of short axis slice location in the LV. We developed a custom user interface to conveniently label image slices into one of the three categories for the generation of training data and evaluated the performance of transfer learning in nine popular deep CNNs. Evaluation with unseen test data indicated that among the CNNs the fine-tuned VGG16 produced the highest values in all evaluation categories considered and appeared to be the most appropriate choice for the cardiac slice range classification.

Download Full-text

StackBC: Deep learning and transfer learning techniques based stacking approach for accurate Invasive Ductal Carcinoma classification using histology images

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-212240 ◽

2022 ◽

pp. 1-12

Author(s):

Amin Ul Haq ◽

Jian Ping Li ◽

Samad Wali ◽

Sultan Ahmad ◽

Zafar Ali ◽

...

Keyword(s):

Breast Cancer ◽

Deep Learning ◽

Transfer Learning ◽

Invasive Ductal Carcinoma ◽

Ductal Carcinoma ◽

Early Recognition ◽

Predictive Performance ◽

Operating Characteristics ◽

Data Set ◽

Learning Techniques

Artificial intelligence (AI) based computer-aided diagnostic (CAD) systems can effectively diagnose critical disease. AI-based detection of breast cancer (BC) through images data is more efficient and accurate than professional radiologists. However, the existing AI-based BC diagnosis methods have complexity in low prediction accuracy and high computation time. Due to these reasons, medical professionals are not employing the current proposed techniques in E-Healthcare to effectively diagnose the BC. To diagnose the breast cancer effectively need to incorporate advanced AI techniques based methods in diagnosis process. In this work, we proposed a deep learning based diagnosis method (StackBC) to detect breast cancer in the early stage for effective treatment and recovery. In particular, we have incorporated deep learning models including Convolutional neural network (CNN), Long short term memory (LSTM), and Gated recurrent unit (GRU) for the classification of Invasive Ductal Carcinoma (IDC). Additionally, data augmentation and transfer learning techniques have been incorporated for data set balancing and for effective training the model. To further improve the predictive performance of model we used stacking technique. Among the three base classifiers (CNN, LSTM, GRU) the predictive performance of GRU are better as compared to individual model. The GRU is selected as a meta classifier to distinguish between Non-IDC and IDC breast images. The method Hold-Out has been incorporated and the data set is split into 90% and 10% for training and testing of the model, respectively. Model evaluation metrics have been computed for model performance evaluation. To analyze the efficacy of the model, we have used breast histology images data set. Our experimental results demonstrated that the proposed StackBC method achieved improved performance by gaining 99.02% accuracy and 100% area under the receiver operating characteristics curve (AUC-ROC) compared to state-of-the-art methods. Due to the high performance of the proposed method, we recommend it for early recognition of breast cancer in E-Healthcare.

Download Full-text

Chest X-Ray Image Analysis of Convolutional Neural Network Models with Transfer Learning for Prediction of COVID Patients

Smart Computing Techniques and Applications - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-16-1502-3_36 ◽

2021 ◽

pp. 353-361

Author(s):

M. Shyamala Devi ◽

P. Swathi ◽

N. Pavan Kumar ◽

Ravi Varma Tungala ◽

Saranya Vivekanandan ◽

...

Keyword(s):

Neural Network ◽

Image Analysis ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Network Models ◽

Neural Network Models ◽

X Ray ◽

Chest X Ray

Download Full-text

Multiple Ensemble Neural Network Models with Fuzzy Response Aggregation for Predicting COVID-19 Time Series: The Case of Mexico

Healthcare ◽

10.3390/healthcare8020181 ◽

2020 ◽

Vol 8 (2) ◽

pp. 181 ◽

Cited By ~ 5

Author(s):

Patricia Melin ◽

Julio Cesar Monica ◽

Daniela Sanchez ◽

Oscar Castillo

Keyword(s):

Neural Network ◽

Neural Networks ◽

Time Series ◽

Fuzzy Logic ◽

Network Models ◽

Neural Network Models ◽

Validation Data ◽

Data Set ◽

Ensemble Neural Networks ◽

Ensemble Neural Network

In this paper, a multiple ensemble neural network model with fuzzy response aggregation for the COVID-19 time series is presented. Ensemble neural networks are composed of a set of modules, which are used to produce several predictions under different conditions. The modules are simple neural networks. Fuzzy logic is then used to aggregate the responses of several predictor modules, in this way, improving the final prediction by combining the outputs of the modules in an intelligent way. Fuzzy logic handles the uncertainty in the process of making a final decision about the prediction. The complete model was tested for the case of predicting the COVID-19 time series in Mexico, at the level of the states and the whole country. The simulation results of the multiple ensemble neural network models with fuzzy response integration show very good predicted values in the validation data set. In fact, the prediction errors of the multiple ensemble neural networks are significantly lower than using traditional monolithic neural networks, in this way showing the advantages of the proposed approach.

Download Full-text

Using Deep Learning for Image-Based Potato Tuber Disease Detection

Phytopathology ◽

10.1094/phyto-08-18-0288-r ◽

2019 ◽

Vol 109 (6) ◽

pp. 1083-1087 ◽

Cited By ~ 11

Author(s):

Dor Oppenheim ◽

Guy Shani ◽

Orly Erlich ◽

Leah Tsror

Keyword(s):

Deep Learning ◽

Low Cost ◽

Image Data ◽

Object Identification ◽

Plant Diseases ◽

Disease Classification ◽

Successful Implementation ◽

Potato Tubers ◽

Data Set ◽

Convolutional Networks

Many plant diseases have distinct visual symptoms, which can be used to identify and classify them correctly. This article presents a potato disease classification algorithm that leverages these distinct appearances and advances in computer vision made possible by deep learning. The algorithm uses a deep convolutional neural network, training it to classify the tubers into five classes: namely, four disease classes and a healthy potato class. The database of images used in this study, containing potato tubers of different cultivars, sizes, and diseases, was acquired, classified, and labeled manually by experts. The models were trained over different train-test splits to better understand the amount of image data needed to apply deep learning for such classification tasks. The models were tested over a data set of images taken using standard low-cost RGB (red, green, and blue) sensors and were tagged by experts, demonstrating high classification accuracy. This is the first article to report the successful implementation of deep convolutional networks, popular in object identification, to the task of disease identification in potato tubers, showing the potential of deep learning techniques in agricultural tasks.

Download Full-text

Convolutional Neural Network for Remote-Sensing Scene Classification: Transfer Learning Analysis

Remote Sensing ◽

10.3390/rs12010086 ◽

2019 ◽

Vol 12 (1) ◽

pp. 86 ◽

Cited By ~ 14

Author(s):

Rafael Pires de Lima ◽

Kurt Marfurt

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Network Models ◽

Remote Sensing Image ◽

Natural Images ◽

Natural Image ◽

Scene Classification ◽

Neural Network Models

Remote-sensing image scene classification can provide significant value, ranging from forest fire monitoring to land-use and land-cover classification. Beginning with the first aerial photographs of the early 20th century to the satellite imagery of today, the amount of remote-sensing data has increased geometrically with a higher resolution. The need to analyze these modern digital data motivated research to accelerate remote-sensing image classification. Fortunately, great advances have been made by the computer vision community to classify natural images or photographs taken with an ordinary camera. Natural image datasets can range up to millions of samples and are, therefore, amenable to deep-learning techniques. Many fields of science, remote sensing included, were able to exploit the success of natural image classification by convolutional neural network models using a technique commonly called transfer learning. We provide a systematic review of transfer learning application for scene classification using different datasets and different deep-learning models. We evaluate how the specialization of convolutional neural network models affects the transfer learning process by splitting original models in different points. As expected, we find the choice of hyperparameters used to train the model has a significant influence on the final performance of the models. Curiously, we find transfer learning from models trained on larger, more generic natural images datasets outperformed transfer learning from models trained directly on smaller remotely sensed datasets. Nonetheless, results show that transfer learning provides a powerful tool for remote-sensing scene classification.

Download Full-text

Extracting Representative Images of Tourist Attractions from Flickr by Combining an Improved Cluster Method and Multiple Deep Learning Models

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9020081 ◽

2020 ◽

Vol 9 (2) ◽

pp. 81 ◽

Cited By ~ 1

Author(s):

Shanshan Han ◽

Fu Ren ◽

Qingyun Du ◽

Dawei Gui

Keyword(s):

Real Life ◽

Image Data ◽

Network Models ◽

Cluster Method ◽

Single Shot ◽

Large Area ◽

Neural Network Models ◽

Tourist Attractions ◽

Geotagged Photos ◽

Filter Noise

Extracting representative images of tourist attractions from geotagged photos is beneficial to many fields in tourist management, such as applications in touristic information systems. This task usually begins with clustering to extract tourist attractions from raw coordinates in geotagged photos. However, most existing cluster methods are limited in the accuracy and granularity of the places of interest, as well as in detecting distinct tags, due to its primary consideration of spatial relationships. After clustering, the challenge still exists for the task of extracting representative images within the geotagged base image data, because of the existence of noisy photos occupied by a large area proportion of humans and unrelated objects. In this paper, we propose a framework containing an improved cluster method and multiple neural network models to extract representative images of tourist attractions. We first propose a novel time- and user-constrained density-joinable cluster method (TU-DJ-Cluster), specific to photos with similar geotags to detect place-relevant tags. Then we merge and extend the clusters according to the similarity between pairs of tag embeddings, as trained from Word2Vec. Based on the clustering result, we filter noise images with Multilayer Perceptron and a single-shot multibox detector model, and further select representative images with the deep ranking model. We select Beijing as the study area. The quantitative and qualitative analysis, as well as the questionnaire results obtained from real-life tourists, demonstrate the effectiveness of this framework.

Download Full-text

An explicit methodology for manufacturing cost–tolerance modeling and optimization using the neural network integrated with the genetic algorithm

Artificial intelligence for engineering design analysis and manufacturing ◽

10.1017/s0890060420000219 ◽

2020 ◽

Vol 34 (3) ◽

pp. 430-443

Author(s):

A. Saravanan ◽

J. Jerald ◽

A. Delphin Carolina Rani

Keyword(s):

Neural Network ◽

Genetic Algorithm ◽

Network Model ◽

Neural Network Model ◽

Network Models ◽

Manufacturing Cost ◽

Genetic Operators ◽

Neural Network Models ◽

Data Set ◽

The Neural Network

AbstractThe objective of the paper is to develop a new method to model the manufacturing cost–tolerance and to optimize the tolerance values along with its manufacturing cost. A cost–tolerance relation has a complex nonlinear correlation among them. The property of a neural network makes it possible to model the complex correlation, and the genetic algorithm (GA) is integrated with the best neural network model to optimize the tolerance values. The proposed method used three types of neural network models (multilayer perceptron, backpropagation network, and radial basis function). These network models were developed separately for prismatic and rotational parts. For the construction of network models, part size and tolerance values were used as input neurons. The reference manufacturing cost was assigned as the output neuron. The qualitative production data set was gathered in a workshop and partitioned into three files for training, testing, and validation, respectively. The architecture of the network model was identified based on the best regression coefficient and the root-mean-square-error value. The best network model was integrated into the GA, and the role of genetic operators was also studied. Finally, two case studies from the literature were demonstrated in order to validate the proposed method. A new methodology based on the neural network model enables the design and process planning engineers to propose an intelligent decision irrespective of their experience.

Download Full-text

Unsupervised Mutual Information Criterion for Elimination of Overtraining in Supervised Multilayer Networks

Neural Computation ◽

10.1162/neco.1995.7.1.86 ◽

1995 ◽

Vol 7 (1) ◽

pp. 86-107 ◽

Cited By ~ 51

Author(s):

G. Deco ◽

W. Finnoff ◽

H. G. Zimmermann

Keyword(s):

Mutual Information ◽

Network Architecture ◽

Synthetic Data ◽

Network Models ◽

Information Criterion ◽

Small Data ◽

Economic Time Series ◽

Neural Network Models ◽

Data Set ◽

Penalty Term

Controlling the network complexity in order to prevent overfitting is one of the major problems encountered when using neural network models to extract the structure from small data sets. In this paper we present a network architecture designed for use with a cost function that includes a novel complexity penalty term. In this architecture the outputs of the hidden units are strictly positive and sum to one, and their outputs are defined as the probability that the actual input belongs to a certain class formed during learning. The penalty term expresses the mutual information between the inputs and the extracted classes. This measure effectively describes the network complexity with respect to the given data in an unsupervised fashion. The efficiency of this architecture/penalty-term when combined with backpropagation training, is demonstrated on a real world economic time series forecasting problem. The model was also applied to the benchmark sunspot data and to a synthetic data set from the statistics community.

Download Full-text