UAV-based coffee yield prediction utilizing feature selection and deep learning

Email remains an essential part of our lives and an important means of communication on the internet. One challenge is that spam emails consume a large amount of storage space and bandwidth. Another growing challenge is a defect of state-of-the-art spam filtering methods: the misclassification of genuine emails as spam (false positives). The literature provides various algorithms for email spam classification, which differ in the classification technique used. This paper develops a novel spam detection model for improved cybersecurity. The proposed model involves several phases: dataset acquisition, feature extraction, optimal feature selection, and detection. First, a benchmark email dataset containing both text and image data is collected. Next, feature extraction is performed using two sets of features: text features and visual features. For the text features, Term Frequency-Inverse Document Frequency (TF-IDF) is extracted. For the visual features, the color correlogram and Gray-Level Co-occurrence Matrix (GLCM) are computed. Because the extracted feature vector is long, optimal feature selection is then performed by a new meta-heuristic algorithm called the Fitness Oriented Levy Improvement-based Dragonfly Algorithm (FLI-DA). Once the optimal features are selected, detection is performed by a hybrid learning technique composed of two deep learning approaches, a Recurrent Neural Network (RNN) and a Convolutional Neural Network (CNN). To improve on existing deep learning approaches, the number of hidden neurons in the RNN and CNN is optimized by the same FLI-DA. Finally, the optimized hybrid CNN-RNN technique classifies the data as spam or ham. The experimental outcomes show the ability of the proposed method to perform spam email classification based on improved deep learning.
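To make the text-feature step concrete, the following is a minimal sketch of TF-IDF weighting in plain Python. It assumes one common variant (term frequency normalized by document length, inverse document frequency as the natural log of N/df); the abstract does not state which variant the authors use, and the function name `tf_idf` and the toy emails are illustrative only.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / len(doc), idf = ln(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical toy corpus, not taken from the paper's benchmark dataset.
emails = [
    "win free prize now".split(),
    "meeting agenda for monday".split(),
    "free meeting room now".split(),
]
weights = tf_idf(emails)
```

In this toy corpus, a term that appears in only one email (such as "win") receives a higher weight than a term shared by two emails (such as "free"), which is the property that makes TF-IDF useful as a text feature.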


Forecasting air pollutant concentration using a novel spatiotemporal deep learning model based on clustering, feature selection and empirical wavelet transform

The Science of The Total Environment ◽ 10.1016/j.scitotenv.2021.149654 ◽ 2021 ◽ pp. 149654

Author(s): Jusong Kim ◽ Xiaoli Wang ◽ Chollyong Kang ◽ Jinwon Yu ◽ Penghui Li

Keyword(s): Feature Selection ◽ Deep Learning ◽ Wavelet Transform ◽ Learning Model ◽ Air Pollutant ◽ Pollutant Concentration ◽ Model Based ◽ Empirical Wavelet Transform ◽ Deep Learning Model


Feature Selection and Deep Learning for Deterioration Prediction of the Bridges

Journal of Performance of Constructed Facilities ◽ 10.1061/(asce)cf.1943-5509.0001653 ◽ 2021 ◽ Vol 35 (6) ◽ pp. 04021078

Author(s): Jinsong Zhu ◽ Yanlei Wang

Keyword(s): Feature Selection ◽ Deep Learning


UAV-Based Hyperspectral and Ensemble Machine Learning for Predicting Yield in Winter Wheat

Agronomy ◽ 10.3390/agronomy12010202 ◽ 2022 ◽ Vol 12 (1) ◽ pp. 202

Author(s): Zhen Chen ◽ Qian Cheng ◽ Fuyi Duan ◽ Xiuqiao Huang ◽ Honggang Xu ◽ ...

Keyword(s): Machine Learning ◽ Feature Selection ◽ Winter Wheat ◽ Prediction Model ◽ Grain Filling ◽ Yield Prediction ◽ Spectral Indices ◽ Learner Model ◽ Ensemble Machine Learning ◽ Base Learner

Winter wheat is a widely grown cereal crop worldwide. Using growth-stage information to estimate winter wheat yields in a timely manner is essential for accurate crop management and rapid decision-making in sustainable agriculture, and for increasing productivity while reducing environmental impact. UAV remote sensing is widely used in precision agriculture due to its flexibility and increased spatial and spectral resolution. Hyperspectral data are used to model crop traits because of their ability to provide continuous rich spectral information and higher spectral fidelity. In this study, hyperspectral image data of the winter wheat crop canopy at the flowering and grain-filling stages were acquired by a low-altitude unmanned aerial vehicle (UAV), and machine learning was used to predict winter wheat yields. Specifically, a large number of spectral indices were extracted from the spectral data, and three feature selection methods, recursive feature elimination (RFE), Boruta feature selection, and the Pearson correlation coefficient (PCC), were used to filter high spectral indices in order to reduce the dimensionality of the data. Four base learner models, (1) support vector machine (SVM), (2) Gaussian process (GP), (3) linear ridge regression (LRR), and (4) random forest (RF), were also constructed, and an ensemble machine learning model was developed by combining the four base learner models. The results showed that the SVM yield prediction model, constructed on the basis of the preferred features, performed the best among the base learner models, with an R² between 0.62 and 0.73. The accuracy of the proposed ensemble learner model was higher than that of each base learner model; moreover, the R² (0.78) for the yield prediction model based on Boruta's preferred features was the highest at the grain-filling stage.
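As an illustration of two ideas named in this abstract, the sketch below filters features by their Pearson correlation with yield and scores a combined prediction with R². It is a simplified stand-in under stated assumptions: the abstract does not say how the four base learners are combined, so plain averaging is used here; the base-learner predictions are hard-coded rather than produced by SVM, GP, LRR, or RF models; and all names, thresholds, and numbers are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0

def select_by_pcc(features, target, threshold):
    """Keep feature names whose |r| with the target meets the threshold."""
    return [name for name, values in features.items()
            if abs(pearson(values, target)) >= threshold]

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def average_ensemble(predictions):
    """Average per-sample predictions from several base learners (assumed rule)."""
    return [sum(col) / len(col) for col in zip(*predictions)]

# Hypothetical plot-level data: yields and two candidate spectral indices.
yields = [4.0, 5.0, 6.0, 7.0, 8.0]
indices = {
    "index_a": [0.40, 0.52, 0.59, 0.71, 0.80],  # tracks yield closely
    "index_b": [0.30, 0.90, 0.10, 0.80, 0.20],  # mostly unrelated to yield
}
selected = select_by_pcc(indices, yields, threshold=0.5)

# Hypothetical predictions from two base learners, combined by averaging.
base_predictions = [
    [4.2, 5.1, 5.8, 7.3, 7.9],
    [3.8, 4.9, 6.2, 6.9, 8.3],
]
ensemble = average_ensemble(base_predictions)
ensemble_r2 = r_squared(yields, ensemble)
```

In this toy data the two base learners err in opposite directions, so averaging cancels part of the error; whether an ensemble helps on real data depends on how correlated the base learners' errors are.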


Deep Learning Based Wheat Crop Yield Prediction Model in Punjab Region of North India

Applied Artificial Intelligence ◽ 10.1080/08839514.2021.1976091 ◽ 2021 ◽ pp. 1-25

Author(s): Nishu Bali ◽ Anshu Singla

Keyword(s): Deep Learning ◽ Prediction Model ◽ Crop Yield ◽ North India ◽ Wheat Crop ◽ Yield Prediction


Feature Selection for Wheat Yield Prediction

Research and Development in Intelligent Systems XXVI ◽ 10.1007/978-1-84882-983-1_36 ◽ 2009 ◽ pp. 465-478 ◽ Cited By ~ 4

Author(s): Georg Ruß ◽ Rudolf Kruse

Keyword(s): Feature Selection ◽ Wheat Yield ◽ Yield Prediction ◽ Selection For


Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease

International Journal of Environmental Research and Public Health ◽ 10.3390/ijerph15081750 ◽ 2018 ◽ Vol 15 (8) ◽ pp. 1750 ◽ Cited By ~ 4

Author(s): Seonho Kim ◽ Jungjoon Kim ◽ Hong-Woo Chun

Keyword(s): Artificial Intelligence ◽ Time Series ◽ Feature Selection ◽ Deep Learning ◽ Natural Language Processing ◽ Data Analysis ◽ Natural Language ◽ Real Number ◽ Real Time ◽ Language Processing

Interest in research on health and medical information analysis based on artificial intelligence, especially deep learning techniques, has recently been increasing. Most research in this field has focused on searching for new knowledge for predicting and diagnosing disease by revealing the relation between disease and various information features of data. These features are extracted by analyzing various clinical pathology data, such as EHR (electronic health records), and academic literature, using techniques such as data analysis and natural language processing. However, more research and interest are still needed in applying the latest artificial intelligence-based data analysis techniques to bio-signal data, which are continuous physiological records such as EEG (electroencephalography) and ECG (electrocardiogram). Unlike other types of data, bio-signal data take the form of time series of real numbers, and applying deep learning to them raises many issues in preprocessing, learning, and analysis. Such issues include feature selection and learning steps that remain black boxes, difficulty in recognizing and identifying effective features, and high computational complexity. In this paper, to solve these issues, we provide an encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques. To demonstrate its advantages, we provide the results of three experiments conducted with EEG data from the University of California Irvine, a real-world benchmark bio-signal dataset. Through encoding, the proposed model converts the bio-signals (in the form of waves), which are real-number time series, into a sequence of symbols, or into a sequence of wavelet patterns that are then converted into symbols; it then vectorizes the symbols by learning the sequence using deep learning-based natural language processing. The models of each class can be constructed through learning from the vectorized wavelet patterns and training data. The implemented models can be used for prediction and diagnosis of diseases by classifying new data. By converting the time series of real-number data into sequences of symbols, the proposed method enhances data readability and makes the feature selection and learning processes more intuitive. In addition, it facilitates intuitive and easy recognition and identification of influential patterns. Furthermore, because the encoding process simplifies the data, it drastically reduces computational complexity without degrading analysis performance, which facilitates the real-time, large-capacity data analysis that is essential for developing real-time diagnosis systems.
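The encoding idea, turning a real-valued time series into a sequence of symbols that language-processing methods can consume, can be sketched as follows. This is not the authors' encoding: the abstract does not specify it, so the sketch assumes a simple quantile-based discretization into a four-letter alphabet. The function names, the alphabet, and the sample wave are all hypothetical.

```python
from bisect import bisect_right

def quantile_breakpoints(signal, n_symbols):
    """Cut points that split the sorted samples into roughly equal-sized bins."""
    ordered = sorted(signal)
    return [ordered[(len(ordered) * k) // n_symbols]
            for k in range(1, n_symbols)]

def encode(signal, breakpoints, alphabet="abcd"):
    """Map each sample to the letter of the bin it falls into.

    Assumes len(alphabet) == len(breakpoints) + 1.
    """
    return "".join(alphabet[bisect_right(breakpoints, x)] for x in signal)

# Hypothetical wave samples standing in for one EEG channel.
wave = [0.1, 0.4, 0.9, 1.3, 0.8, 0.2, -0.5, -1.1, -0.7, 0.0, 0.6, 1.2]
cuts = quantile_breakpoints(wave, 4)
symbols = encode(wave, cuts)
```

Once the signal is a string over a small alphabet, it can be split into overlapping "words" and passed to a sequence-embedding model, which is the general direction the abstract describes; the embedding step itself is outside this sketch.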
