Plot-Based Classification of Macronutrient Levels in Oil Palm Trees with Landsat-8 Images and Machine Learning

Oil palm crops are essential for ensuring sustainable edible oil production, in which production is highly dependent on fertilizer applications. Using Landsat-8 imageries, the feasibility of macronutrient level classification with Machine Learning (ML) was studied. Variable rates of compost and inorganic fertilizer were applied to experimental plots and the following nutrients were studied: nitrogen (N), phosphorus (P), potassium (K), magnesium (Mg) and calcium (Ca). By applying image filters, separability metrics, vegetation indices (VI) and feature selection, spectral features for each plot were acquired and used with ML models to classify macronutrient levels of palm stands from chemical foliar analysis of their 17th frond. The models were calibrated and validated with 30 repetitions, with the best mean overall accuracy reported for N and K at 79.7 ± 4.3% and 76.6 ± 4.1% respectively, while accuracies for P, Mg and Ca could not be accurately classified due to the limitations of the dataset used. The study highlighted the effectiveness of separability metrics in quantifying class separability, the importance of indices for N and K level classification, and the effects of filter and feature selection on model performance, as well as concluding RF or SVM models for excessive N and K level detection. Future improvements should focus on further model validation and the use of higher-resolution imaging.

Download Full-text

Discriminating between large-scale oil palm plantations and smallholdings on tropical peatlands using vegetation indices and supervised classification of LANDSAT-8

International Journal of Remote Sensing ◽

10.1080/01431161.2019.1579944 ◽

2019 ◽

Vol 40 (19) ◽

pp. 7312-7328 ◽

Cited By ~ 2

Author(s):

Aslinda Oon ◽

Helmi Zulhaidi Mohd Shafri ◽

Alex Mark Lechner ◽

Badrul Azhar

Keyword(s):

Oil Palm ◽

Supervised Classification ◽

Large Scale ◽

Vegetation Indices ◽

Landsat 8 ◽

Tropical Peatlands

Download Full-text

Classification of Oil Palm Female Inflorescences Anthesis Stages Using Machine Learning Approaches

Information Processing in Agriculture ◽

10.1016/j.inpa.2020.11.007 ◽

2020 ◽

Author(s):

Mamehgol Yousefi ◽

Azmin Shakrine ◽

Samsuzana bt. Abd Aziz ◽

Syaril Azrad ◽

Mohamed Mazmira ◽

...

Keyword(s):

Machine Learning ◽

Oil Palm ◽

Learning Approaches

Download Full-text

Mapping Allochemical Limestone Formations in Hazara, Pakistan Using Google Cloud Architecture: Application of Machine-Learning Algorithms on Multispectral Data

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10020058 ◽

2021 ◽

Vol 10 (2) ◽

pp. 58

Author(s):

Muhammad Fawad Akbar Khan ◽

Khan Muhammad ◽

Shahid Bashir ◽

Shahab Ud Din ◽

Muhammad Hanif

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Learning Algorithms ◽

Remote Sensing Data ◽

Kappa Coefficient ◽

Machine Learning Algorithms ◽

Landsat 8 ◽

Sensing Data ◽

Fossiliferous Limestone

Low-resolution Geological Survey of Pakistan (GSP) maps surrounding the region of interest show oolitic and fossiliferous limestone occurrences correspondingly in Samanasuk, Lockhart, and Margalla hill formations in the Hazara division, Pakistan. Machine-learning algorithms (MLAs) have been rarely applied to multispectral remote sensing data for differentiating between limestone formations formed due to different depositional environments, such as oolitic or fossiliferous. Unlike the previous studies that mostly report lithological classification of rock types having different chemical compositions by the MLAs, this paper aimed to investigate MLAs’ potential for mapping subclasses within the same lithology, i.e., limestone. Additionally, selecting appropriate data labels, training algorithms, hyperparameters, and remote sensing data sources were also investigated while applying these MLAs. In this paper, first, oolitic (Samanasuk), fossiliferous (Lockhart and Margalla) limestone-bearing formations along with the adjoining Hazara formation were mapped using random forest (RF), support vector machine (SVM), classification and regression tree (CART), and naïve Bayes (NB) MLAs. The RF algorithm reported the best accuracy of 83.28% and a Kappa coefficient of 0.78. To further improve the targeted allochemical limestone formation map, annotation labels were generated by the fusion of maps obtained from principal component analysis (PCA), decorrelation stretching (DS), X-means clustering applied to ASTER-L1T, Landsat-8, and Sentinel-2 datasets. These labels were used to train and validate SVM, CART, NB, and RF MLAs to obtain a binary classification map of limestone occurrences in the Hazara division, Pakistan using the Google Earth Engine (GEE) platform. The classification of Landsat-8 data by CART reported 99.63% accuracy, with a Kappa coefficient of 0.99, and was in good agreement with the field validation. This binary limestone map was further classified into oolitic (Samanasuk) and fossiliferous (Lockhart and Margalla) formations by all the four MLAs; in this case, RF surpassed all the other algorithms with an improved accuracy of 96.36%. This improvement can be attributed to better annotation, resulting in a binary limestone classification map, which formed a mask for improved classification of oolitic and fossiliferous limestone in the area.

Download Full-text

Laser-induced breakdown spectroscopy for the classification of wood materials using machine learning methods combined with feature selection

Plasma Science and Technology ◽

10.1088/2058-6272/abf1ac ◽

2021 ◽

Author(s):

Xutai Cui ◽

Qianqian Wang ◽

Kai Wei ◽

Geer Teng ◽

Xiangjun Xu

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Laser Induced Breakdown Spectroscopy ◽

Learning Methods ◽

Breakdown Spectroscopy ◽

Machine Learning Methods ◽

Laser Induced Breakdown

Download Full-text

A combined strategy of feature selection and machine learning to identify predictors of prediabetes

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocz204 ◽

2019 ◽

Vol 27 (3) ◽

pp. 396-406 ◽

Cited By ~ 1

Author(s):

Kushan De Silva ◽

Daniel Jönsson ◽

Ryan T Demmer

Keyword(s):

Machine Learning ◽

Feature Selection ◽

National Health ◽

Screening Tool ◽

Model Performance ◽

Nutrition Examination Survey ◽

Validation Data ◽

Internal Validation ◽

Health And Nutrition ◽

Wide Range

Abstract Objective To identify predictors of prediabetes using feature selection and machine learning on a nationally representative sample of the US population. Materials and Methods We analyzed n = 6346 men and women enrolled in the National Health and Nutrition Examination Survey 2013–2014. Prediabetes was defined using American Diabetes Association guidelines. The sample was randomly partitioned to training (n = 3174) and internal validation (n = 3172) sets. Feature selection algorithms were run on training data containing 156 preselected exposure variables. Four machine learning algorithms were applied on 46 exposure variables in original and resampled training datasets built using 4 resampling methods. Predictive models were tested on internal validation data (n = 3172) and external validation data (n = 3000) prepared from National Health and Nutrition Examination Survey 2011–2012. Model performance was evaluated using area under the receiver operating characteristic curve (AUROC). Predictors were assessed by odds ratios in logistic models and variable importance in others. The Centers for Disease Control (CDC) prediabetes screening tool was the benchmark to compare model performance. Results Prediabetes prevalence was 23.43%. The CDC prediabetes screening tool produced 64.40% AUROC. Seven optimal (≥ 70% AUROC) models identified 25 predictors including 4 potentially novel associations; 20 by both logistic and other nonlinear/ensemble models and 5 solely by the latter. All optimal models outperformed the CDC prediabetes screening tool (P < 0.05). Discussion Combined use of feature selection and machine learning increased predictive performance outperforming the recommended screening tool. A range of predictors of prediabetes was identified. Conclusion This work demonstrated the value of combining feature selection with machine learning to identify a wide range of predictors that could enhance prediabetes prediction and clinical decision-making.

Download Full-text

Comparison of Class Separability, Forward Sequential Search and Genetic Algorithms for Feature Selection in the Classification of Individual and Clustered Microcalcifications in Digital Mammograms

Lecture Notes in Computer Science - Image Analysis and Recognition ◽

10.1007/978-3-540-74260-9_81 ◽

2007 ◽

pp. 911-922 ◽

Cited By ~ 2

Author(s):

Rolando R. Hernández-Cisneros ◽

Hugo Terashima-Marín ◽

Santiago E. Conant-Pablos

Keyword(s):

Genetic Algorithms ◽

Feature Selection ◽

Sequential Search ◽

Class Separability

Download Full-text

Isolation Forests to Evaluate Class Separability and the Representativeness of Training and Validation Areas in Land Cover Classification

Remote Sensing ◽

10.3390/rs11243000 ◽

2019 ◽

Vol 11 (24) ◽

pp. 3000 ◽

Cited By ~ 2

Author(s):

Francisco Alonso-Sarria ◽

Carmen Valdivieso-Ros ◽

Francisco Gomariz-Castillo

Keyword(s):

Remote Sensing ◽

Random Forest ◽

Land Cover ◽

Classification Accuracy ◽

Land Cover Classification ◽

Landsat 8 ◽

Commission Errors ◽

Class Separability ◽

Isolation Forest

Supervised land cover classification from remote sensing imagery is based on gathering a set of training areas to characterise each of the classes and to train a predictive model that is then used to predict land cover in the rest of the image. This procedure relies mainly on the assumptions of statistical separability of the classes and the representativeness of the training areas. This paper uses isolation forests, a type of random tree ensembles, to analyse both assumptions and to easily correct lack of representativeness by digitising new training areas where needed to improve the classification of a Landsat-8 set of images with Random Forest. The results show that the improved set of training areas after the isolation forest analysis is more representative of the whole image and increases classification accuracy. Besides, the distribution of isolation values can be useful to estimate class separability. A class separability parameter that summarises such distributions is proposed. This parameter is more correlated to omission and commission errors than other separability measures such as the Jeffries–Matusita distance.

Download Full-text

A fuzzy based feature selection from independent component subspace for machine learning classification of microarray data

Genomics Data ◽

10.1016/j.gdata.2016.02.012 ◽

2016 ◽

Vol 8 ◽

pp. 4-15 ◽

Cited By ~ 37

Author(s):

Rabia Aziz ◽

C.K. Verma ◽

Namita Srivastava

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Microarray Data ◽

Independent Component ◽

Machine Learning Classification

Download Full-text

Feature Selection and Classification of Leukemia Cancer Using Machine Learning Techniques

Machine Learning Research ◽

10.11648/j.mlr.20200502.11 ◽

2020 ◽

Vol 5 (2) ◽

pp. 18

Author(s):

Md. Alamgir Sarder ◽

Md. Maniruzzaman ◽

Benojir Ahammed

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Machine Learning Techniques ◽

Learning Techniques

Download Full-text

Classification of Landsat-8 Imagery Based On Pca And Ndvi Methods

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.j9843.0881019 ◽

2019 ◽

Vol 8 (10) ◽

pp. 4321-4325

Keyword(s):

Remote Sensing ◽

Satellite Image ◽

Vegetation Indices ◽

Region Of Interest ◽

The Other ◽

Spectral Information ◽

Landsat 8 ◽

Statistical Parameters ◽

Sensing Applications

Remote sensing is an important issue in satellite image classification. In developing a significant sustainable system in agriculture farming, the major concern for remote sensing applications is the crop classification mechanism. The other important application in remote sensing is urban classification which gives the information about houses, roads, buildings, vegetation etc. A superior indicator for the presence of vegetation can be computed from the vegetation indices of a satellite image. This indicator supports in describing the health of vegetation through the image attributes like greenness and density. The other parameter in detecting objects or region of interest is an image is the texture. A satellite image contains spectral information and can be represented by more spectral bands and classification is very tough task. Generally, Classification of individual pixels in satellite images is based on the spectral information. In this research paper Principle component analysis and combination of PCA and NDVI classification methods are applied on Landsat-8 images. These images are acquired from USGS. The performance of these methods is compared in statistical parameters such as Kappa coefficient, overall accuracy, user’s accuracy, precision accuracy and F1 accuracy. In this work existing method is PCA and proposed method is PCA+NDVI. Experimental results shows that the proposed method has better statistical values compared to existing method.

Download Full-text