scholarly journals Using Different Algorithms and Multi-Seasonal, Textural and Ancillary Information to Increase Classification Accuracy during the Period 2000–2015 in a Mediterranean Semiarid Area

Author(s):  
Francisco Gomariz-Castillo ◽  
Francisco Alonso-Sarría ◽  
Fulgencio Cánovas-García

The aim of this study is to evaluate three different strategies to improve classification accuracy in a highly fragmented semiarid area. i) Using different classification algorithms: Maximum Likelihood, Random Forest, Support Vector Machines and Sequential Maximum a Posteriori, with parameter optimisation in the second and third cases; ii) using different feature sets: spectral features, spectral and textural features, and spectral, textural and terrain features; and iii) using different image-sets: winter, spring, summer, autumn, winter+summer, winter+ spring+summer; and a four seasons combination. A 3-way ANOVA is used to discern which of these approaches and their interactions significantly increases accuracy. Tukey-Kramer contrast using a heteroscedasticity-consistent estimation of the kappa covariances matrix was used to check for significant differences in accuracy. The experiment was carried out with Landsat TM, ETM, and OLI images corresponding to the period 2000-2015. A combination of four images was the best way to improve accuracy. Maximum Likelihood, Random Forest and Support Vector Machines do not significantly increase accuracy when textural information is added, but do so when terrain features are taken into account. On the other hand, Sequential Maximum a Posteriori increases accuracy when textural features are used, but reduces accuracy substantially when terrain features are included. Random Forest using the three feature subsets and Sequential Maximum a Posteriori with spectral and textural features had the largest kappa values, around 0.9.

Author(s):  
Mario Fabián Marini

El partido de Coronel Rosales (Buenos Aires, Argentina) se halla localizado dentro de la región pampeana austral, una de las de mayor relevancia agro productiva del país. En este contexto, el conocimiento de la superficie cultivada adquiere significativa importancia para la posterior planificación agrícola y económica. En tal sentido, la discriminación de cultivos mediante teledetección se dificulta cuando se trata de los de ciclo fenológico muy similar, como el trigo y la cebada. En este estudio se realizó una discriminación de dichos cultivos empleando imágenes de Radar de Apertura Sintética (SAR) Sentinel-1A SLC, imágenes ópticas Sentinel-2 y una combinación de ambos tipos de datos. Se incorporaron medidas de coherencia, textura e intensidad de retrodispersión extraídas de los datos SAR durante el ciclo fenológico completo. Sobre cada escena Sentinel-2 se obtuvo el Índice de Diferencia Normalizada de Vegetación (Normalized Difference Vegetation Index - NDVI). Se emplearon tres algoritmos de clasificación: Máxima Verosimilitud (Maximum Likelihood - MLC), Máquinas de Soporte Vectorial (Support Vector Machines - SVM) y Random Forest (RF). Los mejores resultados se obtuvieron al combinar imágenes ópticas y SAR empleando el clasificador RF. La combinación de las retrodispersiones VV y VH junto a la coherencia y la textura de las imágenes SAR, sumada al apilado de NDVI de imágenes ópticas, arrojó los máximos valores de precisión de la clasificación. El valor de F1 fue de 87.27% para el trigo y de 89.20% para la cebada.


2021 ◽  
Vol 15 (4) ◽  
pp. 101-116
Author(s):  
Lamyaa Gamal El-deen Taha ◽  
Rania Elsayed Ibrahim

The Marina area represents an official new gateway of entry to Egypt and the development of infrastructure is proceeding rapidly in this region. The objective of this research is to obtain building data by means of automated extraction from Pléiades satellite images. This is due to the need for efficient mapping and updating of geodatabases for urban planning and touristic development. It compares the performance of random forest algorithm to other classifiers like maximum likelihood, support vector machines, and backpropagation neural networks over the well-organized buildings which appeared in the satellite images. Images were subsequently classified into two classes: buildings and non-buildings. In addition, basic morphological operations such as opening and closing were used to enhance the smoothness and connectedness of the classified imagery.The overall accuracy for random forest, maximum likelihood, support vector machines, and backpropagation were 97%, 95%, 93% and 92% respectively. It was found that random forest was the best option, followed by maximum likelihood, while the least effective was the backpropagation neural network. The completeness and correctness of the detected buildings were evaluated. Experiments confirmed that the four classification methods can effectively and accurately detect 100% of buildings from very high-resolution images. It is encouraged to use machine learning algorithms for object detection and extraction from very high-resolution images.


2020 ◽  
Vol 44 (4) ◽  
pp. 627-635
Author(s):  
A.M. Belov ◽  
A.Y. Denisova

Earth remote sensing data fusion is intended to produce images of higher quality than the original ones. However, the fusion impact on further thematic processing remains an open question because fusion methods are mostly used to improve the visual data representation. This article addresses an issue of the effect of fusion with increasing spatial and spectral resolution of data on thematic classification of images using various state-of-the-art classifiers and features extraction methods. In this paper, we use our own algorithm to perform multi-frame image fusion over optical remote sensing images with different spatial and spectral resolutions. For classification, we applied support vector machines and Random Forest algorithms. For features, we used spectral channels, extended attribute profiles and local feature attribute profiles. An experimental study was carried out using model images of four imaging systems. The resulting image had a spatial resolution of 2, 3, 4 and 5 times better than for the original images of each imaging system, respectively. As a result of our studies, it was revealed that for the support vector machines method, fusion was inexpedient since excessive spatial details had a negative effect on the classification. For the Random Forest algorithm, the classification results of a fused image were more accurate than for the original low-resolution images in 90% of cases. For example, for images with the smallest difference in spatial resolution (2 times) from the fusion result, the classification accuracy of the fused image was on average 4% higher. In addition, the results obtained for the Random Forest algorithm with fusion were better than the results for the support vector machines method without fusion. Additionally, it was shown that the classification accuracy of a fused image using the Random Forest method could be increased by an average of 9% due to the use of extended attribute profiles as features. Thus, when using data fusion, it is better to use the Random Forest classifier, whereas using fusion with the support vector machines method is not recommended.


Tecnura ◽  
2019 ◽  
Vol 23 (59) ◽  
pp. 13-26 ◽  
Author(s):  
José Antonio Valero Medina ◽  
Beatriz Elena Alzate Atehortúa

Context: Nowadays, the images of the Earth surface and the algorithms for their classification are widely available. In particular, the algorithms are promising in the differentiating of cotton crops stages, but it is necessary to establish the capabilities of the different algorithms in order to identify their advantages, and disadvantages. Method: This paper describes the assessment process in which the Support Vector Machines (SVM) and random-forest technique (decision trees) are compared with the maximum likelihood estimation when differentiating the stages of cotton crops. A RapidEye satellite image of a geographic area in the municipality of San Pelayo, Cordoba (Colombia), is used for the study. Using a set of sampling polygons, a random sample of 6000 pixels was taken (2000 training and 4000 for validating the classifications.) Confusion matrices, and R (data processing and analysis software) were used during the validation process Results: The maximun likelihood estimation presented a correct classification percentage of 68.95%. SVM correctly classified 81.325% of the cases and the decision trees correctly classified 78.925%. The confidence test for the classifications showed non-overlapping intervals, and SVM obtained the highest values. Conclusions: It was possible to confirm the superiority of the technique based on support vector machines for the proposed verification zones. However, this technique requires a number of classes that comprehensively represent the variations of the image (in order to guarantee a minimum number of support vectors) to avoid confusion in the classification of non-sampled areas. This was less evident in the other two classification techniques analysed.


Sensors ◽  
2019 ◽  
Vol 19 (20) ◽  
pp. 4523 ◽  
Author(s):  
Carlos Cabo ◽  
Celestino Ordóñez ◽  
Fernando Sáchez-Lasheras ◽  
Javier Roca-Pardiñas ◽  
and Javier de Cos-Juez

We analyze the utility of multiscale supervised classification algorithms for object detection and extraction from laser scanning or photogrammetric point clouds. Only the geometric information (the point coordinates) was considered, thus making the method independent of the systems used to collect the data. A maximum of five features (input variables) was used, four of them related to the eigenvalues obtained from a principal component analysis (PCA). PCA was carried out at six scales, defined by the diameter of a sphere around each observation. Four multiclass supervised classification models were tested (linear discriminant analysis, logistic regression, support vector machines, and random forest) in two different scenarios, urban and forest, formed by artificial and natural objects, respectively. The results obtained were accurate (overall accuracy over 80% for the urban dataset, and over 93% for the forest dataset), in the range of the best results found in the literature, regardless of the classification method. For both datasets, the random forest algorithm provided the best solution/results when discrimination capacity, computing time, and the ability to estimate the relative importance of each variable are considered together.


2019 ◽  
Vol 11 (11) ◽  
pp. 3222 ◽  
Author(s):  
Pascal Schirmer ◽  
Iosif Mporas

In this paper we evaluate several well-known and widely used machine learning algorithms for regression in the energy disaggregation task. Specifically, the Non-Intrusive Load Monitoring approach was considered and the K-Nearest-Neighbours, Support Vector Machines, Deep Neural Networks and Random Forest algorithms were evaluated across five datasets using seven different sets of statistical and electrical features. The experimental results demonstrated the importance of selecting both appropriate features and regression algorithms. Analysis on device level showed that linear devices can be disaggregated using statistical features, while for non-linear devices the use of electrical features significantly improves the disaggregation accuracy, as non-linear appliances have non-sinusoidal current draw and thus cannot be well parametrized only by their active power consumption. The best performance in terms of energy disaggregation accuracy was achieved by the Random Forest regression algorithm.


2021 ◽  
Vol 13 (18) ◽  
pp. 3573
Author(s):  
Chunfang Kong ◽  
Yiping Tian ◽  
Xiaogang Ma ◽  
Zhengping Weng ◽  
Zhiting Zhang ◽  
...  

Regarding the ever increasing and frequent occurrence of serious landslide disaster in eastern Guangxi, the current study was implemented to adopt support vector machines (SVM), particle swarm optimization support vector machines (PSO-SVM), random forest (RF), and particle swarm optimization random forest (PSO-RF) methods to assess landslide susceptibility in Zhaoping County. To this end, 10 landslide disaster-related variables including digital elevation model (DEM)-derived, meteorology-derived, Landsat8-derived, geology-derived, and human activities factors were provided. Of 345 landslide disaster locations found, 70% were used to train the models, and the rest of them were performed for model verification. The aforementioned four models were run, and landslide susceptibility evaluation maps were produced. Then, receiver operating characteristics (ROC) curves, statistical analysis, and field investigation were performed to test and verify the efficiency of these models. Analysis and comparison of the results denoted that all four landslide models performed well for the landslide susceptibility evaluation as indicated by the area under curve (AUC) values of ROC curves from 0.863 to 0.934. Among them, it has been shown that the PSO-RF model has the highest accuracy in comparison to other landslide models, followed by the PSO-SVM model, the RF model, and the SVM model. Moreover, the results also showed that the PSO algorithm has a good effect on SVM and RF models. Furthermore, the landslide models devolved in the present study are promising methods that could be transferred to other regions for landslide susceptibility evaluation. In addition, the evaluation results can provide suggestions for disaster reduction and prevention in Zhaoping County of eastern Guangxi.


Sign in / Sign up

Export Citation Format

Share Document