scholarly journals Comparison of Machine Learning Techniques for Spatio-Temporal Air Temperature Modelling using Earth Observation Satellites

2019 ◽  
Vol 3 ◽  
pp. 353
Author(s):  
Schneider dos Santos R ◽  
Gasparrini A
Author(s):  
Melika Sajadian ◽  
Ana Teixeira ◽  
Faraz S. Tehrani ◽  
Mathias Lemmens

Abstract. Built environments developed on compressible soils are susceptible to land deformation. The spatio-temporal monitoring and analysis of these deformations are necessary for sustainable development of cities. Techniques such as Interferometric Synthetic Aperture Radar (InSAR) or predictions based on soil mechanics using in situ characterization, such as Cone Penetration Testing (CPT) can be used for assessing such land deformations. Despite the combined advantages of these two methods, the relationship between them has not yet been investigated. Therefore, the major objective of this study is to reconcile InSAR measurements and CPT measurements using machine learning techniques in an attempt to better predict land deformation.


2020 ◽  
Author(s):  
Oladimeji Mudele ◽  
Fabio M. Bayer ◽  
Lucas Zanandrez ◽  
Alvaro Eiras ◽  
Paolo Gamba

<div>Over 50% of the world population is at risk of mosquito-borne diseases. Female Ae. aegypti mosquito species transmit Zika, Dengue, and Chikungunya. The spread of these diseases correlate positively with the vector population, and this population depends on biotic and abiotic environmental factors including temperature, vegetation condition, humidity and precipitation. To combat virus outbreaks, information about vector population is required. To this aim, Earth observation (EO) data provide fast, efficient and economically viable means to estimate environmental features of interest. In this work, we present a temporal distribution model for adult female Ae. aegypti mosquitoes based on the joint use of the Normalized Difference Vegetation Index, the Normalized Difference Water Index, the Land Surface Temperature (both at day and night time), along with the precipitation information, extracted from EO data. The model was applied separately to data obtained during three different vector control and field data collection condition regimes, and used to explain the differences in environmental variable contributions across these regimes. To this aim, a random forest (RF) regression technique and its nonlinear features importance ranking based on mean decrease impurity (MDI) were implemented. To prove the robustness of the proposed model, other machine learning techniques, including support vector regression, decision trees and k-nearest neighbor regression, as well as artificial neural networks, and statistical models such as the linear regression model and generalized linear model were also considered. Our results show that machine learning techniques perform better than linear statistical models for the task at hand, and RF performs best. By ranking the importance of all features based on MDI in RF and selecting the subset comprising the most</div>


2020 ◽  
Author(s):  
Oladimeji Mudele ◽  
Fabio M. Bayer ◽  
Lucas Zanandrez ◽  
Alvaro Eiras ◽  
Paolo Gamba

<div>Over 50% of the world population is at risk of mosquito-borne diseases. Female Ae. aegypti mosquito species transmit Zika, Dengue, and Chikungunya. The spread of these diseases correlate positively with the vector population, and this population depends on biotic and abiotic environmental factors including temperature, vegetation condition, humidity and precipitation. To combat virus outbreaks, information about vector population is required. To this aim, Earth observation (EO) data provide fast, efficient and economically viable means to estimate environmental features of interest. In this work, we present a temporal distribution model for adult female Ae. aegypti mosquitoes based on the joint use of the Normalized Difference Vegetation Index, the Normalized Difference Water Index, the Land Surface Temperature (both at day and night time), along with the precipitation information, extracted from EO data. The model was applied separately to data obtained during three different vector control and field data collection condition regimes, and used to explain the differences in environmental variable contributions across these regimes. To this aim, a random forest (RF) regression technique and its nonlinear features importance ranking based on mean decrease impurity (MDI) were implemented. To prove the robustness of the proposed model, other machine learning techniques, including support vector regression, decision trees and k-nearest neighbor regression, as well as artificial neural networks, and statistical models such as the linear regression model and generalized linear model were also considered. Our results show that machine learning techniques perform better than linear statistical models for the task at hand, and RF performs best. By ranking the importance of all features based on MDI in RF and selecting the subset comprising the most</div>


Water ◽  
2019 ◽  
Vol 11 (6) ◽  
pp. 1268 ◽  
Author(s):  
Zhenzhen Di ◽  
Miao Chang ◽  
Peikun Guo ◽  
Yang Li ◽  
Yin Chang

Most worldwide industrial wastewater, including in China, is still directly discharged to aquatic environments without adequate treatment. Because of a lack of data and few methods, the relationships between pollutants discharged in wastewater and those in surface water have not been fully revealed and unsupervised machine learning techniques, such as clustering algorithms, have been neglected in related research fields. In this study, real-time monitoring data for chemical oxygen demand (COD), ammonia nitrogen (NH3-N), pH, and dissolved oxygen in the wastewater discharged from 2213 factories and in the surface water at 18 monitoring sections (sites) in 7 administrative regions in the Yangtze River Basin from 2016 to 2017 were collected and analyzed by the partitioning around medoids (PAM) and expectation–maximization (EM) clustering algorithms, Welch t-test, Wilcoxon test, and Spearman correlation. The results showed that compared with the spatial cluster comprising unpolluted sites, the spatial cluster comprised heavily polluted sites where more wastewater was discharged had relatively high COD (>100 mg L−1) and NH3-N (>6 mg L−1) concentrations and relatively low pH (<6) from 15 industrial classes that respected the different discharge limits outlined in the pollutant discharge standards. The results also showed that the economic activities generating wastewater and the geographical distribution of the heavily polluted wastewater changed from 2016 to 2017, such that the concentration ranges of pollutants in discharges widened and the contributions from some emerging enterprises became more important. The correlations between the quality of the wastewater and the surface water strengthened as the whole-year data sets were reduced to the heavily polluted periods by the EM clustering and water quality evaluation. This study demonstrates how unsupervised machine learning algorithms play an objective and effective role in data mining real-time monitoring information and highlighting spatio–temporal relationships between pollutants in wastewater discharges and surface water to support scientific water resource management.


2020 ◽  
Author(s):  
Futo Tomizawa ◽  
Yohei Sawada

Abstract. Prediction of spatio-temporal chaotic systems is important in various fields, such as Numerical Weather Prediction (NWP). While data assimilation methods have been applied in NWP, machine learning techniques, such as Reservoir Computing (RC), are recently recognized as promising tools to predict spatio-temporal chaotic systems. However, the sensitivity of the skill of the machine learning based prediction to the imperfectness of observations is unclear. In this study, we evaluate the skill of RC with noisy and sparsely distributed observations. We intensively compare the performances of RC and Local Ensemble Transform Kalman Filter (LETKF) by applying them to the prediction of the Lorenz 96 system. Although RC can successfully predict the Lorenz 96 system if the system is perfectly observed, we find that RC is vulnerable to observation sparsity compared with LETKF. To overcome this limitation of RC, we propose to combine LETKF and RC. In our proposed method, the system is predicted by RC that learned the analysis time series estimated by LETKF. Our proposed method can successfully predict the Lorenz 96 system using noisy and sparsely distributed observations. Most importantly, our method can predict better than LETKF when the process-based model is imperfect.


2019 ◽  
Author(s):  
Mark A. Lee ◽  
Angelo Monteiro ◽  
Andrew Barclay ◽  
Jon Marcar ◽  
Mirena Miteva-Neagu ◽  
...  

AbstractPredicting harvest timing is a key challenge to sustainably develop soft fruit farming and reduce food waste. Soft fruits are perishable, high-value and seasonal, and sales prices are typically time-sensitive. In addition, fruit harvesting is labour-intensive and increasingly expensive making accurate phenological predictions valuable for growers. A novel approach for predicting soft fruit phenology and yields was developed and tested, using strawberries as the model crop. Seedlings were planted in polytunnels, and environmental and yield data were collected throughout the growing season. Over 1.2 million datapoints were collected by networked microsensors which measured spatial and temporal variability in air temperature, relative humidity (RH), soil moisture and photosynthetically active radiation (PAR). Fleeces were added to a subset of the plants to generate additional within-polytunnel variation. Cumulative fruit yields followed logistic growth curves and the coefficients of these curves were dependent on micro-climatic growing conditions. After 10,000 iterations, machine learning revealed that RH was the optimal factor informing the coefficients of these curves, perhaps because it is an integrative metric of air temperature and water status. Trigonometric models transformed weather forecasts, which showed a relatively low agreement with polytunnel air temperature (R2 = 0.6) and RH (R2 = 0.5) measurements, into more accurate polytunnel-specific predictions for temperature and RH (both R2 = 0.8). We present a framework for using machine-learning techniques to calculate curve coefficients and parametrise coupled weather models which can predict fruit yields and timing to a greater degree of accuracy that previously possible. Dataloggers measuring environmental and yield data could infer model parameters using iterative training for novel fruit varieties or crop types growing in different locations without a-priori phenological information. At this stage in the development of artificial intelligence and networked microsensors, this is a step forward in generating bespoke phenological prediction models to inform and support growers.


2020 ◽  
Vol 32 (1) ◽  
Author(s):  
Bruno Ferreira ◽  
Muriel Iten ◽  
Rui G. Silva

Abstract This paper presents and explores the different Earth Observation approaches and their contribution to the achievement of United Nations Sustainable Development Goals. A review on the Sustainable Development concept and its goals is presented followed by Earth Observation approaches relevant to this field, giving special attention to the contribution of Machine Learning methods and algorithms as well as their potential and capabilities to support the achievement of Sustainable Development Goals. Overall, it is observed that Earth Observation plays a key role in monitoring the Sustainable Development Goals given its cost-effectiveness pertaining to data acquisition on all scales and information richness. Despite the success of Machine Learning upon Earth Observation data analysis, it is observed that performance is heavily dependent on the ability to extract and synthesise characteristics from data. Hence, a deeper and effective analysis of the available data is required to identify the strongest features and, hence, the key factors pertaining to Sustainable Development. Overall, this research provides a deeper understanding on the relation between Sustainable Development, Earth Observation and Machine Learning, and how these can support the Sustainable Development of countries and the means to find their correlations. In pursuing the Sustainable Development Goals, given the relevance and growing amount of data generated through Earth Observation, it is concluded that there is an increased need for new methods and techniques strongly suggesting the use of new Machine Learning techniques.


Sign in / Sign up

Export Citation Format

Share Document