scholarly journals An evaluation framework for input variable selection algorithms for environmental data-driven models

2014 ◽  
Vol 62 ◽  
pp. 33-51 ◽  
Author(s):  
Stefano Galelli ◽  
Greer B. Humphrey ◽  
Holger R. Maier ◽  
Andrea Castelletti ◽  
Graeme C. Dandy ◽  
...  
2021 ◽  
Vol 149 ◽  
Author(s):  
Junwen Tao ◽  
Yue Ma ◽  
Xuefei Zhuang ◽  
Qiang Lv ◽  
Yaqiong Liu ◽  
...  

Abstract This study proposed a novel ensemble analysis strategy to improve hand, foot and mouth disease (HFMD) prediction by integrating environmental data. The approach began by establishing a vector autoregressive model (VAR). Then, a dynamic Bayesian networks (DBN) model was used for variable selection of environmental factors. Finally, a VAR model with constraints (CVAR) was established for predicting the incidence of HFMD in Chengdu city from 2011 to 2017. DBN showed that temperature was related to HFMD at lags 1 and 2. Humidity, wind speed, sunshine, PM10, SO2 and NO2 were related to HFMD at lag 2. Compared with the autoregressive integrated moving average model with external variables (ARIMAX), the CVAR model had a higher coefficient of determination (R2, average difference: + 2.11%; t = 6.2051, P = 0.0003 < 0.05), a lower root mean-squared error (−24.88%; t = −5.2898, P = 0.0007 < 0.05) and a lower mean absolute percentage error (−16.69%; t = −4.3647, P = 0.0024 < 0.05). The accuracy of predicting the time-series shape was 88.16% for the CVAR model and 86.41% for ARIMAX. The CVAR model performed better in terms of variable selection, model interpretation and prediction. Therefore, it could be used by health authorities to identify potential HFMD outbreaks and develop disease control measures.


Atmosphere ◽  
2020 ◽  
Vol 11 (11) ◽  
pp. 1160
Author(s):  
Jason Kelley

Solar radiation received at the Earth’s surface provides the energy driving all micro-meteorological phenomena. Local solar radiation measurements are used to estimate energy mediated processes such as evapotranspiration (ET); this information is important in managing natural resources. However, the technical requirements to reliably measure solar radiation limits more extensive adoption of data-driven management. High-quality radiation sensors are expensive, delicate, and require skill to maintain. In contrast, low-cost sensors are widely available, but may lack long-term reliability and intra-sensor repeatability. As weather stations measure solar radiation and other parameters simultaneously, machine learning can be used to integrate various types of environmental data, identify periods of erroneous measurements, and estimate corrected values. We demonstrate two case studies in which we use neural networks (NN) to augment direct radiation measurements with data from co-located sensors, and generate radiation estimates with comparable accuracy to the data typically available from agro-meteorology networks. NN models that incorporated radiometer data reproduced measured radiation with an R2 of 0.9–0.98, and RMSE less than 100 Wm−2, while models using only weather parameters obtained R2 less than 0.75 and RMSE greater than 140 Wm−2. These cases show that a simple NN implementation can complement standard procedures for estimating solar radiation, create opportunities to measure radiation at low-cost, and foster adoption of data-driven management.


Sign in / Sign up

Export Citation Format

Share Document