scholarly journals Bayesian Hierarchical Models can Infer Interpretable Predictions of Leaf Area Index From Heterogeneous Datasets

2022 ◽  
Vol 9 ◽  
Author(s):  
Olivera Stojanović ◽  
Bastian Siegmann ◽  
Thomas Jarmer ◽  
Gordon Pipa ◽  
Johannes Leugering

Environmental scientists often face the challenge of predicting a complex phenomenon from a heterogeneous collection of datasets that exhibit systematic differences. Accounting for these differences usually requires including additional parameters in the predictive models, which increases the probability of overfitting, particularly on small datasets. We investigate how Bayesian hierarchical models can help mitigate this problem by allowing the practitioner to incorporate information about the structure of the dataset explicitly. To this end, we look at a typical application in remote sensing: the estimation of leaf area index of white winter wheat, an important indicator for agronomical modeling, using measurements of reflectance spectra collected at different locations and growth stages. Since the insights gained from such a model could be used to inform policy or business decisions, the interpretability of the model is a primary concern. We, therefore, focus on models that capture the association between leaf area index and the spectral reflectance at various wavelengths by spline-based kernel functions, which can be visually inspected and analyzed. We compare models with three different levels of hierarchy: a non-hierarchical baseline model, a model with hierarchical bias parameter, and a model in which bias and kernel parameters are hierarchically structured. We analyze them using Markov Chain Monte Carlo sampling diagnostics and an intervention-based measure of feature importance. The improved robustness and interpretability of this approach show that Bayesian hierarchical models are a versatile tool for the prediction of leaf area index, particularly in scenarios where the available data sources are heterogeneous.

2021 ◽  
Author(s):  
Olivera Stojanović ◽  
Bastian Siegmann ◽  
Thomas Jarmer ◽  
Gordon Pipa ◽  
Johannes Leugering

AbstractEnvironmental scientists often have to predict a complex phenomenon from a heterogeneous collection of datasets. This is particularly challenging if there are systematic differences between them, as is often the case. Accounting for these differences requires a larger number of parameters and thus increases the risk of overfitting. We investigate how Bayesian hierarchical models can help mitigate this problem by allowing the practitioner to explicitly incorporate information about the dataset structure and general domain knowledge. To this end, we look at a typical application in remote sensing: the estimation of leaf area index (of white winter wheat), an important indicator for agronomical modeling, from measurements of reflectance spectra collected at different locations and growth stages. Since the insights gained from such a model could be used to inform policy or business decisions, the interpretability of the model is a primary concern. We, therefore, focus on models that capture the association between leaf area index and the spectral reflectance at various wavelengths by spline-based kernel functions, which can be visually inspected and analyzed. We compare models with three different levels of hierarchy: a non-hierarchical baseline model, a model with hierarchical bias parameter, and a model in which bias and kernel parameters are hierarchically structured. We analyze them using Markov Chain Monte Carlo sampling diagnostics and an intervention-based measure of feature importance. The improved robustness and interpretability of this approach lead us to recommend Bayesian hierarchical models as a versatile tool for environmental sciences and beyond, particularly in scenarios where the available data sources are heterogeneous.


2021 ◽  
Vol 14 (1) ◽  
pp. 98
Author(s):  
Quanjun Jiao ◽  
Qi Sun ◽  
Bing Zhang ◽  
Wenjiang Huang ◽  
Huichun Ye ◽  
...  

Canopy chlorophyll content (CCC) is an important indicator for crop-growth monitoring and crop productivity estimation. The hybrid method, involving the PROSAIL radiative transfer model and machine learning algorithms, has been widely applied for crop CCC retrieval. However, PROSAIL’s homogeneous canopy hypothesis limits the ability to use the PROSAIL-based CCC estimation across different crops with a row structure. In addition to leaf area index (LAI), average leaf angle (ALA) is the most important canopy structure factor in the PROSAIL model. Under the same LAI, adjustment of the ALA can make a PROSAIL simulation obtain the same canopy gap as the heterogeneous canopy at a specific observation angle. Therefore, parameterization of an adjusted ALA (ALAadj) is an optimal choice to make the PROSAIL model suitable for specific row-planted crops. This paper attempted to improve PROSAIL-based CCC retrieval for different crops, using a random forest algorithm, by introducing the prior knowledge of crop-specific ALAadj. Based on the field reflectance spectrum at nadir, leaf area index, and leaf chlorophyll content, parameterization of the ALAadj in the PROSAIL model for wheat and soybean was carried out. An algorithm integrating the random forest and PROSAIL simulations with prior ALAadj information was developed for wheat and soybean CCC retrieval. Ground-measured CCC measurements were used to validate the CCC retrieved from canopy spectra. The results showed that the ALAadj values (62 degrees for wheat; 45 degrees for soybean) that were parameterized for the PROSAIL model demonstrated good discrimination between the two crops. The proposed algorithm improved the CCC retrieval accuracy for wheat and soybean, regardless of whether continuous visible to near-infrared spectra with 50 bands (RMSE from 39.9 to 32.9 μg cm−2; R2 from 0.67 to 0.76) or discrete spectra with 13 bands (RMSE from 43.9 to 33.7 μg cm−2; R2 from 0.63 to 0.74) and nine bands (RMSE from 45.1 to 37.0 μg cm−2; R2 from 0.61 to 0.71) were used. The proposed hybrid algorithm, based on PROSAIL simulations with ALAadj, has the potential for satellite-based CCC estimation across different crop types, and it also has a good reference value for the retrieval of other crop parameters.


2020 ◽  
Vol 49 (4) ◽  
pp. 1103-1109
Author(s):  
Jiannan Cai ◽  
Qiancai Jiang ◽  
Yafei Shi ◽  
Jiabin Wang

Leaf area index (LAI) is an important indicator of vegetation growth and health monitoring. The free access of Sentinel-2 optical satellite data from European Space Agency (ESA) since June 2015 made it possible to determine dynamic monitoring of LAI in a large area and a short re-entry period, owing to its high spatial resolution (up to 10 m) and unique band combination. These features of Sentinel-2 may bring great application potential in urban forestry management and ecological environment monitoring. In case of Zhongshan city in Guangdong Province, based on Sentinels Application Platform (SNAP) software and two scenes of Sentinel-2 data, LAI of the whole city, as well as specific areas, were retrieved and analyzed. The retrieval result was consistent with the ecological spatial pattern and the vegetation protection status in Zhongshan city, also highly consistent with the field LAI measurement in the forest in a reservoir. The result indicates that this method has good applicability and accuracy in urban LAI inversion.


2008 ◽  
Author(s):  
Ralph F. Milliff ◽  
Mark Berliner ◽  
Emanuele D. Lorenzo ◽  
Christopher K. Wikle

Sign in / Sign up

Export Citation Format

Share Document