Assessment of Predictive Uncertainty of Data-Driven Environmental Models

Author(s):  
Benedikt Knüsel ◽  
Christoph Baumberger ◽  
Marius Zumwald ◽  
David N. Bresch ◽  
Reto Knutti

<p>Due to ever larger volumes of environmental data, environmental scientists can increasingly use machine learning to construct data-driven models of phenomena. Data-driven environmental models can provide useful information to society, but this requires that their uncertainties be understood. However, new conceptual tools are needed for this because existing approaches to assess the uncertainty of environmental models do so in terms of specific locations, such as model structure and parameter values. These locations are not informative for an assessment of the predictive uncertainty of data-driven models. Rather than the model structure or model parameters, we argue that it is the <em>behavior</em> of a data-driven model that should be subject to an assessment of uncertainty.</p><p>In this paper, we present a novel framework that can be used to assess the uncertainty of data-driven environmental models. The framework uses argument analysis and focuses on epistemic uncertainty, i.e., uncertainty that is related to a lack of knowledge. It proceeds in three steps. The first step consists in reconstructing the justification of the assumption that the model used is fit for the predictive task at hand. Arguments for this justification may, for example, refer to sensitivity analyses and model performance on a validation dataset. In a second step, this justification is evaluated to identify how conclusively the fitness-for-purpose assumption is justified. In a third step, the epistemic uncertainty is assessed based on the evaluation of the arguments. Epistemic uncertainty emerges due to insufficient justification of the fitness-for-purpose assumption, i.e., if the model is less-than-maximally fit-for-purpose. This lack of justification translates to predictive uncertainty, or <em>first-order uncertainty</em>. Uncertainty also emerges if it is unclear how well the fitness-for-purpose assumption is justified. We refer to this uncertainty as “second-order uncertainty”. In other words, second-order uncertainty is uncertainty that researchers face when assessing first-order uncertainty.</p><p>We illustrate how the framework is applied by discussing to a case study from environmental science in which data-driven models are used to make long-term projections of soil selenium concentrations. We highlight that in many applications, the lack of system understanding and the lack of transparency of machine learning can introduce a substantial level of second-order uncertainty. We close by sketching how the framework can inform uncertainty quantification.</p>

BJS Open ◽  
2021 ◽  
Vol 5 (1) ◽  
Author(s):  
F Torresan ◽  
F Crimì ◽  
F Ceccato ◽  
F Zavan ◽  
M Barbot ◽  
...  

Abstract Background The main challenge in the management of indeterminate incidentally discovered adrenal tumours is to differentiate benign from malignant lesions. In the absence of clear signs of invasion or metastases, imaging techniques do not always precisely define the nature of the mass. The present pilot study aimed to determine whether radiomics may predict malignancy in adrenocortical tumours. Methods CT images in unenhanced, arterial, and venous phases from 19 patients who had undergone resection of adrenocortical tumours and a cohort who had undergone surveillance for at least 5 years for incidentalomas were reviewed. A volume of interest was drawn for each lesion using dedicated software, and, for each phase, first-order (histogram) and second-order (grey-level colour matrix and run-length matrix) radiological features were extracted. Data were revised by an unsupervised machine learning approach using the K-means clustering technique. Results Of operated patients, nine had non-functional adenoma and 10 carcinoma. There were 11 patients in the surveillance group. Two first-order features in unenhanced CT and one in arterial CT, and 14 second-order parameters in unenhanced and venous CT and 10 second-order features in arterial CT, were able to differentiate adrenocortical carcinoma from adenoma (P < 0.050). After excluding two malignant outliers, the unsupervised machine learning approach correctly predicted malignancy in seven of eight adrenocortical carcinomas in all phases. Conclusion Radiomics with CT texture analysis was able to discriminate malignant from benign adrenocortical tumours, even by an unsupervised machine learning approach, in nearly all patients.


2020 ◽  
Vol 134 ◽  
pp. 104754 ◽  
Author(s):  
Benedikt Knüsel ◽  
Christoph Baumberger ◽  
Marius Zumwald ◽  
David N. Bresch ◽  
Reto Knutti

2021 ◽  
Author(s):  
Stefano Olgiati ◽  
Nima Heidari ◽  
Davide Meloni ◽  
Federico Pirovano ◽  
Ali Noorani ◽  
...  

Background Quantum computing (QC) and quantum machine learning (QML) are promising experimental technologies which can improve precision medicine applications by reducing the computational complexity of algorithms driven by big, unstructured, real-world data. The clinical problem of knee osteoarthritis is that, although some novel therapies are safe and effective, the response is variable, and defining the characteristics of an individual who will respond remains a challenge. In this paper we tested a quantum neural network (QNN) application to support precision data-driven clinical decisions to select personalized treatments for advanced knee osteoarthritis. Methods Following patients consent and Research Ethics Committee approval, we collected clinico-demographic data before and after the treatment from 170 patients eligible for knee arthroplasty (Kellgren-Lawrence grade ≥ 3, OKS ≤ 27, Age ≥ 64 and idiopathic aetiology of arthritis) treated over a 2 year period with a single injection of microfragmented fat. Gender classes were balanced (76 M, 94 F) to mitigate gender bias. A patient with an improvement ≥ 7 OKS has been considered a Responder. We trained our QNN Classifier on a randomly selected training subset of 113 patients to classify responders from non-responders (73 R, 40 NR) in pain and function at 1 year. Outliers were hidden from the training dataset but not from the validation set. Results We tested our QNN Classifier on a randomly selected test subset of 57 patients (34 R, 23 NR) including outliers. The No Information Rate was equal to 0.59. Our application correctly classified 28 Responders out of 34 and 6 non-Responders out of 23 (Sensitivity = 0.82, Specificity = 0.26, F1 Statistic= 0.71). The Positive (LR+) and Negative (LR-) Likelihood Ratios were respectively 1.11 and 0.68. The Diagnostic Odds Ratio (DOR) was equal to 2. Conclusions Preliminary results on a small validation dataset show that quantum machine learning applied to data-driven clinical decisions for the personalized treatment of advanced knee osteoarthritis is a promising technology to reduce computational complexity and improve prognostic performance. Our results need further research validation with larger, real-world unstructured datasets, and clinical validation with an AI Clinical Trial to test model efficacy, safety, clinical significance and relevance at a public health level.


2021 ◽  
Author(s):  
Kenichi Tatsumi ◽  
Noa Igarashi ◽  
Xiao Mengxue

Abstract Background The objective of this study is twofold. First, ascertain the important variables that predict tomato yields from plant height (PH) and vegetation index (VI) maps. The maps were derived from images taken by unmanned aerial vehicles (UAVs). Second, examine the accuracy of predictions of tomato fresh shoot masses (SM), fruit weights (FW), and the number of fruits (FN) from multiple machine learning algorithms using selected variable sets. To realize our objective, ultra-high-resolution RGB and multispectral images were collected by a UAV on ten days in 2020’s tomato growing season. From these images, 756 total variables, including first- (e.g., average, standard deviation, skewness, range, and maximum) and second-order (e.g., gray-level co-occurrence matrix features and growth rates of PH and VIs) statistics for each plant, were extracted. Several selection algorithms (i.e., Boruta, DALEX, genetic algorithm, least absolute shrinkage and selection operator, and recursive feature elimination) were used to select the variable sets useful for predicting SM, FW, and FN. Random forests, ridge regressions, and support vector machines were used to predict the yield using the top five selected variable sets. Results First-order statistics of PH and VIs collected during the early to mid-fruit formation periods, about one month prior to harvest, were important variables for predicting SM. Similar to the case for SM, variables collected approximately one month prior to harvest were important for predicting FW and FN. Furthermore, variables related to PH were unimportant for prediction. Compared with predictions obtained using only first-order statistics, those obtained using the second-order statistics of VIs were more accurate for FW and FN. Conclusions In addition to basic statistics (e.g., average and standard deviation), we derived second-order statistics of PH and VIs at the plant level using the ultra-high resolution UAV images. Our findings indicated that our variable selection method reduced the number variables needed for tomato yield prediction, improving the efficiency of phenotypic data collection and assisting with the selection of high-yield lines within breeding programs.


Plant Methods ◽  
2021 ◽  
Vol 17 (1) ◽  
Author(s):  
Kenichi Tatsumi ◽  
Noa Igarashi ◽  
Xiao Mengxue

Abstract Background The objective of this study is twofold. First, ascertain the important variables that predict tomato yields from plant height (PH) and vegetation index (VI) maps. The maps were derived from images taken by unmanned aerial vehicles (UAVs). Second, examine the accuracy of predictions of tomato fresh shoot masses (SM), fruit weights (FW), and the number of fruits (FN) from multiple machine learning algorithms using selected variable sets. To realize our objective, ultra-high-resolution RGB and multispectral images were collected by a UAV on ten days in 2020’s tomato growing season. From these images, 756 total variables, including first- (e.g., average, standard deviation, skewness, range, and maximum) and second-order (e.g., gray-level co-occurrence matrix features and growth rates of PH and VIs) statistics for each plant, were extracted. Several selection algorithms (i.e., Boruta, DALEX, genetic algorithm, least absolute shrinkage and selection operator, and recursive feature elimination) were used to select the variable sets useful for predicting SM, FW, and FN. Random forests, ridge regressions, and support vector machines were used to predict the yield using the top five selected variable sets. Results First-order statistics of PH and VIs collected during the early to mid-fruit formation periods, about one month prior to harvest, were important variables for predicting SM. Similar to the case for SM, variables collected approximately one month prior to harvest were important for predicting FW and FN. Furthermore, variables related to PH were unimportant for prediction. Compared with predictions obtained using only first-order statistics, those obtained using the second-order statistics of VIs were more accurate for FW and FN. The prediction accuracy of SM, FW, and FN by models constructed from all variables (rRMSE = 8.8–28.1%) was better than that from first-order statistics (rRMSE = 10.0–50.1%). Conclusions In addition to basic statistics (e.g., average and standard deviation), we derived second-order statistics of PH and VIs at the plant level using the ultra-high resolution UAV images. Our findings indicated that our variable selection method reduced the number variables needed for tomato yield prediction, improving the efficiency of phenotypic data collection and assisting with the selection of high-yield lines within breeding programs.


1997 ◽  
Vol 36 (04/05) ◽  
pp. 315-318 ◽  
Author(s):  
K. Momose ◽  
K. Komiya ◽  
A. Uchiyama

Abstract:The relationship between chromatically modulated stimuli and visual evoked potentials (VEPs) was considered. VEPs of normal subjects elicited by chromatically modulated stimuli were measured under several color adaptations, and their binary kernels were estimated. Up to the second-order, binary kernels obtained from VEPs were so characteristic that the VEP-chromatic modulation system showed second-order nonlinearity. First-order binary kernels depended on the color of the stimulus and adaptation, whereas second-order kernels showed almost no difference. This result indicates that the waveforms of first-order binary kernels reflect perceived color (hue). This supports the suggestion that kernels of VEPs include color responses, and could be used as a probe with which to examine the color visual system.


2017 ◽  
Vol 9 (3) ◽  
pp. 17-30
Author(s):  
Kelly James Clark

In Branden Thornhill-Miller and Peter Millican’s challenging and provocative essay, we hear a considerably longer, more scholarly and less melodic rendition of John Lennon’s catchy tune—without religion, or at least without first-order supernaturalisms (the kinds of religion we find in the world), there’d be significantly less intra-group violence. First-order supernaturalist beliefs, as defined by Thornhill-Miller and Peter Millican (hereafter M&M), are “beliefs that claim unique authority for some particular religious tradition in preference to all others” (3). According to M&M, first-order supernaturalist beliefs are exclusivist, dogmatic, empirically unsupported, and irrational. Moreover, again according to M&M, we have perfectly natural explanations of the causes that underlie such beliefs (they seem to conceive of such natural explanations as debunking explanations). They then make a case for second-order supernaturalism, “which maintains that the universe in general, and the religious sensitivities of humanity in particular, have been formed by supernatural powers working through natural processes” (3). Second-order supernaturalism is a kind of theism, more closely akin to deism than, say, Christianity or Buddhism. It is, as such, universal (according to contemporary psychology of religion), empirically supported (according to philosophy in the form of the Fine-Tuning Argument), and beneficial (and so justified pragmatically). With respect to its pragmatic value, second-order supernaturalism, according to M&M, gets the good(s) of religion (cooperation, trust, etc) without its bad(s) (conflict and violence). Second-order supernaturalism is thus rational (and possibly true) and inconducive to violence. In this paper, I will examine just one small but important part of M&M’s argument: the claim that (first-order) religion is a primary motivator of violence and that its elimination would eliminate or curtail a great deal of violence in the world. Imagine, they say, no religion, too.Janusz Salamon offers a friendly extension or clarification of M&M’s second-order theism, one that I think, with emendations, has promise. He argues that the core of first-order religions, the belief that Ultimate Reality is the Ultimate Good (agatheism), is rational (agreeing that their particular claims are not) and, if widely conceded and endorsed by adherents of first-order religions, would reduce conflict in the world.While I favor the virtue of intellectual humility endorsed in both papers, I will argue contra M&M that (a) belief in first-order religion is not a primary motivator of conflict and violence (and so eliminating first-order religion won’t reduce violence). Second, partly contra Salamon, who I think is half right (but not half wrong), I will argue that (b) the religious resources for compassion can and should come from within both the particular (often exclusivist) and the universal (agatheistic) aspects of religious beliefs. Finally, I will argue that (c) both are guilty, as I am, of the philosopher’s obsession with belief. 


Sign in / Sign up

Export Citation Format

Share Document