Measuring dependence in joint distributions of yield and weather variables

2011 ◽  
Vol 71 (1) ◽  
pp. 120-141 ◽  
Author(s):  
Raushan Bokusheva
2009 ◽  
Author(s):  
David V. Budescu ◽  
Ali E. Abbas ◽  
Yuhong Gu
Keyword(s):  

Author(s):  
Maria Ulan ◽  
Welf Löwe ◽  
Morgan Ericsson ◽  
Anna Wingkvist

AbstractA quality model is a conceptual decomposition of an abstract notion of quality into relevant, possibly conflicting characteristics and further into measurable metrics. For quality assessment and decision making, metrics values are aggregated to characteristics and ultimately to quality scores. Aggregation has often been problematic as quality models do not provide the semantics of aggregation. This makes it hard to formally reason about metrics, characteristics, and quality. We argue that aggregation needs to be interpretable and mathematically well defined in order to assess, to compare, and to improve quality. To address this challenge, we propose a probabilistic approach to aggregation and define quality scores based on joint distributions of absolute metrics values. To evaluate the proposed approach and its implementation under realistic conditions, we conduct empirical studies on bug prediction of ca. 5000 software classes, maintainability of ca. 15000 open-source software systems, and on the information quality of ca. 100000 real-world technical documents. We found that our approach is feasible, accurate, and scalable in performance.


Agronomy ◽  
2021 ◽  
Vol 11 (6) ◽  
pp. 1207
Author(s):  
Gonçalo C. Rodrigues ◽  
Ricardo P. Braga

This study aims to evaluate NASA POWER reanalysis products for daily surface maximum (Tmax) and minimum (Tmin) temperatures, solar radiation (Rs), relative humidity (RH) and wind speed (Ws) when compared with observed data from 14 distributed weather stations across Alentejo Region, Southern Portugal, with a hot summer Mediterranean climate. Results showed that there is good agreement between NASA POWER reanalysis and observed data for all parameters, except for wind speed, with coefficient of determination (R2) higher than 0.82, with normalized root mean square error (NRMSE) varying, from 8 to 20%, and a normalized mean bias error (NMBE) ranging from –9 to 26%, for those variables. Based on these results, and in order to improve the accuracy of the NASA POWER dataset, two bias corrections were performed to all weather variables: one for the Alentejo Region as a whole; another, for each location individually. Results improved significantly, especially when a local bias correction is performed, with Tmax and Tmin presenting an improvement of the mean NRMSE of 6.6 °C (from 8.0 °C) and 16.1 °C (from 20.5 °C), respectively, while a mean NMBE decreased from 10.65 to 0.2%. Rs results also show a very high goodness of fit with a mean NRMSE of 11.2% and mean NMBE equal to 0.1%. Additionally, bias corrected RH data performed acceptably with an NRMSE lower than 12.1% and an NMBE below 2.1%. However, even when a bias correction is performed, Ws lacks the performance showed by the remaining weather variables, with an NRMSE never lower than 19.6%. Results show that NASA POWER can be useful for the generation of weather data sets where ground weather stations data is of missing or unavailable.


1972 ◽  
Vol 31 (1) ◽  
pp. 131-140 ◽  
Author(s):  
Donald W. Zimmerman

The concepts of random error and reliability of measurements that are familiar in traditional theories based on the notions of “true values” and “errors” can be represented by a probability model having a simpler formal structure and fewer special assumptions about random sampling and independence of measurements. In this model formulas that relate observable events are derived from probability axioms and from primitive terms that refer to observable events, without an intermediate structure containing variances and correlations of “true” and “error” components of scores. While more economical in language and formalism, the model at the same time is more general than classical theories and applies to stochastic processes in which joint distributions of many dependent random variables are of interest. In addition, it clarifies some long-standing problems concerning “experimental independence” of measurements and the relation of sampling of individuals to sampling of measurements.


Sign in / Sign up

Export Citation Format

Share Document