Estimation of Environmental Contours Using a Storm Resampling Method

2021 ◽  
Author(s):  
Edward Mackay ◽  
Philip Jonathan
Keyword(s):  
2021 ◽  
Vol 9 ◽  
Author(s):  
Daniel Lowell Weller ◽  
Tanzy M. T. Love ◽  
Martin Wiedmann

Recent studies have shown that predictive models can supplement or provide alternatives to E. coli-testing for assessing the potential presence of food safety hazards in water used for produce production. However, these studies used balanced training data and focused on enteric pathogens. As such, research is needed to determine 1) if predictive models can be used to assess Listeria contamination of agricultural water, and 2) how resampling (to deal with imbalanced data) affects performance of these models. To address these knowledge gaps, this study developed models that predict nonpathogenic Listeria spp. (excluding L. monocytogenes) and L. monocytogenes presence in agricultural water using various combinations of learner (e.g., random forest, regression), feature type, and resampling method (none, oversampling, SMOTE). Four feature types were used in model training: microbial, physicochemical, spatial, and weather. “Full models” were trained using all four feature types, while “nested models” used between one and three types. In total, 45 full (15 learners*3 resampling approaches) and 108 nested (5 learners*9 feature sets*3 resampling approaches) models were trained per outcome. Model performance was compared against baseline models where E. coli concentration was the sole predictor. Overall, the machine learning models outperformed the baseline E. coli models, with random forests outperforming models built using other learners (e.g., rule-based learners). Resampling produced more accurate models than not resampling, with SMOTE models outperforming, on average, oversampling models. Regardless of resampling method, spatial and physicochemical water quality features drove accurate predictions for the nonpathogenic Listeria spp. and L. monocytogenes models, respectively. Overall, these findings 1) illustrate the need for alternatives to existing E. coli-based monitoring programs for assessing agricultural water for the presence of potential food safety hazards, and 2) suggest that predictive models may be one such alternative. Moreover, these findings provide a conceptual framework for how such models can be developed in the future with the ultimate aim of developing models that can be integrated into on-farm risk management programs. For example, future studies should consider using random forest learners, SMOTE resampling, and spatial features to develop models to predict the presence of foodborne pathogens, such as L. monocytogenes, in agricultural water when the training data is imbalanced.


2020 ◽  
Vol 11 (4) ◽  
pp. 579-589
Author(s):  
Muhamad Husnain Mohd Noh ◽  
Mohd Akramin Mohd Romlay ◽  
Chuan Zun Liang ◽  
Mohd Shamil Shaari ◽  
Akiyuki Takahashi

PurposeFailure of the materials occurs once the stress intensity factor (SIF) overtakes the material fracture toughness. At this level, the crack will grow rapidly resulting in unstable crack growth until a complete fracture happens. The SIF calculation of the materials can be conducted by experimental, theoretical and numerical techniques. Prediction of SIF is crucial to ensure safety life from the material failure. The aim of the simulation study is to evaluate the accuracy of SIF prediction using finite element analysis.Design/methodology/approachThe bootstrap resampling method is employed in S-version finite element model (S-FEM) to generate the random variables in this simulation analysis. The SIF analysis studies are promoted by bootstrap S-version Finite Element Model (BootstrapS-FEM). Virtual crack closure-integral method (VCCM) is an important concept to compute the energy release rate and SIF. The semielliptical crack shape is applied with different crack shape aspect ratio in this simulation analysis. The BootstrapS-FEM produces the prediction of SIFs for tension model.FindingsThe mean of BootstrapS-FEM is calculated from 100 samples by the resampling method. The bounds are computed based on the lower and upper bounds of the hundred samples of BootstrapS-FEM. The prediction of SIFs is validated with Newman–Raju solution and deterministic S-FEM within 95 percent confidence bounds. All possible values of SIF estimation by BootstrapS-FEM are plotted in a graph. The mean of the BootstrapS-FEM is referred to as point estimation. The Newman–Raju solution and deterministic S-FEM values are within the 95 percent confidence bounds. Thus, the BootstrapS-FEM is considered valid for the prediction with less than 6 percent of percentage error.Originality/valueThe bootstrap resampling method is employed in S-FEM to generate the random variables in this simulation analysis.


Biometrika ◽  
2019 ◽  
Vol 106 (4) ◽  
pp. 965-972
Author(s):  
D Lee ◽  
J K Kim ◽  
C J Skinner

Summary A within-cluster resampling method is proposed for fitting a multilevel model in the presence of informative cluster size. Our method is based on the idea of removing the information in the cluster sizes by drawing bootstrap samples which contain a fixed number of observations from each cluster. We then estimate the parameters by maximizing an average, over the bootstrap samples, of a suitable composite loglikelihood. The consistency of the proposed estimator is shown and does not require that the correct model for cluster size is specified. We give an estimator of the covariance matrix of the proposed estimator, and a test for the noninformativeness of the cluster sizes. A simulation study shows, as in Neuhaus & McCulloch (2011), that the standard maximum likelihood estimator exhibits little bias for some regression coefficients. However, for those parameters which exhibit nonnegligible bias, the proposed method is successful in correcting for this bias.


2018 ◽  
Vol 50 (1) ◽  
pp. 166-186 ◽  
Author(s):  
F. Saleh ◽  
V. Ramaswamy ◽  
N. Georgas ◽  
A. F. Blumberg ◽  
J. Pullen

Abstract The objective of this work was to evaluate the benefits of using multi-model meteorological ensembles in representing the uncertainty of hydrologic forecasts. An inter-comparison experiment was performed using meteorological inputs from different models corresponding to Hurricane Irene (2011), over three sub-basins of the Hudson River basin. The ensemble-based precipitation inputs were used as forcing in a hydrological model to retrospectively forecast hourly streamflow, with a 96-hour lead time. The inputs consisted of 73 ensemble members, namely one high-resolution ECMWF deterministic member, 51 ECMWF members and 21 NOAA/ESRL (GEFS Reforecasts v2) members. The precipitation inputs were resampled to a common grid using the bilinear resampling method that was selected upon analysing different resampling methods. The results show the advantages of forcing hydrologic forecasting systems with multi-model ensemble forecasts over using deterministic and single model ensemble forecasts. The work showed that using the median of all 73 ensemble streamflow forecasts relatively improved the Nash–Sutcliffe Efficiency and lowered the biases across the examined sub-basins, compared with using the ensemble median from an individual model. This research contributes to the growing literature that demonstrates the promising capabilities of multi-model systems to better describe the uncertainty in streamflow predictions.


Sign in / Sign up

Export Citation Format

Share Document