Estimation of Environmental Contours Using a Storm Resampling Method

Recent studies have shown that predictive models can supplement or provide alternatives to E. coli-testing for assessing the potential presence of food safety hazards in water used for produce production. However, these studies used balanced training data and focused on enteric pathogens. As such, research is needed to determine 1) if predictive models can be used to assess Listeria contamination of agricultural water, and 2) how resampling (to deal with imbalanced data) affects performance of these models. To address these knowledge gaps, this study developed models that predict nonpathogenic Listeria spp. (excluding L. monocytogenes) and L. monocytogenes presence in agricultural water using various combinations of learner (e.g., random forest, regression), feature type, and resampling method (none, oversampling, SMOTE). Four feature types were used in model training: microbial, physicochemical, spatial, and weather. “Full models” were trained using all four feature types, while “nested models” used between one and three types. In total, 45 full (15 learners*3 resampling approaches) and 108 nested (5 learners*9 feature sets*3 resampling approaches) models were trained per outcome. Model performance was compared against baseline models where E. coli concentration was the sole predictor. Overall, the machine learning models outperformed the baseline E. coli models, with random forests outperforming models built using other learners (e.g., rule-based learners). Resampling produced more accurate models than not resampling, with SMOTE models outperforming, on average, oversampling models. Regardless of resampling method, spatial and physicochemical water quality features drove accurate predictions for the nonpathogenic Listeria spp. and L. monocytogenes models, respectively. Overall, these findings 1) illustrate the need for alternatives to existing E. coli-based monitoring programs for assessing agricultural water for the presence of potential food safety hazards, and 2) suggest that predictive models may be one such alternative. Moreover, these findings provide a conceptual framework for how such models can be developed in the future with the ultimate aim of developing models that can be integrated into on-farm risk management programs. For example, future studies should consider using random forest learners, SMOTE resampling, and spatial features to develop models to predict the presence of foodborne pathogens, such as L. monocytogenes, in agricultural water when the training data is imbalanced.
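
The abstract above compares no resampling, oversampling, and SMOTE for imbalanced training data. As a rough illustration of the simplest of these, the following Python sketch performs plain random oversampling: minority-class rows are duplicated until all classes are the same size. It is not the study's code. The function name `random_oversample` and the toy data are hypothetical. SMOTE differs in that it synthesizes new minority points by interpolating between neighbouring minority samples instead of copying existing rows.

```python
import random
from collections import Counter


def random_oversample(features, labels, seed=0):
    """Duplicate randomly chosen minority-class rows until every class
    has as many rows as the largest class (hypothetical helper)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_x, out_y = list(features), list(labels)
    for cls, n in counts.items():
        # Rows of this class in the original (un-resampled) data.
        rows = [x for x, y in zip(features, labels) if y == cls]
        for _ in range(target - n):
            out_x.append(rng.choice(rows))
            out_y.append(cls)
    return out_x, out_y


# Toy, made-up data: four negative samples and one positive sample.
X = [[0.1], [0.2], [0.3], [0.4], [0.9]]
y = [0, 0, 0, 0, 1]
X_bal, y_bal = random_oversample(X, y)
print(Counter(y_bal))
```

Resampling of this kind would normally be applied to the training split only, so that evaluation data keep the original class balance.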

RESIDUAL BOOTSTRAP RESAMPLING METHOD FOR MULTIPLE LINEAR REGRESSION MODEL PARAMETER ESTIMATION

Jurnal Litbang Edusaintech ◽

10.51402/jle.v1i1.8 ◽

2020 ◽

Vol 1 (1) ◽

pp. 35-43

Author(s):

Fajar Prihatmono ◽

Moh Yamin Darsyah ◽

Abdul Karim

Keyword(s):

Parameter Estimation ◽

Linear Regression ◽

Regression Model ◽

Multiple Linear Regression ◽

Linear Regression Model ◽

Multiple Linear Regression Model ◽

Bootstrap Resampling ◽

Model Parameter ◽

Model Parameter Estimation ◽

Resampling Method

A Stepwise Resampling Method of Multiple Hypothesis Testing

Journal of the American Statistical Association ◽

10.1080/01621459.1995.10476522 ◽

1995 ◽

Vol 90 (429) ◽

pp. 370-378 ◽

Cited By ~ 41

Author(s):

James F. Troendle

Keyword(s):

Hypothesis Testing ◽

Multiple Hypothesis Testing ◽

Multiple Hypothesis ◽

Resampling Method

Uncertainty-Aware Resampling Method for Imbalanced Classification Using Evidence Theory

10.1007/978-3-030-86772-0_25 ◽

2021 ◽

pp. 342-353

Author(s):

Fares Grina ◽

Zied Elouedi ◽

Eric Lefèvre

Keyword(s):

Evidence Theory ◽

Imbalanced Classification ◽

Resampling Method

Analysis of stress intensity factor for fatigue crack using bootstrap S-version finite element model

International Journal of Structural Integrity ◽

10.1108/ijsi-10-2019-0108 ◽

2020 ◽

Vol 11 (4) ◽

pp. 579-589

Author(s):

Muhamad Husnain Mohd Noh ◽

Mohd Akramin Mohd Romlay ◽

Chuan Zun Liang ◽

Mohd Shamil Shaari ◽

Akiyuki Takahashi

Keyword(s):

Stress Intensity Factor ◽

Finite Element ◽

Finite Element Model ◽

Simulation Analysis ◽

Element Model ◽

Bootstrap Resampling ◽

Confidence Bounds ◽

Content Type ◽

Resampling Method ◽

The Mean

Purpose: Failure of the materials occurs once the stress intensity factor (SIF) overtakes the material fracture toughness. At this level, the crack will grow rapidly resulting in unstable crack growth until a complete fracture happens. The SIF calculation of the materials can be conducted by experimental, theoretical and numerical techniques. Prediction of SIF is crucial to ensure safety life from the material failure. The aim of the simulation study is to evaluate the accuracy of SIF prediction using finite element analysis.

Design/methodology/approach: The bootstrap resampling method is employed in S-version finite element model (S-FEM) to generate the random variables in this simulation analysis. The SIF analysis studies are promoted by bootstrap S-version Finite Element Model (BootstrapS-FEM). Virtual crack closure-integral method (VCCM) is an important concept to compute the energy release rate and SIF. The semielliptical crack shape is applied with different crack shape aspect ratio in this simulation analysis. The BootstrapS-FEM produces the prediction of SIFs for tension model.

Findings: The mean of BootstrapS-FEM is calculated from 100 samples by the resampling method. The bounds are computed based on the lower and upper bounds of the hundred samples of BootstrapS-FEM. The prediction of SIFs is validated with Newman–Raju solution and deterministic S-FEM within 95 percent confidence bounds. All possible values of SIF estimation by BootstrapS-FEM are plotted in a graph. The mean of the BootstrapS-FEM is referred to as point estimation. The Newman–Raju solution and deterministic S-FEM values are within the 95 percent confidence bounds. Thus, the BootstrapS-FEM is considered valid for the prediction with less than 6 percent of percentage error.

Originality/value: The bootstrap resampling method is employed in S-FEM to generate the random variables in this simulation analysis.
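
The findings above describe a point estimate taken as the mean of 100 bootstrap samples, with lower and upper bounds read from those same samples. A generic percentile-bootstrap version of that idea might look like the Python sketch below. It does not involve S-FEM or VCCM. The function name `bootstrap_mean_bounds` and the input values are made up for illustration, and the index rule used for the bounds is one simple choice among several.

```python
import random
import statistics


def bootstrap_mean_bounds(data, n_resamples=100, alpha=0.05, seed=0):
    """Return (point estimate, lower bound, upper bound) for the mean,
    using a plain percentile bootstrap (hypothetical helper)."""
    rng = random.Random(seed)
    # Each resample draws len(data) values with replacement.
    means = sorted(
        statistics.fmean(rng.choices(data, k=len(data)))
        for _ in range(n_resamples)
    )
    # Approximate percentile positions in the sorted bootstrap means.
    lo_idx = int((alpha / 2) * (n_resamples - 1))
    hi_idx = int(round((1 - alpha / 2) * (n_resamples - 1)))
    return statistics.fmean(means), means[lo_idx], means[hi_idx]


# Made-up input values; these are not SIF results from the paper.
values = [1.02, 0.97, 1.10, 0.95, 1.05, 0.99, 1.08, 1.01]
point, lower, upper = bootstrap_mean_bounds(values)
print(point, lower, upper)
```

A reference value (for example, an analytical solution) would then be judged consistent with the bootstrap result if it falls between `lower` and `upper`.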

Resampling Method for Correction of Laser Tuning Fluctuations in OFDR Sensing System

2019 12th International Conference on Measurement ◽

10.23919/measurement47340.2019.8780066 ◽

2019 ◽

Author(s):

Jozefa Cervenova ◽

Jozef Jasenek ◽

Norbert Kaplan

Keyword(s):

Sensing System ◽

Resampling Method ◽

Laser Tuning

Within-cluster resampling for multilevel models under informative cluster size

Biometrika ◽

10.1093/biomet/asz035 ◽

2019 ◽

Vol 106 (4) ◽

pp. 965-972

Author(s):

D Lee ◽

J K Kim ◽

C J Skinner

Keyword(s):

Maximum Likelihood ◽

Cluster Size ◽

Multilevel Model ◽

Multilevel Models ◽

Fixed Number ◽

Regression Coefficients ◽

Likelihood Estimator ◽

Correct Model ◽

Resampling Method ◽

Informative Cluster Size

Summary A within-cluster resampling method is proposed for fitting a multilevel model in the presence of informative cluster size. Our method is based on the idea of removing the information in the cluster sizes by drawing bootstrap samples which contain a fixed number of observations from each cluster. We then estimate the parameters by maximizing an average, over the bootstrap samples, of a suitable composite loglikelihood. The consistency of the proposed estimator is shown and does not require that the correct model for cluster size is specified. We give an estimator of the covariance matrix of the proposed estimator, and a test for the noninformativeness of the cluster sizes. A simulation study shows, as in Neuhaus & McCulloch (2011), that the standard maximum likelihood estimator exhibits little bias for some regression coefficients. However, for those parameters which exhibit nonnegligible bias, the proposed method is successful in correcting for this bias.
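
The core idea in this summary, drawing a fixed number of observations from each cluster so that cluster size carries no information, can be sketched in Python as follows. This is only an illustration under stated assumptions. It averages a simple sample mean over the resamples instead of maximizing an averaged composite loglikelihood as the paper does, it samples with replacement within each cluster, and the names `within_cluster_resample` and `averaged_estimate` as well as the data are hypothetical.

```python
import random
import statistics


def within_cluster_resample(clusters, per_cluster, rng):
    """Draw the same number of observations from every cluster
    (with replacement, which is an assumption of this sketch)."""
    sample = []
    for observations in clusters.values():
        sample.extend(rng.choices(observations, k=per_cluster))
    return sample


def averaged_estimate(clusters, per_cluster=1, n_resamples=200, seed=0):
    """Average a simple estimator (the sample mean) over many
    within-cluster resamples."""
    rng = random.Random(seed)
    estimates = [
        statistics.fmean(within_cluster_resample(clusters, per_cluster, rng))
        for _ in range(n_resamples)
    ]
    return statistics.fmean(estimates)


# Made-up data: one large cluster and one small cluster.
clusters = {"a": [1.0, 1.0, 1.0, 1.0, 1.0, 1.0], "b": [3.0]}
pooled = statistics.fmean(v for obs in clusters.values() for v in obs)
resampled = averaged_estimate(clusters)
print(pooled, resampled)
```

In this toy case the pooled mean (9/7) is pulled toward the large cluster, while the within-cluster resampling estimate is 2.0 because each cluster contributes equally to every resample.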

Inter-comparison between retrospective ensemble streamflow forecasts using meteorological inputs from ECMWF and NOAA/ESRL in the Hudson River sub-basins during Hurricane Irene (2011)

Hydrology Research ◽

10.2166/nh.2018.182 ◽

2018 ◽

Vol 50 (1) ◽

pp. 166-186 ◽

Cited By ~ 3

Author(s):

F. Saleh ◽

V. Ramaswamy ◽

N. Georgas ◽

A. F. Blumberg ◽

J. Pullen

Keyword(s):

Lead Time ◽

Hudson River ◽

Model Systems ◽

Model Ensemble ◽

Single Model ◽

Ensemble Forecasts ◽

Resampling Methods ◽

Hurricane Irene ◽

Resampling Method ◽

Streamflow Forecasts

Abstract The objective of this work was to evaluate the benefits of using multi-model meteorological ensembles in representing the uncertainty of hydrologic forecasts. An inter-comparison experiment was performed using meteorological inputs from different models corresponding to Hurricane Irene (2011), over three sub-basins of the Hudson River basin. The ensemble-based precipitation inputs were used as forcing in a hydrological model to retrospectively forecast hourly streamflow, with a 96-hour lead time. The inputs consisted of 73 ensemble members, namely one high-resolution ECMWF deterministic member, 51 ECMWF members and 21 NOAA/ESRL (GEFS Reforecasts v2) members. The precipitation inputs were resampled to a common grid using the bilinear resampling method that was selected upon analysing different resampling methods. The results show the advantages of forcing hydrologic forecasting systems with multi-model ensemble forecasts over using deterministic and single model ensemble forecasts. The work showed that using the median of all 73 ensemble streamflow forecasts relatively improved the Nash–Sutcliffe Efficiency and lowered the biases across the examined sub-basins, compared with using the ensemble median from an individual model. This research contributes to the growing literature that demonstrates the promising capabilities of multi-model systems to better describe the uncertainty in streamflow predictions.
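
The abstract above scores the median of the ensemble streamflow forecasts with the Nash–Sutcliffe Efficiency (NSE), which is defined as one minus the ratio of the squared forecast error to the variance of the observations around their mean. A small Python sketch of that calculation follows. The helper names `ensemble_median` and `nash_sutcliffe` are hypothetical, and the flow values are made up and are not from the Hudson River study.

```python
import statistics


def ensemble_median(members):
    """Median across ensemble members at each time step."""
    return [statistics.median(step) for step in zip(*members)]


def nash_sutcliffe(observed, simulated):
    """NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    mean_obs = statistics.fmean(observed)
    num = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    den = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - num / den


# Made-up hourly flows: one observed series and three ensemble members.
observed = [10.0, 20.0, 30.0, 20.0]
members = [
    [8.0, 18.0, 33.0, 22.0],
    [10.0, 21.0, 30.0, 19.0],
    [12.0, 25.0, 28.0, 20.0],
]
median_forecast = ensemble_median(members)
nse = nash_sutcliffe(observed, median_forecast)
print(median_forecast, nse)
```

An NSE of 1 indicates a perfect match, and values at or below 0 indicate a forecast no better than the observed mean.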
