scholarly journals Modeling of Combined Processing Steps for ReducingEscherichia coli O157:H7 Populations in Apple Cider

2001 ◽  
Vol 67 (1) ◽  
pp. 133-141 ◽  
Author(s):  
Heidi E. Uljas ◽  
Donald W. Schaffner ◽  
Siobain Duffy ◽  
Lihui Zhao ◽  
Steven C. Ingham

ABSTRACT Probabilistic models were used as a systematic approach to describe the response of Escherichia coli O157:H7 populations to combinations of commonly used preservation methods in unpasteurized apple cider. Using a complete factorial experimental design, the effect of pH (3.1 to 4.3), storage temperature and time (5 to 35°C for 0 to 6 h or 12 h), preservatives (0, 0.05, or 0.1% potassium sorbate or sodium benzoate), and freeze-thaw (F-T; −20°C, 48 h and 4°C, 4 h) treatment combinations (a total of 1,600 treatments) on the probability of achieving a 5-log10-unit reduction in a three-strain E. coli O157:H7 mixture in cider was determined. Using logistic regression techniques, pH, temperature, time, and concentration were modeled in separate segments of the data set, resulting in prediction equations for: (i) no preservatives, before F-T; (ii) no preservatives, after F-T; (iii) sorbate, before F-T; (iv) sorbate, after F-T; (v) benzoate, before F-T; and (vi) benzoate, after F-T. Statistical analysis revealed a highly significant (P < 0.0001) effect of all four variables, with cider pH being the most important, followed by temperature and time, and finally by preservative concentration. All models predicted 92 to 99% of the responses correctly. To ensure safety, use of the models is most appropriate at a 0.9 probability level, where the percentage of false positives, i.e., falsely predicting a 5-log10-unit reduction, is the lowest (0 to 4.4%). The present study demonstrates the applicability of logistic regression approaches to describing the effectiveness of multiple treatment combinations in pathogen control in cider making. The resulting models can serve as valuable tools in designing safe apple cider processes.

2020 ◽  
Vol 21 (1) ◽  
pp. 267-286 ◽  
Author(s):  
Amber J. Dood ◽  
John C. Dood ◽  
Daniel Cruz-Ramírez de Arellano ◽  
Kimberly B. Fields ◽  
Jeffrey R. Raker

Assessments that aim to evaluate student understanding of chemical reactions and reaction mechanisms should ask students to construct written or oral explanations of mechanistic representations; students can reproduce pictorial mechanism representations with minimal understanding of the meaning of the representations. Grading such assessments is time-consuming, which is a limitation for use in large-enrollment courses and for timely feedback for students. Lexical analysis and logistic regression techniques can be used to evaluate student written responses in STEM courses. In this study, we use lexical analysis and logistic regression techniques to score a constructed-response item which aims to evaluate student explanations about what is happening in a unimolecular nucleophilic substitution (i.e., SN1) reaction and why. We identify three levels of student explanation sophistication (i.e., descriptive only, surface level why, and deeper why), and qualitatively describe student reasoning about four main aspects of the reaction: leaving group, carbocation, nucleophile and electrophile, and acid–base proton transfer. Responses scored as Level 1 (N = 113, 11%) include only a description of what is happening in the reaction and do not address the why for any of the four aspects. Level 2 responses (N = 549, 53%) describe why the reaction is occurring at a surface level (i.e., using solely explicit features or mentioning implicit features without deeper explanation) for at least one aspect of the reaction. Level 3 responses (N = 379, 36%) explain the why at a deeper level by inferring implicit features from explicit features explained using electronic effects for at least one reaction aspect. We evaluate the predictive accuracy of two binomial logistic regression models for scoring the responses with these levels, achieving 86.9% accuracy (with the testing data set) when compared to human coding. The lexical analysis methodology and emergent scoring framework could be used as a foundation from which to develop scoring models for a broader array of reaction mechanisms.


Author(s):  
Dhilsath Fathima.M ◽  
S. Justin Samuel ◽  
R. Hari Haran

Aim: This proposed work is used to develop an improved and robust machine learning model for predicting Myocardial Infarction (MI) could have substantial clinical impact. Objectives: This paper explains how to build machine learning based computer-aided analysis system for an early and accurate prediction of Myocardial Infarction (MI) which utilizes framingham heart study dataset for validation and evaluation. This proposed computer-aided analysis model will support medical professionals to predict myocardial infarction proficiently. Methods: The proposed model utilize the mean imputation to remove the missing values from the data set, then applied principal component analysis to extract the optimal features from the data set to enhance the performance of the classifiers. After PCA, the reduced features are partitioned into training dataset and testing dataset where 70% of the training dataset are given as an input to the four well-liked classifiers as support vector machine, k-nearest neighbor, logistic regression and decision tree to train the classifiers and 30% of test dataset is used to evaluate an output of machine learning model using performance metrics as confusion matrix, classifier accuracy, precision, sensitivity, F1-score, AUC-ROC curve. Results: Output of the classifiers are evaluated using performance measures and we observed that logistic regression provides high accuracy than K-NN, SVM, decision tree classifiers and PCA performs sound as a good feature extraction method to enhance the performance of proposed model. From these analyses, we conclude that logistic regression having good mean accuracy level and standard deviation accuracy compared with the other three algorithms. AUC-ROC curve of the proposed classifiers is analyzed from the output figure.4, figure.5 that logistic regression exhibits good AUC-ROC score, i.e. around 70% compared to k-NN and decision tree algorithm. Conclusion: From the result analysis, we infer that this proposed machine learning model will act as an optimal decision making system to predict the acute myocardial infarction at an early stage than an existing machine learning based prediction models and it is capable to predict the presence of an acute myocardial Infarction with human using the heart disease risk factors, in order to decide when to start lifestyle modification and medical treatment to prevent the heart disease.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Martine De Cock ◽  
Rafael Dowsley ◽  
Anderson C. A. Nascimento ◽  
Davis Railsback ◽  
Jianwei Shen ◽  
...  

Abstract Background In biomedical applications, valuable data is often split between owners who cannot openly share the data because of privacy regulations and concerns. Training machine learning models on the joint data without violating privacy is a major technology challenge that can be addressed by combining techniques from machine learning and cryptography. When collaboratively training machine learning models with the cryptographic technique named secure multi-party computation, the price paid for keeping the data of the owners private is an increase in computational cost and runtime. A careful choice of machine learning techniques, algorithmic and implementation optimizations are a necessity to enable practical secure machine learning over distributed data sets. Such optimizations can be tailored to the kind of data and Machine Learning problem at hand. Methods Our setup involves secure two-party computation protocols, along with a trusted initializer that distributes correlated randomness to the two computing parties. We use a gradient descent based algorithm for training a logistic regression like model with a clipped ReLu activation function, and we break down the algorithm into corresponding cryptographic protocols. Our main contributions are a new protocol for computing the activation function that requires neither secure comparison protocols nor Yao’s garbled circuits, and a series of cryptographic engineering optimizations to improve the performance. Results For our largest gene expression data set, we train a model that requires over 7 billion secure multiplications; the training completes in about 26.90 s in a local area network. The implementation in this work is a further optimized version of the implementation with which we won first place in Track 4 of the iDASH 2019 secure genome analysis competition. Conclusions In this paper, we present a secure logistic regression training protocol and its implementation, with a new subprotocol to securely compute the activation function. To the best of our knowledge, we present the fastest existing secure multi-party computation implementation for training logistic regression models on high dimensional genome data distributed across a local area network.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Richard Johnston ◽  
Xiaohan Yan ◽  
Tatiana M. Anderson ◽  
Edwin A. Mitchell

AbstractThe effect of altitude on the risk of sudden infant death syndrome (SIDS) has been reported previously, but with conflicting findings. We aimed to examine whether the risk of sudden unexpected infant death (SUID) varies with altitude in the United States. Data from the Centers for Disease Control and Prevention (CDC)’s Cohort Linked Birth/Infant Death Data Set for births between 2005 and 2010 were examined. County of birth was used to estimate altitude. Logistic regression and Generalized Additive Model (GAM) were used, adjusting for year, mother’s race, Hispanic origin, marital status, age, education and smoking, father’s age and race, number of prenatal visits, plurality, live birth order, and infant’s sex, birthweight and gestation. There were 25,305,778 live births over the 6-year study period. The total number of deaths from SUID in this period were 23,673 (rate = 0.94/1000 live births). In the logistic regression model there was a small, but statistically significant, increased risk of SUID associated with birth at > 8000 feet compared with < 6000 feet (aOR = 1.93; 95% CI 1.00–3.71). The GAM showed a similar increased risk over 8000 feet, but this was not statistically significant. Only 9245 (0.037%) of mothers gave birth at > 8000 feet during the study period and 10 deaths (0.042%) were attributed to SUID. The number of SUID deaths at this altitude in the United States is very small (10 deaths in 6 years).


2012 ◽  
Vol 5 (1) ◽  
pp. 83-91 ◽  
Author(s):  
Sara C. P. Lovtang ◽  
Gregg M. Riegel

AbstractWhere the nonnative annual grass downy brome proliferates, it has changed ecosystem processes, such as nutrient, energy, and water cycles; successional pathways; and fire regimes. The objective of this study was to develop a model that predicts the presence of downy brome in Central Oregon and to test whether high presence correlates with greater cover. Understory data from the U.S. Department of Agriculture (USDA) Forest Service's Current Vegetation Survey (CVS) database for the Deschutes National Forest, the Ochoco National Forest, and the Crooked River National Grassland were compiled, and the presence of downy brome was determined for 1,092 systematically located plots. Logistic regression techniques were used to develop models for predicting downy brome populations. For the landscape including the eastside of the Cascade Mountains to the northwestern edge of the Great Basin, the following were selected as the best predictors of downy brome: low average March precipitation, warm minimum May temperature, few total trees per acre, many western junipers per acre, and a short distance to nearest road. The concordance index = 0.92. Using the equation from logistic regression, a probability for downy brome infestation was calculated for each CVS plot. The plots were assigned to a plant association group (PAG), and the average probability was calculated for the PAGs in which the CVS plots were located. This method could be duplicated in other areas where vegetation inventories take place.


2021 ◽  
Vol 29 ◽  
pp. 287-295
Author(s):  
Zhiming Zhou ◽  
Haihui Huang ◽  
Yong Liang

BACKGROUND: In genome research, it is particularly important to identify molecular biomarkers or signaling pathways related to phenotypes. Logistic regression model is a powerful discrimination method that can offer a clear statistical explanation and obtain the classification probability of classification label information. However, it is unable to fulfill biomarker selection. OBJECTIVE: The aim of this paper is to give the model efficient gene selection capability. METHODS: In this paper, we propose a new penalized logsum network-based regularization logistic regression model for gene selection and cancer classification. RESULTS: Experimental results on simulated data sets show that our method is effective in the analysis of high-dimensional data. For a large data set, the proposed method has achieved 89.66% (training) and 90.02% (testing) AUC performances, which are, on average, 5.17% (training) and 4.49% (testing) better than mainstream methods. CONCLUSIONS: The proposed method can be considered a promising tool for gene selection and cancer classification of high-dimensional biological data.


Circulation ◽  
2016 ◽  
Vol 133 (suppl_1) ◽  
Author(s):  
Nina P Paynter ◽  
Raji Balasubramanian ◽  
Shuba Gopal ◽  
Franco Giulianini ◽  
Leslie Tinker ◽  
...  

Background: Prior studies of metabolomic profiles and coronary heart disease (CHD) have been limited by relatively small case numbers and scant data in women. Methods: The discovery set examined 371 metabolites in 400 confirmed, incident CHD cases and 400 controls (frequency matched on age, race/ethnicity, hysterectomy status and time of enrollment) in the Women’s Health Initiative Observational Study (WHI-OS). All selected metabolites were validated in a separate set of 394 cases and 397 matched controls drawn from the placebo arms of the WHI Hormone Therapy trials and the WHI-OS. Discovery used 4 methods: false-discovery rate (FDR) adjusted logistic regression for individual metabolites, permutation corrected least absolute shrinkage and selection operator (LASSO) algorithms, sparse partial least squares discriminant analysis (PLS-DA) algorithms, and random forest algorithms. Each method was performed with matching factors only and with matching plus both medication use (aspirin, statins, anti-diabetics and anti-hypertensives) and traditional CHD risk factors (smoking, systolic blood pressure, diabetes, total and HDL cholesterol). Replication in the validation set was defined as a logistic regression coefficient of p<0.05 for the metabolites selected by 3 or 4 methods (tier 1), or a FDR adjusted p<0.05 for metabolites selected by only 1 or 2 methods (tier 2). Results: Sixty-seven metabolites were selected in the discovery data set (30 tier 1 and 37 tier 2). Twenty-six successfully replicated in the validation data set (21 tier 1 and 5 tier 2), with 25 significant with adjusting for matching factors only and 11 significant after additionally adjusting for medications and CHD risk factors. Validated metabolites included amino acids, sugars, nucleosides, eicosanoids, plasmologens, polyunsaturated phospholipids and highly saturated triglycerides. These include novel metabolites as well as metabolites such as glutamate/glutamine, which have been shown in other populations. Conclusions: Multiple metabolites in important physiological pathways with robust associations for risk of CHD in women were identified and replicated. These results may offer insights into biological mechanisms of CHD as well as identify potential markers of risk.


2021 ◽  
pp. 107110072110581
Author(s):  
Wenye Song ◽  
Naohiro Shibuya ◽  
Daniel C. Jupiter

Background: Ankle fractures in patients with diabetes mellitus have long been recognized as a challenge to practicing clinicians. Ankle fracture patients with diabetes may experience prolonged healing, higher risk of hardware failure, an increased risk of wound dehiscence and infection, and higher pain scores pre- and postoperatively, compared to patients without diabetes. However, the duration of opioid use among this patient cohort has not been previously evaluated. The purpose of this study is to retrospectively compare the time span of opioid utilization between ankle fracture patients with and without diabetes mellitus. Methods: We conducted a retrospective cohort study using our institution’s TriNetX database. A total of 640 ankle fracture patients were included in the analysis, of whom 73 had diabetes. All dates of opioid use for each patient were extracted from the data set, including the first and last date of opioid prescription. Descriptive analysis and logistic regression models were employed to explore the differences in opioid use between patients with and without diabetes after ankle fracture repair. A 2-tailed P value of .05 was set as the threshold for statistical significance. Results: Logistic regression models revealed that patients with diabetes are less likely to stop using opioids within 90 days, or within 180 days, after repair compared to patients without diabetes. Female sex, neuropathy, and prefracture opioid use are also associated with prolonged opioid use after ankle fracture repair. Conclusion: In our study cohort, ankle fracture patients with diabetes were more likely to require prolonged opioid use after fracture repair. Level of Evidence: Level III, prognostic.


2021 ◽  
Author(s):  
Christian A Betancourt ◽  
Panagiota Kitsantas ◽  
Deborah G Goldberg ◽  
Beth A Hawks

ABSTRACT Introduction Military veterans continue to struggle with addiction even after receiving treatment for substance use disorders (SUDs). Identifying factors that may influence SUD relapse upon receiving treatment in veteran populations is crucial for intervention and prevention efforts. The purpose of this study was to examine risk factors that contribute to SUD relapse upon treatment completion in a sample of U.S. veterans using logistic regression and classification tree analysis. Materials and Methods Data from the 2017 Treatment Episode Data Set—Discharge (TEDS-D) included 40,909 veteran episode observations. Descriptive statistics and multivariable logistic regression analysis were conducted to determine factors associated with SUD relapse after treatment discharge. Classification trees were constructed to identify high-risk subgroups for substance use after discharge from treatment for SUDs. Results Approximately 94% of the veterans relapsed upon discharge from outpatient or residential SUD treatment. Veterans aged 18-34 years old were significantly less likely to relapse than the 35-64 age group (odds ratio [OR] 0.73, 95% confidence interval [CI]: 0.66, 0.82), while males were more likely than females to relapse (OR 1.55, 95% CI: 1.34, 1.79). Unemployed veterans (OR 1.92, 95% CI: 1.67, 2.22) or veterans not in the labor force (OR 1.29, 95% CI: 1.13, 1.47) were more likely to relapse than employed veterans. Homeless vs. independently housed veterans had 3.26 (95% CI: 2.55, 4.17) higher odds of relapse after treatment. Veterans with one arrest vs. none were more likely to relapse (OR 1.52, 95% CI: 1.19, 1.95). Treatment completion was critical to maintain sobriety, as every other type of discharge led to more than double the odds of relapse. Veterans who received care at 24-hour detox facilities were 1.49 (95% CI: 1.23, 1.80) times more likely to relapse than those at rehabilitative/residential treatment facilities. Classification tree analysis indicated that homelessness upon discharge was the most important predictor in SUD relapse among veterans. Conclusion Aside from numerous challenges that veterans face after leaving military service, SUD relapse is intensified by risk factors such as homelessness, unemployment, and insufficient SUD treatment. As treatment and preventive care for SUD relapse is an active field of study, further research on SUD relapse among homeless veterans is necessary to better understand the epidemiology of substance addiction among this vulnerable population. The findings of this study can inform healthcare policy and practices targeting veteran-tailored treatment programs to improve SUD treatment completion and lower substance use after treatment.


2001 ◽  
Vol 64 (10) ◽  
pp. 1584-1591 ◽  
Author(s):  
A. GELMAN ◽  
L. GLATMAN ◽  
V. DRABKIN ◽  
S. HARPAZ

Sensory and microbiological characteristics of pond-raised freshwater silver perch (Bidyanus bidyanus) fish, during cold storage over a period of 25 days were evaluated. Whole fish (averaging 400 g each) were stored in cold storage rooms at either 0 to 2°C, 5°C, or 5°C + potassium sorbate as a preservative. The organoleptic and hypoxanthine test results show that the treatment of potassium sorbate can slow the process of spoilage by about 5 days. Yet, the most important factor affecting the shelf life of these fish is the storage temperature. Keeping the fish at 0 to 2°C can prolong the storage prior to spoilage by 10 days compared with those kept at 5°C. These results obtained through organoleptic tests are corroborated by both the chemical (hypoxanthine and total volatile basic nitrogen) and to some extent by the physical (cosmos) tests. The initial total bacteriological counts were 5 × 102 CFU/cm2 for fish surface and &lt;102 CFU/g for fish flesh, and these counts rose continuously, reaching about 106 CFU/g (0 to 2°C) and 107 CFU/g (5°C) in flesh and 107 to 108 CFU/cm2 on the surface by the end of the storage period. The addition of potassium sorbate led to a smaller increase in bacterial numbers, especially during the first 15 days. Bacterial composition fluctuated during storage. The initial load on the fish surface was predominantly mesophilic and gram positive and consisted mostly (80%) of Micrococci, Bacillus, and Corynebacterium. During the next 10 days, these bacteria were practically replaced by gram-negative flora comprised mostly of Pseudomonas fluorescens that rapidly increased with storage time and accounted for 95% after 15 days.


Sign in / Sign up

Export Citation Format

Share Document