Wine Quality Prediction Using Machine Learning

Abstract: Nowadays people are living a luxurious lifestyle, wine has become a part of one's culture. consumption of wine is very common throughout the world so its quality is very important. hence its important to analyse wine quality quality of the wines are usually checked by humans through tasting but it has other physicochemical attributes which affects the taste but the process is slow hence machine learning methods can be used for the same. dataset is taken and feature selection is done using pca feature selection and then accuracy is find using SVM, backpropagation neural network and Random forest algorithm to find which model fits best and gives greater accuracy. Keywords: Data Extraction, PCA, SVM,BP neural network, Randomforest

Download Full-text

How to Guarantee Food Safety via Grain Storage? An Approach to Improve Management Effectiveness by Machine Learning Algorithms

Journal of Biomedical Research & Environmental Sciences ◽

10.37871/jbres1296 ◽

2021 ◽

Vol 2 (8) ◽

pp. 675-684

Author(s):

Jin Wang ◽

Youjun Jiang ◽

Li Li ◽

Chao Yang ◽

Ke Li ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Support Vector Machine ◽

Prediction Model ◽

Bp Neural Network ◽

Machine Learning Algorithms ◽

Support Vector ◽

Grain Storage ◽

Management Effectiveness

The purpose of grain storage management is to dynamically analyze the quality change of the reserved grains, adopt scientific and effective management methods to delay the speed of the quality deterioration, and reduce the loss rate during storage. At present, the supervision of the grain quality in the reserve mainly depends on the periodic measurements of the quality of the grains and the milled products. The data obtained by the above approach is accurate and reliable, but the workload is too large while the frequency is high. The obtained conclusions are also limited to the studied area and not applicable to be extended into other scenarios. Therefore, there is an urgent need of a general method that can quickly predict the quality of grains given different species, regions and storage periods based on historical data. In this study, we introduced Back-Propagation (BP) neural network algorithm and support vector machine algorithm into the quality prediction of the reserved grains. We used quality index, temperature and humidity data to build both an intertemporal prediction model and a synchronous prediction model. The results show that the BP neural network based on the storage characters from the first three periods can accurately predict the key storage characters intertemporally. The support vector machine can provide precise predictions of the key storage characters synchronously. The average predictive error for each of wheat, rice and corn is less than 15%, while the one for soybean is about 20%, all of which can meet the practical demands. In conclusion, the machine learning algorithms are helpful to improve the management effectiveness of grain storage.

Download Full-text

The Ensembles of Machine Learning Methods for Survival Predicting after Kidney Transplantation

Applied Sciences ◽

10.3390/app112110380 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10380

Author(s):

Yaroslav Tolstyak ◽

Rostyslav Zhuk ◽

Igor Yakovlev ◽

Nataliya Shakhovska ◽

Michal Gregus ml ◽

...

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Predictive Models ◽

Survival Prediction ◽

Pairwise Correlation ◽

Factors Associated ◽

Machine Learning Methods ◽

Early Graft ◽

Machine Learning Models

Machine learning is used to develop predictive models to diagnose different diseases, particularly kidney transplant survival prediction. The paper used the collected dataset of patients’ individual parameters to predict the critical risk factors associated with early graft rejection. Our study shows the high pairwise correlation between a massive subset of the parameters listed in the dataset. Hence the proper feature selection is needed to increase the quality of a prediction model. Several methods are used for feature selection, and results are summarized using hard voting. Modeling the onset of critical events for the elements of a particular set is made based on the Kapplan-Meier method. Four novel ensembles of machine learning models are built on selected features for the classification task. Proposed stacking allows obtaining an accuracy, sensitivity, and specifity of more than 0.9. Further research will include the development of a two-stage predictor.

Download Full-text

Hybrid Machine Learning Approaches and a Systematic Model Selection Process for Predicting Soot Emissions in Compression Ignition Engines

Energies ◽

10.3390/en14237865 ◽

2021 ◽

Vol 14 (23) ◽

pp. 7865

Author(s):

Saeid Shahpouri ◽

Armin Norouzi ◽

Christopher Hayduk ◽

Reza Rezaei ◽

Mahdi Shahbakhti ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Feature Selection ◽

Black Box ◽

Box Model ◽

Feature Sets ◽

Machine Learning Methods ◽

Box Models ◽

Soot Emissions ◽

Black Box Models

The standards for emissions from diesel engines are becoming more stringent and accurate emission modeling is crucial in order to control the engine to meet these standards. Soot emissions are formed through a complex process and are challenging to model. A comprehensive analysis of diesel engine soot emissions modeling for control applications is presented in this paper. Physical, black-box, and gray-box models are developed for soot emissions prediction. Additionally, different feature sets based on the least absolute shrinkage and selection operator (LASSO) feature selection method and physical knowledge are examined to develop computationally efficient soot models with good precision. The physical model is a virtual engine modeled in GT-Power software that is parameterized using a portion of experimental data. Different machine learning methods, including Regression Tree (RT), Ensemble of Regression Trees (ERT), Support Vector Machines (SVM), Gaussian Process Regression (GPR), Artificial Neural Network (ANN), and Bayesian Neural Network (BNN) are used to develop the black-box models. The gray-box models include a combination of the physical and black-box models. A total of five feature sets and eight different machine learning methods are tested. An analysis of the accuracy, training time and test time of the models is performed using the K-means clustering algorithm. It provides a systematic way for categorizing the feature sets and methods based on their performance and selecting the best method for a specific application. According to the analysis, the black-box model consisting of GPR and feature selection by LASSO shows the best performance with test R2 of 0.96. The best gray-box model consists of SVM-based method with physical insight feature set along with LASSO for feature selection with test R2 of 0.97.

Download Full-text

Use of Machine Learning and Deep Learning to Predict the Outcomes of Major League Baseball Matches

Applied Sciences ◽

10.3390/app11104499 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4499

Author(s):

Mei-Ling Huang ◽

Yun-Zhi Li

Keyword(s):

Neural Network ◽

Machine Learning ◽

Feature Selection ◽

Deep Learning ◽

Prediction Accuracy ◽

Major League Baseball ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Major League

Major League Baseball (MLB) is the highest level of professional baseball in the world and accounts for some of the most popular international sporting events. Many scholars have conducted research on predicting the outcome of MLB matches. The accuracy in predicting the results of baseball games is low. Therefore, deep learning and machine learning methods were used to build models for predicting the outcomes (win/loss) of MLB matches and investigate the differences between the models in terms of their performance. The match data of 30 teams during the 2019 MLB season with only the starting pitcher or with all pitchers in the pitcher category were collected to compare the prediction accuracy. A one-dimensional convolutional neural network (1DCNN), a traditional machine learning artificial neural network (ANN), and a support vector machine (SVM) were used to predict match outcomes with fivefold cross-validation to evaluate model performance. The highest prediction accuracies were 93.4%, 93.91%, and 93.90% with the 1DCNN, ANN, SVM models, respectively, before feature selection; after feature selection, the highest accuracies obtained were 94.18% and 94.16% with the ANN and SVM models, respectively. The prediction results obtained with the three models were similar, and the prediction accuracies were much higher than those obtained in related studies. Moreover, a 1DCNN was used for the first time for predicting the outcome of MLB matches, and it achieved a prediction accuracy similar to that achieved by machine learning methods.

Download Full-text

A Method of Ore Blending Based on the Quality of Beneficiation and Its Application in a Concentrator

Applied Sciences ◽

10.3390/app11115092 ◽

2021 ◽

Vol 11 (11) ◽

pp. 5092

Author(s):

Bingyu Liu ◽

Dingsen Zhang ◽

Xianwen Gao

Keyword(s):

Neural Network ◽

Bp Neural Network ◽

Industrial Test ◽

Modeling Method ◽

Daily Work ◽

Production Site ◽

Neural Network Algorithm ◽

The Relationship ◽

Ore Dressing

Ore blending is an essential part of daily work in the concentrator. Qualified ore dressing products can make the ore dressing more smoothly. The existing ore blending modeling usually only considers the quality of ore blending products and ignores the effect of ore blending on ore dressing. This research proposes an ore blending modeling method based on the quality of the beneficiation concentrate. The relationship between the properties of ore blending products and the total concentrate recovery is fitted by the ABC-BP neural network algorithm, taken as the optimization goal to guarantee the quality of ore dressing products at the source. The ore blending system was developed and operated stably on the production site. The industrial test and actual production results have proved the effectiveness and reliability of this method.

Download Full-text

Predicting the Quality of High-power Connector Joints with Different Machine Learning Methods

2020 10th International Electric Drives Production Conference (EDPC) ◽

10.1109/edpc51184.2020.9388211 ◽

2020 ◽

Author(s):

Elisabeth Birgit Schwarz ◽

Fabian Bleier ◽

Jean-Pierre Bergmann

Keyword(s):

Machine Learning ◽

High Power ◽

Learning Methods ◽

Machine Learning Methods

Download Full-text

Prediction of sports attendance: A comparative analysis

Proceedings of the Institution of Mechanical Engineers Part P Journal of Sports Engineering and Technology ◽

10.1177/1754337120983135 ◽

2020 ◽

pp. 175433712098313

Author(s):

Mehmet Şahin ◽

Murat Uçar

Keyword(s):

Neural Network ◽

Machine Learning ◽

Comparative Analysis ◽

National Football League ◽

Relevant Literature ◽

Performance Evaluations ◽

Influential Factor ◽

Gradient Boosting ◽

Home Team ◽

Machine Learning Methods

In this study, a comparative analysis for predicting sports attendance demand is presented based on econometric, artificial intelligence, and machine learning methodologies. Data from more than 20,000 games from three major leagues, namely the National Basketball Association (NBA), National Football League (NFL), and Major League Baseball (MLB), were used for training and testing the approaches. The relevant literature was examined to determine the most useful variables as potential regressors in forecasting. To reveal the most effective approach, three scenarios containing seven cases were constructed. In the first scenario, each league was evaluated separately. In the second scenario, the three possible combinations of league pairings were evaluated, while in the third scenario, all three leagues were evaluated together. The performance evaluations of the results suggest that one of the machine learning methods, Gradient Boosting, outperformed the other methods used. However, the Artificial Neural Network, deep Convolutional Neural Network, and Decision Trees also provided productive and competitive predictions for sports games. Based on the results, the predictions for the NBA and NFL leagues are more satisfactory than the predictions of the MLB, which may be caused by the structure of the MLB. The results of the sensitivity analysis indicate that the performance of the home team is the most influential factor for all three leagues.

Download Full-text

Feature Selection and Machine Learning Methods for Optimal Identification and Prediction of Subtypes in Parkinson's Disease

Computer Methods and Programs in Biomedicine ◽

10.1016/j.cmpb.2021.106131 ◽

2021 ◽

pp. 106131

Author(s):

Mohammad R. Salmanpour ◽

Mojtaba Shamsaei ◽

Arman Rahmim

Keyword(s):

Machine Learning ◽

Parkinson’S Disease ◽

Parkinson's Disease ◽

Feature Selection ◽

Learning Methods ◽

Machine Learning Methods

Download Full-text

Statistics and machine learning methods for EHR data – From data extraction to data analytics, by Hulin Wu et al., CRC Press

Journal of Biopharmaceutical Statistics ◽

10.1080/10543406.2021.1928833 ◽

2021 ◽

pp. 1-2

Author(s):

Madan G. Kundu

Keyword(s):

Machine Learning ◽

Data Analytics ◽

Data Extraction ◽

Learning Methods ◽

Machine Learning Methods

Download Full-text

Possibility of Autonomous Estimation of Shiba Goat’s Estrus and Non-Estrus Behavior by Machine Learning Methods

Animals ◽

10.3390/ani10050771 ◽

2020 ◽

Vol 10 (5) ◽

pp. 771

Author(s):

Toshiya Arakawa

Keyword(s):

Neural Network ◽

Machine Learning ◽

Random Forest ◽

Markov Models ◽

Tracking System ◽

Video Tracking ◽

Training Data ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods

Mammalian behavior is typically monitored by observation. However, direct observation requires a substantial amount of effort and time, if the number of mammals to be observed is sufficiently large or if the observation is conducted for a prolonged period. In this study, machine learning methods as hidden Markov models (HMMs), random forests, support vector machines (SVMs), and neural networks, were applied to detect and estimate whether a goat is in estrus based on the goat’s behavior; thus, the adequacy of the method was verified. Goat’s tracking data was obtained using a video tracking system and used to estimate whether they, which are in “estrus” or “non-estrus”, were in either states: “approaching the male”, or “standing near the male”. Totally, the PC of random forest seems to be the highest. However, The percentage concordance (PC) value besides the goats whose data were used for training data sets is relatively low. It is suggested that random forest tend to over-fit to training data. Besides random forest, the PC of HMMs and SVMs is high. However, considering the calculation time and HMM’s advantage in that it is a time series model, HMM is better method. The PC of neural network is totally low, however, if the more goat’s data were acquired, neural network would be an adequate method for estimation.

Download Full-text