Marginal Learning Algorithms in Statistical Machine Learning

Medical diagnoses have important implications for improving patient care, research, and policy. For a medical diagnosis, health professionals use different kinds of pathological methods to make decisions on medical reports in terms of the patients’ medical conditions. Recently, clinicians have been actively engaged in improving medical diagnoses. The use of artificial intelligence and machine learning in combination with clinical findings has further improved disease detection. In the modern era, with the advantage of computers and technologies, one can collect data and visualize many hidden outcomes such as dealing with missing data in medical research. Statistical machine learning algorithms based on specific problems can assist one to make decisions. Machine learning (ML), data-driven algorithms can be utilized to validate existing methods and help researchers to make potential new decisions. The purpose of this study was to extract significant predictors for liver disease from the medical analysis of 615 humans using ML algorithms. Data visualizations were implemented to reveal significant findings such as missing values. Multiple imputations by chained equations (MICEs) were applied to generate missing data points, and principal component analysis (PCA) was used to reduce the dimensionality. Variable importance ranking using the Gini index was implemented to verify significant predictors obtained from the PCA. Training data (ntrain=399) for learning and testing data (ntest=216) in the ML methods were used for predicting classifications. The study compared binary classifier machine learning algorithms (i.e., artificial neural network, random forest (RF), and support vector machine), which were utilized on a published liver disease data set to classify individuals with liver diseases, which will allow health professionals to make a better diagnosis. The synthetic minority oversampling technique was applied to oversample the minority class to regulate overfitting problems. The RF significantly contributed (p<0.001) to a higher accuracy score of 98.14% compared to the other methods. Thus, this suggests that ML methods predict liver disease by incorporating the risk factors, which may improve the inference-based diagnosis of patients.

Download Full-text

Music Signal Analysis: Regression Analysis

10.5121/csit.2021.111205 ◽

2021 ◽

Author(s):

V. N. Aditya Datta Chivukula ◽

Sri Keshava Reddy Adupala

Keyword(s):

Machine Learning ◽

Regression Analysis ◽

Deep Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Statistical Machine Learning ◽

Ongoing Research ◽

Music Signal ◽

Learning Techniques

Machine learning techniques have become a vital part of every ongoing research in technical areas. In recent times the world has witnessed many beautiful applications of machine learning in a practical sense which amaze us in every aspect. This paper is all about whether we should always rely on deep learning techniques or is it really possible to overcome the performance of simple deep learning algorithms by simple statistical machine learning algorithms by understanding the application and processing the data so that it can help in increasing the performance of the algorithm by a notable amount. The paper mentions the importance of data pre-processing than that of the selection of the algorithm. It discusses the functions involving trigonometric, logarithmic, and exponential terms and also talks about functions that are purely trigonometric. Finally, we discuss regression analysis on music signals.

Download Full-text

Applications of Statistical Machine Learning Algorithms in Agriculture Management Processes

10.1109/ispcc53510.2021.9609476 ◽

2021 ◽

Author(s):

Karri Divya Jyothi ◽

M. S. R. Sekhar ◽

Sanjeev Kumar

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Statistical Machine Learning ◽

Management Processes ◽

Agriculture Management

Download Full-text

What transcends the Algorithm - A Critique of the Notions of a Postdigital and a Subsymbolic

10.33767/osf.io/nqdbf ◽

2021 ◽

Author(s):

Bastian Weiß

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Optimization Methods ◽

Computational Logic ◽

Statistical Machine Learning ◽

New Paradigm ◽

Human Thinking ◽

Digital Computers

Computational logic has, although since its breakthrough with the emergence of digital computers there has always been doubt, mostly been seen as something very different from human thinking; one can e.g. refer to Dreyfus’ famous criticism about what computers can’t do. Facing statistical machine learning as a new paradigm of computing, many seem to think that these lines are getting somewhat blurry. Learning algorithms, their functions not longer explicitly coded, but acquired via optimization methods, are seen as a kind of third mode, located somewhere between classical computational paradigms and human thinking. This view seems to manifest itself in the notions of postdigital and subsymbolic computing. I will argue that this view is mistaken, and machine learning does not soften boundaries posed by the digital and the symbolic, as they were already in effect regarding classical computational logic.

Download Full-text

A Comparative Study on Statistical Machine Learning Algorithms and Thresholding Strategies for Automatic Text Categorization

Lecture Notes in Computer Science - PRICAI 2002: Trends in Artificial Intelligence ◽

10.1007/3-540-45683-x_48 ◽

2002 ◽

pp. 444-453 ◽

Cited By ~ 4

Author(s):

Kang Hyuk Lee ◽

Judy Kay ◽

Byeong Ho Kang ◽

Uwe Rosebrock

Keyword(s):

Machine Learning ◽

Comparative Study ◽

Text Categorization ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Statistical Machine Learning ◽

Automatic Text

Download Full-text

Supplemental Material for One Model to Rule Them All? Using Machine Learning Algorithms to Determine the Number of Factors in Exploratory Factor Analysis

Psychological Methods ◽

10.1037/met0000262.supp ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Factor Analysis ◽

Exploratory Factor Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Number Of Factors

Download Full-text

Forecasting US movies box office performances in Turkey using machine learning algorithms

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189120 ◽

2020 ◽

Vol 39 (5) ◽

pp. 6579-6590

Author(s):

Sandy Çağlıyor ◽

Başar Öztayşi ◽

Selime Sezgin

Keyword(s):

Machine Learning ◽

Global Economy ◽

Learning Algorithms ◽

Forecast Model ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

High Stakes ◽

Box Office ◽

Industry Forecast ◽

The Impact

The motion picture industry is one of the largest industries worldwide and has significant importance in the global economy. Considering the high stakes and high risks in the industry, forecast models and decision support systems are gaining importance. Several attempts have been made to estimate the theatrical performance of a movie before or at the early stages of its release. Nevertheless, these models are mostly used for predicting domestic performances and the industry still struggles to predict box office performances in overseas markets. In this study, the aim is to design a forecast model using different machine learning algorithms to estimate the theatrical success of US movies in Turkey. From various sources, a dataset of 1559 movies is constructed. Firstly, independent variables are grouped as pre-release, distributor type, and international distribution based on their characteristic. The number of attendances is discretized into three classes. Four popular machine learning algorithms, artificial neural networks, decision tree regression and gradient boosting tree and random forest are employed, and the impact of each group is observed by compared by the performance models. Then the number of target classes is increased into five and eight and results are compared with the previously developed models in the literature.

Download Full-text