Nonparametric Online Machine Learning with Kernels

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/758 ◽

2017 ◽

Cited By ~ 1

Author(s):

Khanh Nguyen

Keyword(s):

Machine Learning ◽

Bayesian Inference ◽

Model Selection ◽

Kernel Methods ◽

Model Parameters ◽

Grid Search ◽

Learning Context ◽

Real World Applications ◽

Online Setting ◽

Model Selection Problem

Max-margin and kernel methods are dominant approaches to solve many tasks in machine learning. However, the paramount question is how to solve model selection problem in these methods. It becomes urgent in online learning context. Grid search is a common approach, but it turns out to be highly problematic in real-world applications. Our approach is to view max-margin and kernel methods under a Bayesian setting, then use Bayesian inference tools to learn model parameters and infer hyper-parameters in principle ways for both batch and online setting.

Download Full-text

Efficient inference for agent-based models of real-world phenomena

10.1101/2021.10.04.462980 ◽

2021 ◽

Author(s):

Andreas Christ Sølvsten Jørgensen ◽

Atiyo Ghosh ◽

Marc Sturrock ◽

Vahid Shahrezaei

Keyword(s):

Machine Learning ◽

Case Studies ◽

Parameter Space ◽

Real World ◽

Autonomous Agents ◽

Stochastic Simulations ◽

Model Parameters ◽

Learning Approaches ◽

Real World Applications ◽

Real World Problems

AbstractThe modelling of many real-world problems relies on computationally heavy simulations. Since statistical inference rests on repeated simulations to sample the parameter space, the high computational expense of these simulations can become a stumbling block. In this paper, we compare two ways to mitigate this issue based on machine learning methods. One approach is to construct lightweight surrogate models to substitute the simulations used in inference. Alternatively, one might altogether circumnavigate the need for Bayesian sampling schemes and directly estimate the posterior distribution. We focus on stochastic simulations that track autonomous agents and present two case studies of real-world applications: tumour growths and the spread of infectious diseases. We demonstrate that good accuracy in inference can be achieved with a relatively small number of simulations, making our machine learning approaches orders of magnitude faster than classical simulation-based methods that rely on sampling the parameter space. However, we find that while some methods generally produce more robust results than others, no algorithm offers a one-size-fits-all solution when attempting to infer model parameters from observations. Instead, one must choose the inference technique with the specific real-world application in mind. The stochastic nature of the considered real-world phenomena poses an additional challenge that can become insurmountable for some approaches. Overall, we find machine learning approaches that create direct inference machines to be promising for real-world applications. We present our findings as general guidelines for modelling practitioners.Author summaryComputer simulations play a vital role in modern science as they are commonly used to compare theory with observations. One can thus infer the properties of a observed system by comparing the data to the predicted behaviour in different scenarios. Each of these scenarios corresponds to a simulation with slightly different settings. However, since real-world problems are highly complex, the simulations often require extensive computational resources, making direct comparisons with data challenging, if not insurmountable. It is, therefore, necessary to resort to inference methods that mitigate this issue, but it is not clear-cut what path to choose for any specific research problem. In this paper, we provide general guidelines for how to make this choice. We do so by studying examples from oncology and epidemiology and by taking advantage of developments in machine learning. More specifically, we focus on simulations that track the behaviour of autonomous agents, such as single cells or individuals. We show that the best way forward is problem-dependent and highlight the methods that yield the most robust results across the different case studies. We demonstrate that these methods are highly promising and produce reliable results in a small fraction of the time required by classic approaches that rely on comparisons between data and individual simulations. Rather than relying on a single inference technique, we recommend employing several methods and selecting the most reliable based on predetermined criteria.

Download Full-text

ModelTeller: Model Selection for Optimal Phylogenetic Reconstruction Using Machine Learning

Molecular Biology and Evolution ◽

10.1093/molbev/msaa154 ◽

2020 ◽

Vol 37 (11) ◽

pp. 3338-3352

Author(s):

Shiran Abadi ◽

Oren Avram ◽

Saharon Rosset ◽

Tal Pupko ◽

Itay Mayrose

Keyword(s):

Machine Learning ◽

Model Selection ◽

Selection Criteria ◽

Sequence Data ◽

Phylogenetic Reconstruction ◽

Branch Length ◽

Model Parameters ◽

Model Selection Criteria ◽

Learning Framework ◽

Length Estimation

Abstract Statistical criteria have long been the standard for selecting the best model for phylogenetic reconstruction and downstream statistical inference. Although model selection is regarded as a fundamental step in phylogenetics, existing methods for this task consume computational resources for long processing time, they are not always feasible, and sometimes depend on preliminary assumptions which do not hold for sequence data. Moreover, although these methods are dedicated to revealing the processes that underlie the sequence data, they do not always produce the most accurate trees. Notably, phylogeny reconstruction consists of two related tasks, topology reconstruction and branch-length estimation. It was previously shown that in many cases the most complex model, GTR+I+G, leads to topologies that are as accurate as using existing model selection criteria, but overestimates branch lengths. Here, we present ModelTeller, a computational methodology for phylogenetic model selection, devised within the machine-learning framework, optimized to predict the most accurate nucleotide substitution model for branch-length estimation. We demonstrate that ModelTeller leads to more accurate branch-length inference than current model selection criteria on data sets simulated under realistic processes. ModelTeller relies on a readily implemented machine-learning model and thus the prediction according to features extracted from the sequence data results in a substantial decrease in running time compared with existing strategies. By harnessing the machine-learning framework, we distinguish between features that mostly contribute to branch-length optimization, concerning the extent of sequence divergence, and features that are related to estimates of the model parameters that are important for the selection made by current criteria.

Download Full-text

Bayesian Model Selection in Fisheries Management and Ecology

Journal of Fish and Wildlife Management ◽

10.3996/042019-jfwm-024 ◽

2019 ◽

Vol 10 (2) ◽

pp. 691-707

Author(s):

Jason C. Doll ◽

Stephen J. Jacquemin

Keyword(s):

Bayesian Inference ◽

Model Selection ◽

Fisheries Management ◽

Bayesian Model ◽

Bayes Factor ◽

Evaluation Process ◽

Information Criterion ◽

Bayesian Model Selection ◽

Model Parameters ◽

Inference Methods

Abstract Researchers often test ecological hypotheses relating to a myriad of questions ranging from assemblage structure, population dynamics, demography, abundance, growth rate, and more using mathematical models that explain trends in data. To aid in the evaluation process when faced with competing hypotheses, we employ statistical methods to evaluate the validity of these multiple hypotheses with the goal of deriving the most robust conclusions possible. In fisheries management and ecology, frequentist methodologies have largely dominated this approach. However, in recent years, researchers have increasingly used Bayesian inference methods to estimate model parameters. Our aim with this perspective is to provide the practicing fisheries ecologist with an accessible introduction to Bayesian model selection. Here we discuss Bayesian inference methods for model selection in the context of fisheries management and ecology with empirical examples to guide researchers in the use of these methods. In this perspective we discuss three methods for selecting among competing models. For comparing two models we discuss Bayes factor and for more complex models we discuss Watanabe–Akaike information criterion and leave-one-out cross-validation. We also describe what kinds of information to report when conducting Bayesian inference. We conclude this review with a discussion of final thoughts about these model selection techniques.

Download Full-text

Hyperparameter Tuning and Pipeline Optimization via Grid Search Method and Tree-Based AutoML in Breast Cancer Prediction

Journal of Personalized Medicine ◽

10.3390/jpm11100978 ◽

2021 ◽

Vol 11 (10) ◽

pp. 978

Author(s):

Siti Fairuz Mat Radzi ◽

Muhammad Khalis Abdul Karim ◽

M Iqbal Saripan ◽

Mohd Amiruddin Abdul Rahman ◽

Iza Nurzawani Che Isa ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Model Selection ◽

Principal Component ◽

Receiver Operating Curve ◽

Support Vector ◽

Grid Search ◽

Breast Cancer Data ◽

Data Set ◽

Cancer Data

Automated machine learning (AutoML) has been recognized as a powerful tool to build a system that automates the design and optimizes the model selection machine learning (ML) pipelines. In this study, we present a tree-based pipeline optimization tool (TPOT) as a method for determining ML models with significant performance and less complex breast cancer diagnostic pipelines. Some features of pre-processors and ML models are defined as expression trees and optimal gene programming (GP) pipelines, a stochastic search system. Features of radiomics have been presented as a guide for the ML pipeline selection from the breast cancer data set based on TPOT. Breast cancer data were used in a comparative analysis of the TPOT-generated ML pipelines with the selected ML classifiers, optimized by a grid search approach. The principal component analysis (PCA) random forest (RF) classification was proven to be the most reliable pipeline with the lowest complexity. The TPOT model selection technique exceeded the performance of grid search (GS) optimization. The RF classifier showed an outstanding outcome amongst the models in combination with only two pre-processors, with a precision of 0.83. The grid search optimized for support vector machine (SVM) classifiers generated a difference of 12% in comparison, while the other two classifiers, naïve Bayes (NB) and artificial neural network—multilayer perceptron (ANN-MLP), generated a difference of almost 39%. The method’s performance was based on sensitivity, specificity, accuracy, precision, and receiver operating curve (ROC) analysis.

Download Full-text

Comparing two sequential Monte Carlo samplers for exact and approximate Bayesian inference on biological models

Journal of The Royal Society Interface ◽

10.1098/rsif.2017.0340 ◽

2017 ◽

Vol 14 (134) ◽

pp. 20170340 ◽

Cited By ~ 6

Author(s):

Aidan C. Daly ◽

Jonathan Cooper ◽

David J. Gavaghan ◽

Chris Holmes

Keyword(s):

Bayesian Inference ◽

Bayesian Methods ◽

Sequential Monte Carlo ◽

Model Parameters ◽

Biological Models ◽

Exact Inference ◽

Modelling Studies ◽

Approximate Bayesian ◽

Approximate Bayesian Inference ◽

Abc Methods

Bayesian methods are advantageous for biological modelling studies due to their ability to quantify and characterize posterior variability in model parameters. When Bayesian methods cannot be applied, due either to non-determinism in the model or limitations on system observability, approximate Bayesian computation (ABC) methods can be used to similar effect, despite producing inflated estimates of the true posterior variance. Owing to generally differing application domains, there are few studies comparing Bayesian and ABC methods, and thus there is little understanding of the properties and magnitude of this uncertainty inflation. To address this problem, we present two popular strategies for ABC sampling that we have adapted to perform exact Bayesian inference, and compare them on several model problems. We find that one sampler was impractical for exact inference due to its sensitivity to a key normalizing constant, and additionally highlight sensitivities of both samplers to various algorithmic parameters and model conditions. We conclude with a study of the O'Hara–Rudy cardiac action potential model to quantify the uncertainty amplification resulting from employing ABC using a set of clinically relevant biomarkers. We hope that this work serves to guide the implementation and comparative assessment of Bayesian and ABC sampling techniques in biological models.

Download Full-text

Kernel methods in system identification, machine learning and function estimation: A survey

Automatica ◽

10.1016/j.automatica.2014.01.001 ◽

2014 ◽

Vol 50 (3) ◽

pp. 657-682 ◽

Cited By ~ 294

Author(s):

Gianluigi Pillonetto ◽

Francesco Dinuzzo ◽

Tianshi Chen ◽

Giuseppe De Nicolao ◽

Lennart Ljung

Keyword(s):

Machine Learning ◽

System Identification ◽

Kernel Methods ◽

Function Estimation ◽

And Function

Download Full-text

A Review of Machine Learning Classification Using Quantum Annealing for Real-World Applications

SN Computer Science ◽

10.1007/s42979-021-00751-0 ◽

2021 ◽

Vol 2 (5) ◽

Author(s):

Rajdeep Kumar Nath ◽

Himanshu Thapliyal ◽

Travis S. Humble

Keyword(s):

Machine Learning ◽

Real World ◽

Quantum Annealing ◽

Machine Learning Classification ◽

Real World Applications

Download Full-text

Trigonometric Inference Providing Learning in Deep Neural Networks

Applied Sciences ◽

10.3390/app11156704 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6704

Author(s):

Jingyong Cai ◽

Masashi Takemoto ◽

Yuming Qiu ◽

Hironori Nakajo

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Deep Neural Networks ◽

Activation Function ◽

Trigonometric Approximation ◽

Model Parameters ◽

Training Algorithms ◽

Activation Functions ◽

Classical Training ◽

Sum Formula

Despite being heavily used in the training of deep neural networks (DNNs), multipliers are resource-intensive and insufficient in many different scenarios. Previous discoveries have revealed the superiority when activation functions, such as the sigmoid, are calculated by shift-and-add operations, although they fail to remove multiplications in training altogether. In this paper, we propose an innovative approach that can convert all multiplications in the forward and backward inferences of DNNs into shift-and-add operations. Because the model parameters and backpropagated errors of a large DNN model are typically clustered around zero, these values can be approximated by their sine values. Multiplications between the weights and error signals are transferred to multiplications of their sine values, which are replaceable with simpler operations with the help of the product to sum formula. In addition, a rectified sine activation function is utilized for further converting layer inputs into sine values. In this way, the original multiplication-intensive operations can be computed through simple add-and-shift operations. This trigonometric approximation method provides an efficient training and inference alternative for devices with insufficient hardware multipliers. Experimental results demonstrate that this method is able to obtain a performance close to that of classical training algorithms. The approach we propose sheds new light on future hardware customization research for machine learning.

Download Full-text

Novel Meta-Features for Automated Machine Learning Model Selection in Anomaly Detection

IEEE Access ◽

10.1109/access.2021.3090936 ◽

2021 ◽

pp. 1-1

Author(s):

Milos Kotlar ◽

Marija Punt ◽

Zaharije Radivojevic ◽

Milos Cvetanovic ◽

Veljko Milutinovic

Keyword(s):

Machine Learning ◽

Model Selection ◽

Anomaly Detection ◽

Learning Model ◽

Machine Learning Model ◽

Automated Machine Learning

Download Full-text

A Survey on Bias and Fairness in Machine Learning

ACM Computing Surveys ◽

10.1145/3457607 ◽

2021 ◽

Vol 54 (6) ◽

pp. 1-35

Author(s):

Ninareh Mehrabi ◽

Fred Morstatter ◽

Nripsuta Saxena ◽

Kristina Lerman ◽

Aram Galstyan

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Deep Learning ◽

Real World ◽

State Of The Art ◽

Future Directions ◽

Discriminatory Behavior ◽

Real World Applications ◽

Near Future ◽

Different Sources

With the widespread use of artificial intelligence (AI) systems and applications in our everyday lives, accounting for fairness has gained significant importance in designing and engineering of such systems. AI systems can be used in many sensitive environments to make important and life-changing decisions; thus, it is crucial to ensure that these decisions do not reflect discriminatory behavior toward certain groups or populations. More recently some work has been developed in traditional machine learning and deep learning that address such challenges in different subdomains. With the commercialization of these systems, researchers are becoming more aware of the biases that these applications can contain and are attempting to address them. In this survey, we investigated different real-world applications that have shown biases in various ways, and we listed different sources of biases that can affect AI applications. We then created a taxonomy for fairness definitions that machine learning researchers have defined to avoid the existing bias in AI systems. In addition to that, we examined different domains and subdomains in AI showing what researchers have observed with regard to unfair outcomes in the state-of-the-art methods and ways they have tried to address them. There are still many future directions and solutions that can be taken to mitigate the problem of bias in AI systems. We are hoping that this survey will motivate researchers to tackle these issues in the near future by observing existing work in their respective fields.

Download Full-text