BOWL: Bayesian Optimization for Weight Learning in Probabilistic Soft Logic

Probabilistic soft logic (PSL) is a statistical relational learning framework that represents complex relational models with weighted first-order logical rules. The weights of the rules in PSL indicate their importance in the model and influence the effectiveness of the model on a given task. Existing weight learning approaches often attempt to learn a set of weights that maximizes some function of data likelihood. However, this does not always translate to optimal performance on a desired domain metric, such as accuracy or F1 score. In this paper, we introduce a new weight learning approach called Bayesian optimization for weight learning (BOWL) based on Gaussian process regression that directly optimizes weights on a chosen domain performance metric. The key to the success of our approach is a novel projection that captures the semantic distance between the possible weight configurations. Our experimental results show that our proposed approach outperforms likelihood-based approaches and yields up to a 10% improvement across a variety of performance metrics. Further, we performed experiments to measure the scalability and robustness of our approach on various realworld datasets.

Download Full-text

A taxonomy of weight learning methods for statistical relational learning

Machine Learning ◽

10.1007/s10994-021-06069-5 ◽

2021 ◽

Author(s):

Sriram Srinivasan ◽

Charles Dickens ◽

Eriq Augustine ◽

Golnoosh Farnadi ◽

Lise Getoor

Keyword(s):

Probabilistic Models ◽

Performance Metrics ◽

Relational Learning ◽

Empirical Evaluation ◽

Statistical Relational Learning ◽

Weight Space ◽

Learning Approaches ◽

Extensive Evaluation ◽

Real World Datasets ◽

Weight Learning

AbstractStatistical relational learning (SRL) frameworks are effective at defining probabilistic models over complex relational data. They often use weighted first-order logical rules where the weights of the rules govern probabilistic interactions and are usually learned from data. Existing weight learning approaches typically attempt to learn a set of weights that maximizes some function of data likelihood; however, this does not always translate to optimal performance on a desired domain metric, such as accuracy or F1 score. In this paper, we introduce a taxonomy of search-based weight learning approaches for SRL frameworks that directly optimize weights on a chosen domain performance metric. To effectively apply these search-based approaches, we introduce a novel projection, referred to as scaled space (SS), that is an accurate representation of the true weight space. We show that SS removes redundancies in the weight space and captures the semantic distance between the possible weight configurations. In order to improve the efficiency of search, we also introduce an approximation of SS which simplifies the process of sampling weight configurations. We demonstrate these approaches on two state-of-the-art SRL frameworks: Markov logic networks and probabilistic soft logic. We perform empirical evaluation on five real-world datasets and evaluate them each on two different metrics. We also compare them against four other weight learning approaches. Our experimental results show that our proposed search-based approaches outperform likelihood-based approaches and yield up to a 10% improvement across a variety of performance metrics. Further, we perform an extensive evaluation to measure the robustness of our approach to different initializations and hyperparameters. The results indicate that our approach is both accurate and robust.

Download Full-text

Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization

CLEI electronic journal ◽

10.19153/cleiej.21.2.1 ◽

2018 ◽

Vol 21 (2) ◽

Cited By ~ 2

Author(s):

Juan Cruz Barsce ◽

Jorge Andrés Palombarini ◽

Ernesto Carlos Martínez

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Gaussian Process Regression ◽

Control Policy ◽

Bayesian Optimization ◽

Learning Framework ◽

User Expertise ◽

Key Factor ◽

Self Driving Cars ◽

Reinforcement Learning Algorithm

With the increase of machine learning usage by industries and scientific communities in a variety of tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameter in learning algorithms is a key factor for obtaining good performances regardless of user expertise in the inner workings of the techniques and methodologies. In particular, for a reinforcement learning algorithm, the efficiency of an agent learning a control policy in an uncertain environment is heavily dependent on the hyper-parameters used to balance exploration with exploitation. In this work, an autonomous learning framework that integrates Bayesian optimization with Gaussian process regression to optimize the hyper-parameters of a reinforcement learning algorithm, is proposed. Also, a bandits-based approach to achieve a balance between computational costs and decreasing uncertainty about the \textit{Q}-values, is presented. A gridworld example is used to highlight how hyper-parameter configurations of a learning algorithm (SARSA) are iteratively improved based on two performance functions.

Download Full-text

Machine Learning Methods Applied to the Prediction of Pseudo-nitzschia spp. Blooms in the Galician Rias Baixas (NW Spain)

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10040199 ◽

2021 ◽

Vol 10 (4) ◽

pp. 199

Author(s):

Francisco M. Bellas Aláez ◽

Jesus M. Torres Palenzuela ◽

Evangelos Spyrakos ◽

Luis González Vilas

Keyword(s):

Machine Learning ◽

Performance Metrics ◽

Prediction Models ◽

Support Vector ◽

False Alarms ◽

Learning Approaches ◽

Learning Methods ◽

Machine Learning Methods ◽

Rías Baixas ◽

New Algorithms

This work presents new prediction models based on recent developments in machine learning methods, such as Random Forest (RF) and AdaBoost, and compares them with more classical approaches, i.e., support vector machines (SVMs) and neural networks (NNs). The models predict Pseudo-nitzschia spp. blooms in the Galician Rias Baixas. This work builds on a previous study by the authors (doi.org/10.1016/j.pocean.2014.03.003) but uses an extended database (from 2002 to 2012) and new algorithms. Our results show that RF and AdaBoost provide better prediction results compared to SVMs and NNs, as they show improved performance metrics and a better balance between sensitivity and specificity. Classical machine learning approaches show higher sensitivities, but at a cost of lower specificity and higher percentages of false alarms (lower precision). These results seem to indicate a greater adaptation of new algorithms (RF and AdaBoost) to unbalanced datasets. Our models could be operationally implemented to establish a short-term prediction system.

Download Full-text

Bayesian optimization for tuning chaotic systems

Nonlinear Processes in Geophysics Discussions ◽

10.5194/npgd-1-1283-2014 ◽

2014 ◽

Vol 1 (2) ◽

pp. 1283-1312

Author(s):

M. Abbas ◽

A. Ilin ◽

A. Solonen ◽

J. Hakkarainen ◽

E. Oja ◽

...

Keyword(s):

Chaotic Systems ◽

Climate Models ◽

Atmospheric Model ◽

Bayesian Optimization ◽

Tuning Parameters ◽

Performance Metric ◽

Weather And Climate ◽

Computationally Expensive ◽

Low Dimensional ◽

Complex Chaotic Systems

Abstract. In this work, we consider the Bayesian optimization (BO) approach for tuning parameters of complex chaotic systems. Such problems arise, for instance, in tuning the sub-grid scale parameterizations in weather and climate models. For such problems, the tuning procedure is generally based on a performance metric which measures how well the tuned model fits the data. This tuning is often a computationally expensive task. We show that BO, as a tool for finding the extrema of computationally expensive objective functions, is suitable for such tuning tasks. In the experiments, we consider tuning parameters of two systems: a simplified atmospheric model and a low-dimensional chaotic system. We show that BO is able to tune parameters of both the systems with a low number of objective function evaluations and without the need of any gradient information.

Download Full-text

Attack and Anomaly Detection in IoT Networks Using Supervised Machine Learning Approaches

Revue d intelligence artificielle ◽

10.18280/ria.350102 ◽

2021 ◽

Vol 35 (1) ◽

pp. 11-21

Author(s):

Himani Tyagi ◽

Rajendra Kumar

Keyword(s):

Machine Learning ◽

Performance Metrics ◽

Detection System ◽

Feature Reduction ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Testing Time ◽

Learning Approaches ◽

Reduction Techniques ◽

Share Data

IoT is characterized by communication between things (devices) that constantly share data, analyze, and make decisions while connected to the internet. This interconnected architecture is attracting cyber criminals to expose the IoT system to failure. Therefore, it becomes imperative to develop a system that can accurately and automatically detect anomalies and attacks occurring in IoT networks. Therefore, in this paper, an Intrsuion Detection System (IDS) based on extracted novel feature set synthesizing BoT-IoT dataset is developed that can swiftly, accurately and automatically differentiate benign and malicious traffic. Instead of using available feature reduction techniques like PCA that can change the core meaning of variables, a unique feature set consisting of only seven lightweight features is developed that is also IoT specific and attack traffic independent. Also, the results shown in the study demonstrates the effectiveness of fabricated seven features in detecting four wide variety of attacks namely DDoS, DoS, Reconnaissance, and Information Theft. Furthermore, this study also proves the applicability and efficiency of supervised machine learning algorithms (KNN, LR, SVM, MLP, DT, RF) in IoT security. The performance of the proposed system is validated using performance Metrics like accuracy, precision, recall, F-Score and ROC. Though the accuracy of Decision Tree (99.9%) and Randon Forest (99.9%) Classifiers are same but other metrics like training and testing time shows Random Forest comparatively better.

Download Full-text

Machine learning approaches to calibrate individual-based infectious disease models

10.1101/2021.01.27.21250484 ◽

2021 ◽

Author(s):

Theresa Reiker ◽

Monica Golumbeanu ◽

Andrew Shattock ◽

Lydia Burgert ◽

Thomas A. Smith ◽

...

Keyword(s):

Machine Learning ◽

Disease Transmission ◽

Goodness Of Fit ◽

Epidemiological Data ◽

Bayesian Optimization ◽

Model Complexity ◽

Learning Approaches ◽

Dimensional Parameter ◽

Novel Approach ◽

Transmission Models

AbstractIndividual-based models have become important tools in the global battle against infectious diseases, yet model complexity can make calibration to biological and epidemiological data challenging. We propose a novel approach to calibrate disease transmission models via a Bayesian optimization framework employing machine learning emulator functions to guide a global search over a multi-objective landscape. We demonstrate our approach by application to an established individual-based model of malaria, optimizing over a high-dimensional parameter space with respect to a portfolio of multiple fitting objectives built from datasets capturing the natural history of malaria transmission and disease progression. Outperforming other calibration methodologies, the new approach quickly reaches an improved final goodness of fit. Per-objective parameter importance and sensitivity diagnostics provided by our approach offer epidemiological insights and enhance trust in predictions through greater interpretability.One Sentence SummaryWe propose a novel, fast, machine learning-based approach to calibrate disease transmission models that outperforms other methodologies

Download Full-text

Learning Cultures and Multiculturalism

Multicultural Awareness and Technology in Higher Education - Advances in Higher Education and Professional Development ◽

10.4018/978-1-4666-5876-9.ch010 ◽

2014 ◽

pp. 197-217 ◽

Cited By ~ 2

Author(s):

Hanna Teräs ◽

Irja Leppisaari ◽

Marko Teräs ◽

Jan Herrington

Keyword(s):

Learning Environments ◽

Knowledge Society ◽

Culture Shock ◽

Learning Culture ◽

Traditional Learning ◽

Learning Approaches ◽

Cultural Backgrounds ◽

Online Programs ◽

Learning Framework ◽

E Learning

In the rapidly globalizing 21st century knowledge society, multicultural understanding plays a major role. However, what do we mean by “culture” in the educational context, what aspects have or should have an impact on our learning environments, and might some of these assumptions direct the development of our learning environments in an unintended and possibly undesirable way? New learning models that differ from traditional learning approaches might cause a type of a “learning culture shock” for some learners. What are the best ways to avoid and overcome cultural clashes in online learning? This chapter discusses the experiences of two cases from multicultural and multidisciplinary online programs for teacher education and professional development. Both of the programs are based on the principles of authentic e-learning framework described by Herrington, Reeves, and Oliver (2010). The aim of the study was to find out how learners with different cultural backgrounds experience the authentic e-learning process, as well as to find out what impact the authentic e-learning model has on the development of the learning culture.

Download Full-text

Gaussian Process Regression Based Robust Optimization with Observer Uncertainty for Reconfigurable Self-x Sensory Electronics for Industry 4.0

tm - Technisches Messen ◽

10.1515/teme-2021-0061 ◽

2021 ◽

Vol 88 (s1) ◽

pp. s83-s88

Author(s):

Qummar Zaman ◽

Senan Alraho ◽

Andreas König

Keyword(s):

Robust Optimization ◽

Gaussian Process ◽

Industry 4.0 ◽

Performance Metrics ◽

Optimization Technique ◽

Gaussian Process Regression ◽

Search Space ◽

Instrumentation Amplifier ◽

Statistical Regression ◽

Current Feedback

Abstract This paper presents a robust optimization technique for the reconfigurable measurement of sensory electronics for industry 4.0 to obtain a robust solution even in the presence of observer uncertainty using a cost-effective performance measurement method. The extrinsic evaluation of the proposed methodology is performed on an indirect current-feedback instrumentation amplifier (CFIA), which is a fundamental part of sensory systems. To reduce the CFIA device performance evaluation set-up cost, a low-cost test stimulus is applied to the circuit under test, and the output response of the circuit is examined to correlate with the device’s performance parameters. Due to the complexity of the smart sensory electronics search space, the meta-heuristic optimization algorithm is being selected as an optimizer. For objective space or observer uncertainty, the Gaussian process regression from the Bayesian statistical regression process is used to estimate the uncertainty level efficiently. Six different classical metrics have been used to evaluate the regression model accuracy. The highest achieved average expected error metrics value is 0.313, and the minimum value of correlation performance metrics is 0.908. The device is implemented using 0.35 μm austriamicrosystems technology.

Download Full-text

Considerations for performance metrics of metagenomic next generation sequencing analyses

10.1101/2020.12.17.423212 ◽

2020 ◽

Author(s):

Jason G. Kralj ◽

Stephanie L. Servetas ◽

Samuel P. Forry ◽

Scott A. Jackson

Keyword(s):

Performance Metrics ◽

Limit Of Detection ◽

Clinical Performance ◽

Ground Truth ◽

Negative Control ◽

Fitness For Purpose ◽

Performance Metric ◽

The One ◽

Sensitivity Specificity ◽

Harmonic Means

AbstractEvaluating the performance of metagenomics analyses has proven a challenge, due in part to limited ground-truth standards, broad application space, and numerous evaluation methods and metrics. Application of traditional clinical performance metrics (i.e. sensitivity, specificity, etc.) using taxonomic classifiers do not fit the “one-bug-one-test” paradigm. Ultimately, users need methods that evaluate fitness-for-purpose and identify their analyses’ strengths and weaknesses. Within a defined cohort, reporting performance metrics by taxon, rather than by sample, will clarify this evaluation. An estimated limit of detection, positive and negative control samples, and true positive and negative true results are necessary criteria for all investigated taxa. Use of summary metrics should be restricted to comparing results of similar cohorts and data, and should employ harmonic means and continuous products for each performance metric rather than arithmetic mean. Such consideration will ensure meaningful comparisons and evaluation of fitness-for-purpose.

Download Full-text

Optimal Stiffness Design for an Exhaustive Parallel Compliance Matrix in Multiactuator Robotic Limbs

Journal of Mechanisms and Robotics ◽

10.1115/1.4039772 ◽

2018 ◽

Vol 10 (3) ◽

Cited By ~ 1

Author(s):

Nathan M. Cahill ◽

Thomas Sugar ◽

Yi Ren ◽

Kyle Schroeder

Keyword(s):

Energy Efficient ◽

Peak Power ◽

Performance Metrics ◽

Slow Growth ◽

Legged Robots ◽

Power Requirement ◽

Compliance Matrix ◽

Performance Metric ◽

Stiffness Design ◽

Optimal Stiffness

Comparatively slow growth in energy density of both power storage and generation technologies has placed added emphasis on the need for energy-efficient designs in legged robots. This paper explores the potential of parallel springs in robot limb design. We start by adding what we call the exhaustive parallel compliance matrix (EPCM) to the design. The EPCM is a set of parallel springs, which includes a parallel spring for each joint and a multijoint parallel spring for all possible combinations of the robot's joints. Then, we carefully formulate and compare two performance metrics, which improve various aspects of the system performance. Each performance metric is analyzed and compared, their strengths and weaknesses being rigorously presented. The performance benefits associated with this approach are dramatic. Implementing the spring matrix reduces the sum of square power (SSP) exerted by the actuators by up to 47%, the peak power requirement by almost 40%, the sum of squared current by 55%, and the peak current by 55%. These results were generated using a planar robot limb and a gait trajectory borrowed from biology. We use a fully dynamic model of the robotic system including inertial effects. We also test the design robustness using a perturbation study, which shows that the parallel springs are effective even in the presence of trajectory perturbation.

Download Full-text