Directed Policy Search for Decision Making Using Relevance Vector Machines

Several recent learning approaches in decision making under uncertainty suggest the use of classifiers for representing policies compactly. The space of possible policies, even under such structured representations, is huge and must be searched carefully to avoid computationally expensive policy simulations (rollouts). In our recent work, we proposed a method for directed exploration of policy space using support vector classifiers, whereby rollouts are directed to states around the boundaries between different action choices indicated by the separating hyperplanes in the represented policies. While effective, this method suffers from the growing number of support vectors in the underlying classifiers as the number of training examples increases. In this paper, we propose an alternative method for directed policy search based on relevance vector machines. Relevance vector machines are used both for classification (to represent a policy) and regression (to approximate the corresponding relative action advantage function). Classification is enhanced by anomaly detection for accurate policy representation. Exploiting the internal structure of the regressor, we guide the probing of the state space only to critical areas corresponding to changes of action dominance in the underlying policy. This directed focus on critical parts of the state space iteratively leads to refinement and improvement of the underlying policy and delivers excellent control policies in only a few iterations, while the small number of relevance vectors yields significant computational time savings. We demonstrate the proposed approach and compare it with our previous method on standard reinforcement learning domains (inverted pendulum and mountain car).

Download Full-text

Regional-Scale Mineral Prospectivity Mapping: Support Vector Machines and an Improved Data-Driven Multi-criteria Decision-Making Technique

Natural Resources Research ◽

10.1007/s11053-021-09842-4 ◽

2021 ◽

Author(s):

Reza Ghezelbash ◽

Abbas Maghsoudi ◽

Amirreza Bigdeli ◽

Emmanuel John M. Carranza

Keyword(s):

Decision Making ◽

Support Vector Machines ◽

Regional Scale ◽

Data Driven ◽

Support Vector ◽

Multi Criteria Decision Making ◽

Mineral Prospectivity Mapping ◽

Vector Machines ◽

Mineral Prospectivity ◽

Prospectivity Mapping

Download Full-text

Application of support vector machines and relevance vector machines in predicting uniaxial compressive strength of volcanic rocks

Journal of African Earth Sciences ◽

10.1016/j.jafrearsci.2014.08.006 ◽

2014 ◽

Vol 100 ◽

pp. 634-644 ◽

Cited By ~ 45

Author(s):

Nurcihan Ceryan

Keyword(s):

Compressive Strength ◽

Support Vector Machines ◽

Uniaxial Compressive Strength ◽

Volcanic Rocks ◽

Support Vector ◽

Relevance Vector Machines ◽

Vector Machines

Download Full-text

ChemTok: A New Rule Based Tokenizer for Chemical Named Entity Recognition

BioMed Research International ◽

10.1155/2016/4248026 ◽

2016 ◽

Vol 2016 ◽

pp. 1-9 ◽

Cited By ~ 5

Author(s):

Abbas Akkasi ◽

Ekrem Varoğlu ◽

Nazife Dimililer

Keyword(s):

Conditional Random Fields ◽

Named Entity Recognition ◽

Classification Performance ◽

Entity Recognition ◽

Support Vector ◽

Learning Approaches ◽

Data Set ◽

Rule Based ◽

Named Entity ◽

Vector Machines

Named Entity Recognition (NER) from text constitutes the first step in many text mining applications. The most important preliminary step for NER systems using machine learning approaches is tokenization where raw text is segmented into tokens. This study proposes an enhanced rule based tokenizer, ChemTok, which utilizes rules extracted mainly from the train data set. The main novelty of ChemTok is the use of the extracted rules in order to merge the tokens split in the previous steps, thus producing longer and more discriminative tokens. ChemTok is compared to the tokenization methods utilized by ChemSpot and tmChem. Support Vector Machines and Conditional Random Fields are employed as the learning algorithms. The experimental results show that the classifiers trained on the output of ChemTok outperforms all classifiers trained on the output of the other two tokenizers in terms of classification performance, and the number of incorrectly segmented entities.

Download Full-text

Modeling Knowledge Employee’s Turnover Based on P-SVM

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.121-122.825 ◽

2010 ◽

Vol 121-122 ◽

pp. 825-831

Author(s):

Yong Zhao ◽

Ye Zheng Liu

Keyword(s):

Genetic Algorithm ◽

Decision Making ◽

Model Development ◽

Forecast Model ◽

Support Vector ◽

Multi Criteria Decision Making ◽

Parameters Selection ◽

Vector Machines ◽

Decision Making Problem ◽

Simulation Results

Knowledge employee’s turnover forecast is a multi-criteria decision-making problem involving various factors. In order to forecast accurately turnover of knowledge employees, the potential support vector machines(P-SVM) is introduced to develop a turnover forecast model. In the model development, a chaos algorithm and a genetic algorithm (GA) are employed to optimize P-SVM parameters selection. The simulation results show that the model based on potential support vector machine with chaos not only has much stronger generalization ability but also has the ability of feature selection.

Download Full-text

Linear Support Vector Machines for Prediction of Student Performance in School-Based Education

Mathematical Problems in Engineering ◽

10.1155/2020/4761468 ◽

2020 ◽

Vol 2020 ◽

pp. 1-7

Author(s):

Nalindren Naicker ◽

Timothy Adeliyi ◽

Jeanette Wing

Keyword(s):

Machine Learning ◽

Support Vector Machines ◽

Student Performance ◽

State Of The Art ◽

Learning Algorithms ◽

The State ◽

Machine Learning Algorithms ◽

Superior Performance ◽

Support Vector ◽

Vector Machines

Educational Data Mining (EDM) is a rich research field in computer science. Tools and techniques in EDM are useful to predict student performance which gives practitioners useful insights to develop appropriate intervention strategies to improve pass rates and increase retention. The performance of the state-of-the-art machine learning classifiers is very much dependent on the task at hand. Investigating support vector machines has been used extensively in classification problems; however, the extant of literature shows a gap in the application of linear support vector machines as a predictor of student performance. The aim of this study was to compare the performance of linear support vector machines with the performance of the state-of-the-art classical machine learning algorithms in order to determine the algorithm that would improve prediction of student performance. In this quantitative study, an experimental research design was used. Experiments were set up using feature selection on a publicly available dataset of 1000 alpha-numeric student records. Linear support vector machines benchmarked with ten categorical machine learning algorithms showed superior performance in predicting student performance. The results of this research showed that features like race, gender, and lunch influence performance in mathematics whilst access to lunch was the primary factor which influences reading and writing performance.

Download Full-text

Machine Learning Approaches for Detecting Tropical Cyclone Formation Using Satellite Data

Remote Sensing ◽

10.3390/rs11101195 ◽

2019 ◽

Vol 11 (10) ◽

pp. 1195 ◽

Cited By ~ 6

Author(s):

Minsang Kim ◽

Myung-Sook Park ◽

Jungho Im ◽

Seonyoung Park ◽

Myong-In Lee

Keyword(s):

Machine Learning ◽

Tropical Cyclone ◽

Surface Wind ◽

Support Vector ◽

Learning Approaches ◽

Yearly Variation ◽

Linear Discriminant ◽

Forecast Lead Time ◽

Vector Machines ◽

First Time

This study compared detection skill for tropical cyclone (TC) formation using models based on three different machine learning (ML) algorithms-decision trees (DT), random forest (RF), and support vector machines (SVM)-and a model based on Linear Discriminant Analysis (LDA). Eight predictors were derived from WindSat satellite measurements of ocean surface wind and precipitation over the western North Pacific for 2005–2009. All of the ML approaches performed better with significantly higher hit rates ranging from 94 to 96% compared with LDA performance (~77%), although false alarm rate by MLs is slightly higher (21–28%) than that by LDA (~13%). Besides, MLs could detect TC formation at the time as early as 26–30 h before the first time diagnosed as tropical depression by the JTWC best track, which was also 5 to 9 h earlier than that by LDA. The skill differences across MLs were relatively smaller than difference between MLs and LDA. Large yearly variation in forecast lead time was common in all models due to the limitation in sampling from orbiting satellite. This study highlights that ML approaches provide an improved skill for detecting TC formation compared with conventional linear approaches.

Download Full-text

Support vector machines versus logistic regression: improving prospective performance in clinical decision-making

Ultrasound in Obstetrics and Gynecology ◽

10.1002/uog.2791 ◽

2006 ◽

Vol 27 (6) ◽

pp. 607-608 ◽

Cited By ~ 25

Author(s):

N. L. M. M. Pochet ◽

J. A. K. Suykens

Keyword(s):

Decision Making ◽

Logistic Regression ◽

Support Vector Machines ◽

Clinical Decision Making ◽

Clinical Decision ◽

Support Vector ◽

Vector Machines

Download Full-text

Incorporating support vector machines with multiple criteria decision making for financial crisis analysis

Quality & Quantity ◽

10.1007/s11135-012-9735-y ◽

2012 ◽

Vol 47 (6) ◽

pp. 3481-3492 ◽

Cited By ~ 7

Author(s):

Ming-Fu Hsu ◽

Ping-Feng Pai

Keyword(s):

Decision Making ◽

Support Vector Machines ◽

Financial Crisis ◽

Multiple Criteria Decision Making ◽

Multiple Criteria ◽

Support Vector ◽

Vector Machines

Download Full-text

Bearing defects decision making using higher order spectra features and support vector machines

14th International Conference on Sciences and Techniques of Automatic Control & Computer Engineering - STA'2013 ◽

10.1109/sta.2013.6783165 ◽

2013 ◽

Cited By ~ 6

Author(s):

L. Saidi ◽

F. Fnaiech

Keyword(s):

Decision Making ◽

Support Vector Machines ◽

Higher Order ◽

Support Vector ◽

Bearing Defects ◽

Vector Machines ◽

Higher Order Spectra

Download Full-text

FORECASTING CORPORATE FINANCIAL PERFORMANCE USING SENTIMENT IN ANNUAL REPORTS FOR STAKEHOLDERS’ DECISION-MAKING

Technological and Economic Development of Economy ◽

10.3846/20294913.2014.979456 ◽

2014 ◽

Vol 20 (4) ◽

pp. 721-738 ◽

Cited By ~ 29

Author(s):

Petr Hajek ◽

Vladimir Olej ◽

Renata Myskova

Keyword(s):

Decision Making ◽

Financial Performance ◽

Prediction Models ◽

Annual Reports ◽

Support Vector ◽

Forecasting Accuracy ◽

Linear Relationships ◽

Vector Machines ◽

Support Decision Making

This paper is aimed at examining the role of annual reports’ sentiment in forecasting financial performance. The sentiment (tone, opinion) is assessed using several categorization schemes in order to explore various aspects of language used in the annual reports of U.S. companies. Further, we employ machine learning methods and neural networks to predict financial performance expressed in terms of the Z-score bankruptcy model. Eleven categories of sentiment (ranging from negative and positive to active and common) are used as the inputs of the prediction models. Support vector machines provide the highest forecasting accuracy. This evidence suggests that there exist non-linear relationships between the sentiment and financial performance. The results indicate that the sentiment information is an important forecasting determinant of financial performance and, thus, can be used to support decision-making process of corporate stakeholders.

Download Full-text