Risk-aware multi-armed bandit problem with application to portfolio selection

2017 ◽ Vol 4 (11) ◽ pp. 171377
Author(s): Xiaoguang Huo, Feng Fu

Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the exploration-versus-exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk awareness into the classic multi-armed bandit setting and introduce an algorithm for constructing portfolios. By filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return.
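
The abstract does not spell out the algorithm, but the general idea of trading a bandit index off against a coherent risk measure can be sketched as follows. This is an illustrative sketch only, not the authors' method: it pairs a standard UCB1 index with an empirical CVaR penalty, and the names `risk_aware_ucb`, `risk_aversion`, and `alpha` are assumptions introduced here.

```python
# Illustrative sketch only (not the authors' exact algorithm): a UCB1-style
# index combined with an empirical CVaR penalty as the coherent risk measure.
# `risk_aversion` and `alpha` are assumed parameters introduced for this example.
import numpy as np

def empirical_cvar(returns, alpha=0.05):
    """Average of the worst alpha-fraction of observed returns (conditional value at risk)."""
    losses = -np.asarray(returns, dtype=float)
    cutoff = np.quantile(losses, 1.0 - alpha)
    tail = losses[losses >= cutoff]
    return tail.mean() if tail.size else 0.0

def risk_aware_ucb(reward_history, t, risk_aversion=1.0, alpha=0.05):
    """Select the asset maximizing mean return + exploration bonus - CVaR penalty.

    reward_history: one list of past returns per asset; t: current round (t >= 1).
    """
    scores = []
    for i, rewards in enumerate(reward_history):
        if not rewards:
            return i                                        # try every asset once first
        mean = float(np.mean(rewards))
        bonus = np.sqrt(2.0 * np.log(t) / len(rewards))     # UCB1 exploration term
        penalty = risk_aversion * empirical_cvar(rewards, alpha)
        scores.append(mean + bonus - penalty)
    return int(np.argmax(scores))
```

At each round the chosen asset's realized return is appended to its history; the CVaR penalty steers selection away from assets whose past returns show heavy loss tails.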

Author(s): S. Geissel, H. Graf, J. Herbinger, F. T. Seifried

Abstract: The purpose of this article is to evaluate optimal expected utility risk measures (OEU) in a risk-constrained portfolio optimization context where the expected portfolio return is maximized. We compare portfolio optimization with an OEU constraint to a portfolio selection model that uses value at risk as the constraint. OEU is a coherent risk measure for utility functions with constant relative risk aversion and allows individual specification of the investor's risk attitude and time preference. In a case study with three indices, we investigate how these theoretical differences influence the performance of the portfolio selection strategies. A copula approach with univariate ARMA-GARCH models is used in a rolling forecast to simulate monthly future returns and to calculate the derived measures for the optimization. The results of this study illustrate that both optimization strategies perform considerably better than an equally weighted portfolio and a buy-and-hold portfolio. Moreover, our results illustrate that portfolio optimization with an OEU constraint exhibits individualized effects: for example, less risk-averse investors lose more portfolio value during financial crises but outperform their more risk-averse counterparts in bull markets.
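
As a rough illustration of a single step of the rolling risk-constrained optimization described above, the sketch below maximizes expected return over simulated scenarios subject to a risk constraint. It is a simplified stand-in, not the authors' procedure: an empirical value-at-risk constraint replaces the OEU constraint, and the scenario matrix `sims`, the limit `var_limit`, and the level `alpha` are assumed placeholders for the output of the copula/ARMA-GARCH simulation step, which is not shown.

```python
# Illustrative sketch only: one step of a rolling optimization, with an empirical
# value-at-risk constraint standing in for the OEU constraint. `sims`
# (n_scenarios x n_assets), `var_limit`, and `alpha` are assumed placeholders;
# the copula/ARMA-GARCH simulation that would produce `sims` is not shown.
import numpy as np
from scipy.optimize import minimize

def empirical_var(portfolio_returns, alpha=0.05):
    """alpha-level value at risk (loss quantile) of simulated portfolio returns."""
    return -np.quantile(portfolio_returns, alpha)

def optimize_portfolio(sims, var_limit=0.08, alpha=0.05):
    """Maximize expected return over scenarios subject to VaR <= var_limit (long-only)."""
    n_assets = sims.shape[1]
    w0 = np.full(n_assets, 1.0 / n_assets)                 # start from equal weights
    objective = lambda w: -np.mean(sims @ w)               # minimize negative expected return
    constraints = [
        {"type": "eq", "fun": lambda w: np.sum(w) - 1.0},  # fully invested
        {"type": "ineq", "fun": lambda w: var_limit - empirical_var(sims @ w, alpha)},
    ]
    bounds = [(0.0, 1.0)] * n_assets
    result = minimize(objective, w0, method="SLSQP", bounds=bounds, constraints=constraints)
    return result.x
```

In a rolling setup this optimization would be re-run each month on freshly simulated scenarios, and the resulting weights held until the next rebalancing date.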


Author(s): Ming-Sheng Ying, Yuan Feng, Sheng-Gang Ying

Abstract: The Markov decision process (MDP) offers a general framework for modelling sequential decision-making in which outcomes are random. In particular, it serves as a mathematical framework for reinforcement learning. This paper introduces an extension of the MDP, namely the quantum MDP (qMDP), that can serve as a mathematical model of decision-making about quantum systems. We develop dynamic programming algorithms for policy evaluation and for finding optimal policies for qMDPs in the finite-horizon case. The results obtained in this paper provide some useful mathematical tools for reinforcement learning techniques applied to the quantum world.
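
For orientation, the finite-horizon dynamic programming scheme that the paper generalizes can be written down for a classical MDP as below. This is a classical backward-induction sketch only; the quantum-specific machinery of qMDPs (decisions about quantum states and operations) is not represented, and the array names `P` and `R` are assumptions for this illustration.

```python
# Classical backward-induction sketch only; the quantum-specific parts of a
# qMDP (decisions about quantum states and operations) are not represented.
# `P` (one S x S transition matrix per action) and `R` (S x A rewards) are
# assumed names for this illustration.
import numpy as np

def finite_horizon_dp(P, R, horizon):
    """Return optimal values V[t, s] and a time-dependent policy pi[t, s] by backward induction."""
    n_states, n_actions = R.shape
    V = np.zeros((horizon + 1, n_states))                  # V[horizon] = 0: no terminal reward
    pi = np.zeros((horizon, n_states), dtype=int)
    for t in range(horizon - 1, -1, -1):
        # Q[s, a] = immediate reward + expected value of the successor state at stage t + 1
        Q = R + np.stack([P[a] @ V[t + 1] for a in range(n_actions)], axis=1)
        V[t] = Q.max(axis=1)
        pi[t] = Q.argmax(axis=1)
    return V, pi
```

Because values are computed backwards from the terminal stage, the resulting optimal policy is stage-dependent, which is the standard form of the finite-horizon dynamic programming solution that the paper lifts to the quantum setting.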


2020 ◽ Vol 144 ◽ pp. 113032
Author(s): Hamid Hosseini Nesaz, Milad Jasemi, Leslie Monplaisir

2006 ◽ pp. 220-225
Author(s): Imre Kondor, Szilárd Pafka, Richárd Karádi, Gábor Nagy
