Modular reinforcement learning for dynamic portfolio optimization in the KOSPI market

2021 ◽  
Vol 32 (1) ◽  
pp. 213-226
Author(s):  
Taeyoon Kim ◽  
Bonggyun Ko

2015 ◽  
Vol 243 (3) ◽  
pp. 921-931 ◽  
Author(s):  
Jan Palczewski ◽  
Rolf Poulsen ◽  
Klaus Reiner Schenk-Hoppé ◽  
Huamao Wang

2020 ◽  
Vol 32 (23) ◽  
pp. 17229-17244
Author(s):  
Giorgio Lucarelli ◽  
Matteo Borrotti

Abstract
Deep reinforcement learning is gaining popularity in many different fields. An interesting sector concerns the definition of dynamic decision-making systems. One example is dynamic portfolio optimization, where an agent must continuously reallocate funds across a number of financial assets with the goal of maximizing return and minimizing risk. In this work, a novel deep Q-learning portfolio management framework is proposed. The framework is composed of two elements: a set of local agents that learn asset behaviours and a global agent that describes the global reward function. The framework is tested on a crypto portfolio composed of four cryptocurrencies. Based on our results, the deep reinforcement learning portfolio management framework has proven to be a promising approach to dynamic portfolio optimization.
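A minimal sketch of the local-agents-plus-global-reward idea described in the abstract, not the authors' implementation: each asset gets its own small Q-network over a feature window with three discrete actions (sell / hold / buy), and a global reward penalizes portfolio volatility. The network sizes, window length, action set, and risk weight `lam` are illustrative assumptions.

```python
# Sketch only: per-asset "local" Q-networks plus a risk-penalised "global" reward.
import torch
import torch.nn as nn

N_ASSETS, WINDOW, N_ACTIONS = 4, 30, 3  # e.g. four cryptocurrencies

class LocalAgent(nn.Module):
    """Q-network for one asset: maps a price-feature window to action values."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(WINDOW, 64), nn.ReLU(),
            nn.Linear(64, N_ACTIONS),
        )

    def forward(self, x):
        return self.net(x)

def global_reward(portfolio_returns, lam=0.1):
    """Global reward: mean return minus a volatility penalty (assumed form)."""
    return portfolio_returns.mean() - lam * portfolio_returns.std()

agents = [LocalAgent() for _ in range(N_ASSETS)]
features = torch.randn(N_ASSETS, WINDOW)            # placeholder market features
q_values = torch.stack([a(f) for a, f in zip(agents, features)])
actions = q_values.argmax(dim=1)                    # greedy action per local agent
reward = global_reward(torch.randn(N_ASSETS))       # placeholder one-step returns
```

In this reading, the local agents are trained with standard Q-learning updates while the shared reward couples them, so each agent's action values reflect portfolio-level rather than single-asset performance.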


2021 ◽  
Vol 8 (3-4) ◽  
pp. 101-125
Author(s):  
Babak Mahdavi-Damghani ◽  
Konul Mustafayeva ◽  
Cristin Buescu ◽  
Stephen Roberts

With the recent rise of Machine Learning (ML) as a candidate to partially replace classic Financial Mathematics (FM) methodologies, we investigate the performance of both in solving the problem of dynamic portfolio optimization in a continuous-time, finite-horizon setting for a portfolio of two intertwined assets. In the Financial Mathematics approach we model the asset prices not via the approaches commonly used in pairs trading, such as high correlation or cointegration, but with the cointelation model of Mahdavi-Damghani (2013), which aims to reconcile both short-term risk and long-term equilibrium. We maximize the overall P&L with a Financial Mathematics approach that dynamically switches between a mean-variance optimal strategy and a power-utility-maximizing strategy. We use a stochastic control formulation of the power utility maximization problem and numerically solve the resulting HJB equation with the Deep Galerkin method introduced in Sirignano and Spiliopoulos (2018). We then turn to Machine Learning for the same P&L maximization problem and use clustering analysis to devise bands, combined with in-band optimization. Although this approach is model agnostic, results obtained with data simulated from the same cointelation model give a slight competitive advantage to the ML methodology over the FM one.
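A minimal sketch of the ML side described above, under stated assumptions rather than the authors' code: k-means clustering on a (simulated, placeholder) two-asset spread devises band levels from the cluster centres, and a simple mean-reversion rule trades only when the spread leaves the outer bands. The number of clusters and the in-band rule are illustrative choices.

```python
# Sketch only: clustering-derived bands with a mean-reversion rule in between.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
spread = np.cumsum(rng.normal(0.0, 0.5, 1000)) * 0.05  # placeholder spread path

# Band levels from k-means cluster centres of the spread values.
km = KMeans(n_clusters=5, n_init=10, random_state=0).fit(spread.reshape(-1, 1))
bands = np.sort(km.cluster_centers_.ravel())

lower, upper = bands[0], bands[-1]                # outer bands trigger trades
position = np.where(spread < lower, 1,            # long the spread below the lower band
           np.where(spread > upper, -1, 0))       # short above the upper band, flat in-band
pnl = (position[:-1] * np.diff(spread)).sum()     # one-period-lagged P&L of the rule
```

The appeal of this approach, as the abstract notes, is that it is model agnostic: the bands are learned from observed spread values and do not presuppose the cointelation dynamics used to simulate the data.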


2018 ◽  
Vol 19 (3) ◽  
pp. 519-532 ◽  
Author(s):  
Rongju Zhang ◽  
Nicolas Langrené ◽  
Yu Tian ◽  
Zili Zhu ◽  
Fima Klebaner ◽  
...  

2005 ◽  
Vol 23 (3) ◽  
pp. 579-594 ◽  
Author(s):  
A. Gabih ◽  
W. Grecksch ◽  
R. Wunderlich
