Interactive Exploration-Exploitation Balancing for Generative Melody Composition

Author(s):  
Yijun Zhou ◽  
Yuki Koyama ◽  
Masataka Goto ◽  
Takeo Igarashi
2005 ◽  
Author(s):  
Neta Moye ◽  
Lucy L. Gilson ◽  
Jill E. Perry-Smith

Mathematics ◽  
2021 ◽  
Vol 9 (16) ◽  
pp. 1839
Author(s):  
Broderick Crawford ◽  
Ricardo Soto ◽  
José Lemus-Romani ◽  
Marcelo Becerra-Rozas ◽  
José M. Lanza-Gutiérrez ◽  
...  

One of the central issues that must be resolved for a metaheuristic optimization process to work well is the dilemma of the balance between exploration and exploitation. The metaheuristics (MH) that achieved this balance can be called balanced MH, where a Q-Learning (QL) integration framework was proposed for the selection of metaheuristic operators conducive to this balance, particularly the selection of binarization schemes when a continuous metaheuristic solves binary combinatorial problems. In this work the use of this framework is extended to other recent metaheuristics, demonstrating that the integration of QL in the selection of operators improves the exploration-exploitation balance. Specifically, the Whale Optimization Algorithm and the Sine-Cosine Algorithm are tested by solving the Set Covering Problem, showing statistical improvements in this balance and in the quality of the solutions.


Author(s):  
Humoud Alsabah ◽  
Agostino Capponi ◽  
Octavio Ruiz Lacedelli ◽  
Matt Stern

Abstract We introduce a reinforcement learning framework for retail robo-advising. The robo-advisor does not know the investor’s risk preference but learns it over time by observing her portfolio choices in different market environments. We develop an exploration–exploitation algorithm that trades off costly solicitations of portfolio choices by the investor with autonomous trading decisions based on stale estimates of investor’s risk aversion. We show that the approximate value function constructed by the algorithm converges to the value function of an omniscient robo-advisor over a number of periods that is polynomial in the state and action space. By correcting for the investor’s mistakes, the robo-advisor may outperform a stand-alone investor, regardless of the investor’s opportunity cost for making portfolio decisions.


2016 ◽  
Vol 37 (8) ◽  
pp. 719-726 ◽  
Author(s):  
Powell Patrick Cheng Tan ◽  
Sanja Rogic ◽  
Anton Zoubarev ◽  
Cameron McDonald ◽  
Frances Lui ◽  
...  

2006 ◽  
Vol 21 (3) ◽  
pp. 261-267 ◽  
Author(s):  
ENO THERESKA ◽  
DUSHYANTH NARAYANAN ◽  
GREGORY R. GANGER

Today, management and tuning questions are approached using if… then… rules of thumb. This reactive approach requires expertise regarding system behavior, making it difficult to deal with unforeseen uses of a system’s resources and leading to system unpredictability and large system management overheads. We propose a What…if… approach that allows interactive exploration of the effects of system changes, thus converting complex tuning problem into simpler search problems. Through two concrete management problems, automating system upgrades and deciding on service migrations, we identify system design changes that enable a system to answer What…if… questions about itself.


Methods ◽  
2017 ◽  
Vol 115 ◽  
pp. 100-109 ◽  
Author(s):  
Zhongyu Li ◽  
Dimitris N. Metaxas ◽  
Aidong Lu ◽  
Shaoting Zhang

Sign in / Sign up

Export Citation Format

Share Document