scholarly journals MQLV: Optimal Policy of Money Management in Retail Banking with Q-Learning

Author(s):  
Jeremy Charlier ◽  
Gaston Ormazabal ◽  
Radu State ◽  
Jean Hilger
Entropy ◽  
2021 ◽  
Vol 23 (6) ◽  
pp. 737
Author(s):  
Fengjie Sun ◽  
Xianchang Wang ◽  
Rui Zhang

An Unmanned Aerial Vehicle (UAV) can greatly reduce manpower in the agricultural plant protection such as watering, sowing, and pesticide spraying. It is essential to develop a Decision-making Support System (DSS) for UAVs to help them choose the correct action in states according to the policy. In an unknown environment, the method of formulating rules for UAVs to help them choose actions is not applicable, and it is a feasible solution to obtain the optimal policy through reinforcement learning. However, experiments show that the existing reinforcement learning algorithms cannot get the optimal policy for a UAV in the agricultural plant protection environment. In this work we propose an improved Q-learning algorithm based on similar state matching, and we prove theoretically that there has a greater probability for UAV choosing the optimal action according to the policy learned by the algorithm we proposed than the classic Q-learning algorithm in the agricultural plant protection environment. This proposed algorithm is implemented and tested on datasets that are evenly distributed based on real UAV parameters and real farm information. The performance evaluation of the algorithm is discussed in detail. Experimental results show that the algorithm we proposed can efficiently learn the optimal policy for UAVs in the agricultural plant protection environment.


2020 ◽  
pp. 016555152095979
Author(s):  
M Irfan Uddin ◽  
Syed Atif Ali Shah ◽  
Mahmoud Ahmad Al-Khasawneh ◽  
Ala Abdulsalam Alarood ◽  
Eesa Alsolami

COVID-19 has changed the lifestyle of many people due to its rapid human-to-human transmission. The spread started at the end of January 2020, and different countries used different approaches in terms of testing, sanitization, lock down and quarantine centres to control the spread of the virus. People are getting back to working and routine life activities with new normal standards of testing, sanitization, social distancing and lock down. People are regularly tested to identify those who are infected with COVID-19 and isolate them from general public. However, testing all people unnecessarily is an expensive operation in terms of resources usage. There must be an optimal policy to test only those who have higher chances of being COVID-19 positive. Similarly, sanitization is used for individuals and streets to disinfect people and places. However, sanitization is also an expensive operation in terms of resources, and it is not possible to disinfect each and every individual and street. Social separating or lock down or quarantine centres focuses are different methodologies that are utilised to control the human-to-human transmission of the infection and separate the individuals who are contaminated with COVID-19. However, lock down and quarantine centres are expensive operations in terms of resources as it disturbs the affairs of state and the growth of economy. At the same time, it negatively affects the quality of life of a society. It is also not possible to provide resources to all citizens by locking them inside homes or quarantine centres for infinite time. All these parameters are expensive in terms of resources and have an effect on controlling the spread of the virus, quality of life of human, resources and economy. In this article, a novel intelligent method based on reinforcement learning (RL) is built up that quantifies the unique levels of testing, disinfection and lock down alongside its impact on the spread of the infection, personal satisfaction or quality of life, resource use and economy. Different RL algorithms are actualized and agents are prepared with these algorithms to interact with the environment to gain proficiency with the best strategy. The examinations exhibit that deep learning–based algorithms, for example, DQN and DDPG are performing better than customary RL algorithms, for example, Q-Learning and SARSA.


2020 ◽  
Vol 10 (15) ◽  
pp. 5252
Author(s):  
Min-Jae Paek ◽  
Yu-Jin Na ◽  
Won-Seok Lee ◽  
Jae-Hyun Ro ◽  
Hyoung-Kyu Song

In wireless communication systems, reliability, low latency and power are essential in large scale multi-hop environment. Multi-hop based cooperative communication is an efficient way to achieve goals of wireless networks. This paper proposes a relay selection scheme for reliable transmission by selecting an optimal relay. The proposed scheme uses a signal-to-noise ratio (SNR) based Q-learning relay selection scheme to select an optimal relay in multi-hop transmission. Q-learning consists of an agent, environment, state, action and reward. When the learning is converged, the agent learns the optimal policy which is a rule of the actions that maximize the reward. In other words, the base station (BS) knows the optimal relay to select and transmit the signal. At this time, the cooperative communication scheme used in this paper is a decode-and-forward (DF) scheme in orthogonal frequency division multiplexing (OFDM) system. The Q-learning in the proposed scheme defines an environment to maximize a reward which is defined as SNR. After the learning process, the proposed scheme finds an optimal policy. Furthermore, this paper defines a reward which is based on the SNR. The simulation results show that the proposed scheme has the same bit error rate (BER) performance as the conventional relay selection scheme. However, this paper proposes an advantage of selecting fewer relays than conventional scheme when the target BER is satisfied. This can reduce the latency and the waste of resources. Therefore, the performance of the multi-hop transmission in wireless networks is enhanced.


2018 ◽  
Vol 41 ◽  
Author(s):  
Samuel G. B. Johnson

AbstractProfessional money management appears to require little skill, yet its practitioners command astronomical salaries. Singh's theory of shamanism provides one possible explanation: Financial professionals are the shamans of the global economy. They cultivate the perception of superhuman traits, maintain grueling initiation rituals, and rely on esoteric divination rituals. An anthropological view of markets can usefully supplement economic and psychological approaches.


1976 ◽  
Vol 40 (6) ◽  
pp. 358-361
Author(s):  
GG Burger
Keyword(s):  

2004 ◽  
Author(s):  
Joseph A. DiVanna
Keyword(s):  

Liquidity ◽  
2018 ◽  
Vol 5 (2) ◽  
pp. 95-105
Author(s):  
Dede Dahlan

There are many understanding of society, that cash waqf it should not be legal. So is the trust factor of people's money management institutions waqf (Nazhir) is still a constraint. Research conducted in Tabung Wakaf Indonesia (TWI) and Wakaf Al Azhar this analysis method, namely the principles of Good Corporate Governance (GCG). Here researchers using purposive sampling, followed by giving a score using the Likert Scale. To determine whether the data obtained in the field is valid or not, the researchers used a method tri angular source. The results of the assessment of GCG in TWI and Wakaf Al-Azhar obtain a total score of at Tabung Wakaf Indonesia amounting to 3.15. Then the bias is said that the implementation of GCG at TWI and Wakaf Al-Azhar declared "GOOD ENOUGH". While the results of the evaluation tri angular mention, that the data obtained from the results of research in the field both TWI and in Wakaf Al-Azhar, when compared with the corporate governance principles can be declared invalid according to the KNKG.


2019 ◽  
pp. 39-54
Author(s):  
Marco Ieva ◽  
Cristina Ziliani

Customer Experience develops through a journey of touchpoints. However, little is known on the role of touchpoints in contributing to customer loyalty, which is the final aim of Customer Experience Management. This study provides an examination of the relative and moderating role of frequency and positivity of exposure to more than twenty touchpoints and their interplay in contributing to customer loyalty. An online survey on more than three thousand consumers is run with reference to retail banking. Results show that only a small number of touchpoints is significantly related to customer loyalty. Findings point companies' attention to invest their efforts in managing both the frequency and positivity of specific touchpoints.


Sign in / Sign up

Export Citation Format

Share Document