Negotiation and Persuasion Approach Using Reinforcement Learning Technique on Broker's Board Agent System

This paper describes a multi-agent influence learning approach and reinforcement learning adaptation to it. This learning technique is used for distributed, adaptive and self-organizing control in multi-agent system. This technique is quite simple and uses agent’s influences to estimate learning error between them. The best influences are rewarded via reinforcement learning which is a well-proven learning technique. It is shown that this learning rule supports positive-reward interactions between agents and does not require any additional information than standard reinforcement learning algorithm. This technique produces optimal behavior of multi-agent system with fast convergence patterns.

Download Full-text

Computation offloading through mobile vehicles in IoT-edge-cloud network

EURASIP Journal on Wireless Communications and Networking ◽

10.1186/s13638-020-01848-5 ◽

2020 ◽

Vol 2020 (1) ◽

Author(s):

Jun Long ◽

Yueyi Luo ◽

Xiaoyu Zhu ◽

Entao Luo ◽

Mingfeng Huang

Keyword(s):

Reinforcement Learning ◽

Energy Consumption ◽

Low Cost ◽

Computation Offloading ◽

Mobile Edge Computing ◽

Challenging Problem ◽

Learning Technique ◽

Cloud Network ◽

The City ◽

Task Offloading

AbstractWith the developing of Internet of Things (IoT) and mobile edge computing (MEC), more and more sensing devices are widely deployed in the smart city. These sensing devices generate various kinds of tasks, which need to be sent to cloud to process. Usually, the sensing devices do not equip with wireless modules, because it is neither economical nor energy saving. Thus, it is a challenging problem to find a way to offload tasks for sensing devices. However, many vehicles are moving around the city, which can communicate with sensing devices in an effective and low-cost way. In this paper, we propose a computation offloading scheme through mobile vehicles in IoT-edge-cloud network. The sensing devices generate tasks and transmit the tasks to vehicles, then the vehicles decide to compute the tasks in the local vehicle, MEC server or cloud center. The computation offloading decision is made based on the utility function of the energy consumption and transmission delay, and the deep reinforcement learning technique is adopted to make decisions. Our proposed method can make full use of the existing infrastructures to implement the task offloading of sensing devices, the experimental results show that our proposed solution can achieve the maximum reward and decrease delay.

Download Full-text

Adaptive Resource Management Utilizing Reinforcement Learning Technique in Inter-Cloud Environments

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1055/1/012124 ◽

2021 ◽

Vol 1055 (1) ◽

pp. 012124

Author(s):

Vadla Pradeep Kumar ◽

Kolla Bhanu Prakash

Keyword(s):

Reinforcement Learning ◽

Resource Management ◽

Adaptive Resource Management ◽

Learning Technique ◽

Cloud Environments ◽

Adaptive Resource

Download Full-text

Reinforcement Learning Technique for Parameterization in Powertrain Controls

10.4271/2021-26-0045 ◽

2021 ◽

Author(s):

Gautham Sidharthan ◽

Vivek Venkobarao

Keyword(s):

Reinforcement Learning ◽

Learning Technique

Download Full-text

Applying a Deep Q Network for OpenAIs Car Racing Game

10.14293/s2199-1006.1.sor-.ppd7fvs.v1 ◽

2020 ◽

Author(s):

Ali Fakhry

Keyword(s):

Machine Learning ◽

Reinforcement Learning ◽

Transfer Learning ◽

State Of The Art ◽

Learning Techniques ◽

Car Racing ◽

Custom Made ◽

Learning Technique ◽

Reward Threshold

The applications of Deep Q-Networks are seen throughout the field of reinforcement learning, a large subsect of machine learning. Using a classic environment from OpenAI, CarRacing-v0, a 2D car racing environment, alongside a custom based modification of the environment, a DQN, Deep Q-Network, was created to solve both the classic and custom environments. The environments are tested using custom made CNN architectures and applying transfer learning from Resnet18. While DQNs were state of the art years ago, using it for CarRacing-v0 appears somewhat unappealing and not as effective as other reinforcement learning techniques. Overall, while the model did train and the agent learned various parts of the environment, attempting to reach the reward threshold for the environment with this reinforcement learning technique seems problematic and difficult as other techniques would be more useful.

Download Full-text

Q Value Reinforcement Learning Algorithm Based on Multi Agent System

Journal of Physics Conference Series ◽

10.1088/1742-6596/1069/1/012094 ◽

2018 ◽

Vol 1069 ◽

pp. 012094 ◽

Cited By ~ 1

Author(s):

Xijie Yin ◽

Dongxin Yang

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Multi Agent System ◽

Q Value ◽

Agent System ◽

Multi Agent ◽

Reinforcement Learning Algorithm

Download Full-text

Website Navigation Recommendation Based on Reinforcement Learning Technique

The 3rd International Workshop on Intelligent Data Analysis and Management - Springer Proceedings in Complexity ◽

10.1007/978-94-007-7293-9_10 ◽

2013 ◽

pp. 87-99

Author(s):

Yin-Ling Tang ◽

I-Hsien Ting ◽

Shyue-Liang Wang

Keyword(s):

Reinforcement Learning ◽

Learning Technique

Download Full-text

Parallel Implementation of Reinforcement Learning Q-Learning Technique for FPGA

IEEE Access ◽

10.1109/access.2018.2885950 ◽

2019 ◽

Vol 7 ◽

pp. 2782-2798 ◽

Cited By ~ 9

Author(s):

Lucileide M. D. Da Silva ◽

Matheus F. Torquato ◽

Marcelo A. C. Fernandes

Keyword(s):

Reinforcement Learning ◽

Parallel Implementation ◽

Q Learning ◽

Learning Technique

Download Full-text

Hidden Link Prediction in Criminal Networks Using the Deep Reinforcement Learning Technique

Computers ◽

10.3390/computers8010008 ◽

2019 ◽

Vol 8 (1) ◽

pp. 8 ◽

Cited By ~ 7

Author(s):

Marcus Lim ◽

Azween Abdullah ◽

NZ Jhanjhi ◽

Mahadevan Supramaniam

Keyword(s):

Machine Learning ◽

Reinforcement Learning ◽

Network Analysis ◽

Prediction Model ◽

Supervised Learning ◽

Link Prediction ◽

Supervised Machine Learning ◽

Criminal Networks ◽

Criminal Network ◽

Learning Technique

Criminal network activities, which are usually secret and stealthy, present certain difficulties in conducting criminal network analysis (CNA) because of the lack of complete datasets. The collection of criminal activities data in these networks tends to be incomplete and inconsistent, which is reflected structurally in the criminal network in the form of missing nodes (actors) and links (relationships). Criminal networks are commonly analyzed using social network analysis (SNA) models. Most machine learning techniques that rely on the metrics of SNA models in the development of hidden or missing link prediction models utilize supervised learning. However, supervised learning usually requires the availability of a large dataset to train the link prediction model in order to achieve an optimum performance level. Therefore, this research is conducted to explore the application of deep reinforcement learning (DRL) in developing a criminal network hidden links prediction model from the reconstruction of a corrupted criminal network dataset. The experiment conducted on the model indicates that the dataset generated by the DRL model through self-play or self-simulation can be used to train the link prediction model. The DRL link prediction model exhibits a better performance than a conventional supervised machine learning technique, such as the gradient boosting machine (GBM) trained with a relatively smaller domain dataset.

Download Full-text

Combining reinforcement learning with GA to find co-ordinated control rules for multi-agent system

Proceedings of IEEE International Conference on Evolutionary Computation ICEC-96 ◽

10.1109/icec.1996.542389 ◽

2002 ◽

Cited By ~ 2

Author(s):

S. Mikami ◽

M. Wada ◽

Y. Kakazu

Keyword(s):

Reinforcement Learning ◽

Multi Agent System ◽

Agent System ◽

Control Rules ◽

Multi Agent

Download Full-text