A novel approach of dynamic base station switching strategy based on Markov decision process for interference alignment in VANETs

2020 ◽  
Vol 26 (8) ◽  
pp. 5561-5578
Author(s):  
Chong Zhao ◽  
Jianghong Han ◽  
Xu Ding ◽  
Fan Yang
Electronics ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 190
Author(s):  
Wu Ouyang ◽  
Zhigang Chen ◽  
Jia Wu ◽  
Genghua Yu ◽  
Heng Zhang

As transportation becomes more convenient and efficient, users move faster and faster. When a user leaves the service range of the original edge server, that server must migrate the tasks the user has offloaded to other edge servers. An effective task migration strategy must fully consider the location of users, the load status of edge servers, and energy consumption, which makes designing one a challenge. In this paper, we propose a mobile edge computing (MEC) system architecture consisting of multiple smart mobile devices (SMDs), multiple unmanned aerial vehicles (UAVs), and a base station (BS). Moreover, we establish a Markov decision process with unknown rewards (MDPUR) model based on the traditional Markov decision process (MDP), which comprehensively considers three aspects: the migration distance, the residual energy status of the UAVs, and the load status of the UAVs. Based on the MDPUR model, we propose an advantage-based value iteration (ABVI) algorithm to obtain an effective task migration strategy, which helps the UAV group achieve load balancing and reduce its total energy consumption while ensuring user service quality. Finally, simulation results show that the ABVI algorithm is effective: it outperforms the traditional value iteration algorithm and remains robust in dynamic environments.
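The abstract does not include code, and the MDPUR formulation learns its rewards online. As a minimal self-contained sketch of the value-iteration step that ABVI builds on, applied to a toy migration MDP (the UAV count, reward weights, and deterministic transition model below are all invented for illustration):

```python
import numpy as np

# Toy value iteration for a task-migration MDP.
# States: which of K UAVs currently hosts the task; actions: migrate to UAV j.
# The reward trades off migration distance, target load, and residual energy,
# with made-up weights standing in for the paper's MDPUR formulation.

K = 4                                  # number of UAVs (hypothetical)
rng = np.random.default_rng(0)
distance = rng.uniform(0, 1, (K, K))   # migration distance between UAVs
energy   = rng.uniform(0, 1, K)        # residual energy of each UAV
load     = rng.uniform(0, 1, K)        # current load of each UAV
gamma    = 0.9

# Immediate reward for migrating from UAV i to UAV j:
# penalize distance and load, favor high residual energy.
R = -1.0 * distance - 0.5 * load[None, :] + 0.5 * energy[None, :]

V = np.zeros(K)
for _ in range(200):                   # iterate until convergence
    Q = R + gamma * V[None, :]         # deterministic transition: s' = j
    V_new = Q.max(axis=1)
    if np.abs(V_new - V).max() < 1e-8:
        break
    V = V_new

policy = Q.argmax(axis=1)              # greedy migration target per host UAV
print("migration policy:", policy)
```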


Author(s):  
Abayomi Ajofoyinbo ◽  
David O. Olowokere ◽  
Oye Ibidapo-Obe

The paper presents an N-element switchable beam antenna (BA) system design for a Wireless Sensor Node (WSN), in which the operation of the BAs is characterized by a semi-Markov decision process (SMDP) with variable sojourn time. A matrix-based switching methodology is introduced for selecting an operational BA based on the signal power received by each BA. An optimality analysis is carried out to maximize the total sum of discounted rewards over the current states. The study also develops a methodology for switching a BA between the non-operational and operational states. The effectiveness of the switchable BA system design is tested via numerical analysis implemented in MATLAB. Numerical results show that this novel approach enables a WSN equipped with BAs to select and maintain an operational BA in receive (or transmit) mode for the entire duration of packet reception (or transmission). The authors found no paper in the existing literature that provides this capability.
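The paper's matrix-based methodology and reward structure are not reproduced here. As a rough sketch of the selection step under standard SMDP discounting (antenna count, signal powers, sojourn times, and the discount rate below are all invented):

```python
import numpy as np

# Illustrative antenna selection for N beam antennas under SMDP discounting.
# Operating antenna i at reward rate P_rx[i] for sojourn time tau[i] yields
# expected discounted reward  P_rx[i] * (1 - e^{-beta*tau[i]}) / beta.

N = 4                                   # number of beam antennas (hypothetical)
rng = np.random.default_rng(1)
P_rx = rng.uniform(0, 1, N)             # received signal power per BA
tau  = rng.uniform(0.5, 2.0, N)         # expected sojourn time per state
beta = 0.1                              # continuous-time discount rate

reward = P_rx * (1.0 - np.exp(-beta * tau)) / beta

operational = int(np.argmax(reward))    # switch this BA to operational state
print(f"select BA {operational}; the others remain non-operational")
```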


Author(s):  
Kazuteru Miyazaki ◽  
Shigenobu Kobayashi

Exploitation-oriented learning (XoL) is a novel approach to goal-directed learning from interaction. Whereas reinforcement learning focuses on learning and ensures optimality in Markov decision process (MDP) environments, XoL learns a rational policy that obtains rewards continuously and very quickly. PS-r*, a form of XoL, learns a useful rational policy that is not inferior to a random walk in partially observable Markov decision processes (POMDPs) with a single reward type. PS-r*, however, requires O(MN²) memory, where N is the number of sensory input types and M is the number of action types. We propose PS-r#, which learns a useful rational policy in the POMDP using O(MN) memory. The effectiveness of PS-r# is confirmed in numerical examples.
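PS-r# itself is defined in the paper, not here. As a heavily simplified sketch of the profit-sharing idea that XoL builds on — reinforcing the episode's observation-action trace with geometrically decaying credit when a reward arrives — using the O(MN) memory layout of one weight per observation-action pair (the Corridor environment and all constants are invented for illustration):

```python
import random
from collections import defaultdict

# Minimal profit-sharing sketch in the spirit of XoL. This is NOT PS-r#;
# it only illustrates decaying credit assignment over an episode trace
# and the O(MN) weight table (one entry per observation-action pair).

class Corridor:
    """Toy episodic task: reach position 3 from 0; actions are -1/+1."""
    def reset(self):
        self.pos = 0
        return self.pos
    def actions(self, obs):
        return (-1, +1)
    def step(self, a):
        self.pos = max(0, self.pos + a)
        done = self.pos == 3
        return self.pos, (1.0 if done else 0.0), done

def profit_sharing(env, episodes=200, decay=0.5):
    w = defaultdict(float)                     # O(MN): (obs, action) -> credit
    for _ in range(episodes):
        obs, trace = env.reset(), []
        while True:
            a = max(env.actions(obs),          # greedy, random tie-breaking
                    key=lambda a: (w[(obs, a)], random.random()))
            trace.append((obs, a))
            obs, reward, done = env.step(a)
            if done:
                credit = reward
                for o, act in reversed(trace): # decaying credit assignment
                    w[(o, act)] += credit
                    credit *= decay
                break
    return w

w = profit_sharing(Corridor())
print({s: max((-1, +1), key=lambda a: w[(s, a)]) for s in range(3)})
```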


Mathematics ◽  
2021 ◽  
Vol 9 (12) ◽  
pp. 1385
Author(s):  
Irais Mora-Ochomogo ◽  
Marco Serrato ◽  
Jaime Mora-Vargas ◽  
Raha Akhavan-Tabatabaei

Natural disasters represent a latent threat for every country in the world. Due to climate change and other factors, statistics show that they continue to be on the rise. This situation challenges communities and humanitarian organizations to be better prepared and to react faster to natural disasters. In some countries, in-kind donations represent a high percentage of the supply for the operations, which presents additional challenges. This research proposes a Markov Decision Process (MDP) model to resemble operations in collection centers, where in-kind donations are received, sorted, packed, and sent to the affected areas. The decision addressed is when to send a shipment, considering the uncertainty of the donations' supply and the demand, as well as the logistics costs and the penalty of unsatisfied demand. As a result of the MDP, a Monotone Optimal Non-Decreasing Policy (MONDP) is proposed, which provides valuable insights for decision-makers within this field. Moreover, the necessary conditions to prove the existence of such a MONDP are presented.
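The paper derives the conditions under which the optimal policy is monotone; it does not include code. As a toy value-iteration sketch of the ship-or-hold decision at a collection center (state: pallets of sorted donations on hand; all costs, the demand level, and the arrival distribution below are invented), the threshold structure of the resulting policy can be seen directly:

```python
import numpy as np

# Toy ship-or-hold MDP for a collection center. Each period a random number
# of donated pallets arrives; shipping empties the center and pays a penalty
# for any shortfall against demand. Costs are minimized by value iteration.

S_MAX, DEMAND, gamma = 20, 10, 0.95
arrivals = [0.3, 0.4, 0.2, 0.1]          # P(k pallets donated), k = 0..3
HOLD_COST, SHIP_COST, SHORT_PEN = 1.0, 15.0, 2.5

def q_value(s, ship, V):
    """Expected discounted cost of shipping (or holding) in state s."""
    if ship:
        cost, nxt = SHIP_COST + SHORT_PEN * max(0, DEMAND - s), 0
    else:
        cost, nxt = HOLD_COST * s, s
    exp_v = sum(p * V[min(S_MAX, nxt + k)] for k, p in enumerate(arrivals))
    return cost + gamma * exp_v

V = np.zeros(S_MAX + 1)
for _ in range(1000):                    # value iteration until convergence
    V_new = np.array([min(q_value(s, False, V), q_value(s, True, V))
                      for s in range(S_MAX + 1)])
    if np.abs(V_new - V).max() < 1e-9:
        break
    V = V_new

policy = [q_value(s, True, V) <= q_value(s, False, V) for s in range(S_MAX + 1)]
print("ship when stock >=", policy.index(True))   # monotone threshold
```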

