Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Author(s):  
Aiqin Hou ◽  
Chase Q. Wu ◽  
Dingyi Fang ◽  
Liudong Zuo ◽  
Michelle M. Zhu ◽  
...  
2021 ◽  
Author(s):  
Aiqin Hou ◽  
Chase Q. Wu ◽  
Qiang Duan ◽  
Dawei Quan ◽  
Liudong Zuo ◽  
...  

2020 ◽  
Vol 22 (2) ◽  
pp. 130-144
Author(s):  
Aiqin Hou ◽  
Chase Qishi Wu ◽  
Liudong Zuo ◽  
Xiaoyang Zhang ◽  
Tao Wang ◽  
...  

2017 ◽  
Vol 85 ◽  
pp. 47-55 ◽  
Author(s):  
Aiqin Hou ◽  
Chase Q. Wu ◽  
Dingyi Fang ◽  
Yongqiang Wang ◽  
Meng Wang

2020 ◽  
Vol 8 (4) ◽  
pp. 1189-1198 ◽  
Author(s):  
Abdulsalam Yassine ◽  
Ali Asghar Nazari Shirehjini ◽  
Shervin Shirmohammadi

2020 ◽  
Vol 10 (21) ◽  
pp. 7586
Author(s):  
Jose E. Lozano-Rizk ◽  
Juan I. Nieto-Hipolito ◽  
Raul Rivera-Rodriguez ◽  
Maria A. Cosio-Leon ◽  
Mabel Vazquez-Briseño ◽  
...  

When Internet of Things (IoT) big data analytics (BDA) require to transfer data streams among software defined network (SDN)-based distributed data centers, the data flow forwarding in the communication network is typically done by an SDN controller using a traditional shortest path algorithm or just considering bandwidth requirements by the applications. In BDA, this scheme could affect their performance resulting in a longer job completion time because additional metrics were not considered, such as end-to-end delay, jitter, and packet loss rate in the data transfer path. These metrics are quality of service (QoS) parameters in the communication network. This research proposes a solution called QoSComm, an SDN strategy to allocate QoS-based data flows for BDA running across distributed data centers to minimize their job completion time. QoSComm operates in two phases: (i) based on the current communication network conditions, it calculates the feasible paths for each data center using a multi-objective optimization method; (ii) it distributes the resultant paths among data centers configuring their openflow Switches (OFS) dynamically. Simulation results show that QoSComm can improve BDA job completion time by an average of 18%.


2018 ◽  
Vol 8 (11) ◽  
pp. 2216
Author(s):  
Jiahui Jin ◽  
Qi An ◽  
Wei Zhou ◽  
Jiakai Tang ◽  
Runqun Xiong

Network bandwidth is a scarce resource in big data environments, so data locality is a fundamental problem for data-parallel frameworks such as Hadoop and Spark. This problem is exacerbated in multicore server-based clusters, where multiple tasks running on the same server compete for the server’s network bandwidth. Existing approaches solve this problem by scheduling computational tasks near the input data and considering the server’s free time, data placements, and data transfer costs. However, such approaches usually set identical values for data transfer costs, even though a multicore server’s data transfer cost increases with the number of data-remote tasks. Eventually, this hampers data-processing time, by minimizing it ineffectively. As a solution, we propose DynDL (Dynamic Data Locality), a novel data-locality-aware task-scheduling model that handles dynamic data transfer costs for multicore servers. DynDL offers greater flexibility than existing approaches by using a set of non-decreasing functions to evaluate dynamic data transfer costs. We also propose online and offline algorithms (based on DynDL) that minimize data-processing time and adaptively adjust data locality. Although DynDL is NP-complete (nondeterministic polynomial-complete), we prove that the offline algorithm runs in quadratic time and generates optimal results for DynDL’s specific uses. Using a series of simulations and real-world executions, we show that our algorithms are 30% better than algorithms that do not consider dynamic data transfer costs in terms of data-processing time. Moreover, they can adaptively adjust data localities based on the server’s free time, data placement, and network bandwidth, and schedule tens of thousands of tasks within subseconds or seconds.


Sign in / Sign up

Export Citation Format

Share Document