Bandwidth scheduling for big data transfer with two variable node-disjoint paths

Journal of Communications and Networks ◽

10.1109/jcn.2020.000004 ◽

2020 ◽

Vol 22 (2) ◽

pp. 130-144

Author(s):

Aiqin Hou ◽

Chase Qishi Wu ◽

Liudong Zuo ◽

Xiaoyang Zhang ◽

Tao Wang ◽

...

Keyword(s):

Big Data ◽

Data Transfer ◽

Disjoint Paths ◽

Bandwidth Scheduling ◽

Download Full-text

Bandwidth scheduling for big data transfer using multiple fixed node-disjoint paths

Journal of Network and Computer Applications ◽

10.1016/j.jnca.2016.12.011 ◽

2017 ◽

Vol 85 ◽

pp. 47-55 ◽

Author(s):

Aiqin Hou ◽

Chase Q. Wu ◽

Dingyi Fang ◽

Yongqiang Wang ◽

Meng Wang

Keyword(s):

Big Data ◽

Data Transfer ◽

Disjoint Paths ◽

Bandwidth Scheduling ◽

Download Full-text

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

2018 IEEE/ACM Innovating the Network for Data-Intensive Science (INDIS) ◽

10.1109/indis.2018.00009 ◽

2018 ◽

Author(s):

Aiqin Hou ◽

Chase Q. Wu ◽

Dingyi Fang ◽

Liudong Zuo ◽

Michelle M. Zhu ◽

...

Keyword(s):

Big Data ◽

Data Centers ◽

Data Transfer ◽

Bandwidth Scheduling ◽

Deadline Constraint

Download Full-text

Bandwidth scheduling with multiple variable node-disjoint paths in high-performance networks

2016 IEEE 35th International Performance Computing and Communications Conference (IPCCC) ◽

10.1109/pccc.2016.7820600 ◽

2016 ◽

Author(s):

Aiqin Hou ◽

Chase Q. Wu ◽

Dingyi Fang ◽

Yongqiang Wang ◽

Meng Wang ◽

...

Keyword(s):

High Performance ◽

Disjoint Paths ◽

Bandwidth Scheduling ◽

Variable Node ◽

Multiple Variable

Download Full-text

Concurrent bandwidth scheduling for big data transfer over a dedicated channel

International Journal of Communication Networks and Distributed Systems ◽

10.1504/ijcnds.2015.070970 ◽

2015 ◽

Vol 15 (2/3) ◽

pp. 169 ◽

Author(s):

Liudong Zuo ◽

Michelle M. Zhu ◽

Chase Q. Wu

Keyword(s):

Big Data ◽

Data Transfer ◽

Bandwidth Scheduling

Download Full-text

SDN-based bandwidth scheduling for prioritized data transfer between data centers

Cluster Computing ◽

10.1007/s10586-021-03364-7 ◽

2021 ◽

Author(s):

Aiqin Hou ◽

Chase Q. Wu ◽

Qiang Duan ◽

Dawei Quan ◽

Liudong Zuo ◽

...

Keyword(s):

Data Centers ◽

Data Transfer ◽

Bandwidth Scheduling

Download Full-text

Service Scheduling and Resource Allocation for Big Data Transfer in Elastic Optical Network

GLOBECOM 2020 - 2020 IEEE Global Communications Conference ◽

10.1109/globecom42002.2020.9322185 ◽

2020 ◽

Author(s):

Mehdi Tarhani ◽

Sanjib Sarkar ◽

Mehdi Shadaram

Keyword(s):

Resource Allocation ◽

Big Data ◽

Data Transfer ◽

Optical Network ◽

Service Scheduling ◽

Elastic Optical Network

Download Full-text

Client-Based Intelligence for Resource Efficient Vehicular Big Data Transfer in Future 6G Networks

IEEE Transactions on Vehicular Technology ◽

10.1109/tvt.2021.3060459 ◽

2021 ◽

pp. 1-1

Author(s):

Benjamin Sliwa ◽

Rick Adam ◽

Christian Wietfeld

Keyword(s):

Big Data ◽

Download Full-text

DynDL: Scheduling Data-Locality-Aware Tasks with Dynamic Data Transfer Cost for Multicore-Server-Based Big Data Clusters

Applied Sciences ◽

10.3390/app8112216 ◽

2018 ◽

Vol 8 (11) ◽

pp. 2216

Author(s):

Jiahui Jin ◽

Qi An ◽

Wei Zhou ◽

Jiakai Tang ◽

Runqun Xiong

Keyword(s):

Big Data ◽

Data Processing ◽

Processing Time ◽

Data Transfer ◽

Data Locality ◽

Free Time ◽

Time Data ◽

Dynamic Data ◽

Network Bandwidth ◽

Network bandwidth is a scarce resource in big data environments, so data locality is a fundamental problem for data-parallel frameworks such as Hadoop and Spark. This problem is exacerbated in multicore server-based clusters, where multiple tasks running on the same server compete for the server’s network bandwidth. Existing approaches solve this problem by scheduling computational tasks near the input data and considering the server’s free time, data placements, and data transfer costs. However, such approaches usually set identical values for data transfer costs, even though a multicore server’s data transfer cost increases with the number of data-remote tasks. Eventually, this hampers data-processing time, by minimizing it ineffectively. As a solution, we propose DynDL (Dynamic Data Locality), a novel data-locality-aware task-scheduling model that handles dynamic data transfer costs for multicore servers. DynDL offers greater flexibility than existing approaches by using a set of non-decreasing functions to evaluate dynamic data transfer costs. We also propose online and offline algorithms (based on DynDL) that minimize data-processing time and adaptively adjust data locality. Although DynDL is NP-complete (nondeterministic polynomial-complete), we prove that the offline algorithm runs in quadratic time and generates optimal results for DynDL’s specific uses. Using a series of simulations and real-world executions, we show that our algorithms are 30% better than algorithms that do not consider dynamic data transfer costs in terms of data-processing time. Moreover, they can adaptively adjust data localities based on the server’s free time, data placement, and network bandwidth, and schedule tens of thousands of tasks within subseconds or seconds.

Download Full-text

High-Performance End-to-End Integrity Verification on Big Data Transfer

IEICE Transactions on Information and Systems ◽

10.1587/transinf.2018edp7297 ◽

2019 ◽

Vol E102.D (8) ◽

pp. 1478-1488

Author(s):

Eun-Sung JUNG ◽

Si LIU ◽

Rajkumar KETTIMUTHU ◽

Sungwook CHUNG

Keyword(s):

Big Data ◽

High Performance ◽

Data Transfer ◽

Integrity Verification ◽

Download Full-text

Efficient data transfer protocols for big data

2012 IEEE 8th International Conference on E-Science ◽

10.1109/escience.2012.6404462 ◽

2012 ◽

Author(s):

Brian Tierney ◽

Ezra Kissel ◽

Martin Swany ◽

Eric Pouyoul

Keyword(s):

Big Data ◽

Data Transfer ◽

Download Full-text