edge partitioning Latest Research Papers

Graph edge partitioning, which is essential for the efficiency of distributed graph computation systems, divides a graph into several balanced partitions within a given size to minimize the number of vertices to be cut. Existing graph partitioning models can be classified into two categories: offline and streaming graph partitioning models. The former requires global graph information during the partitioning, which is expensive in terms of time and memory for large-scale graphs. The latter creates partitions based solely on the received graph information. However, the streaming model may result in a lower partitioning quality compared with the offline model. Therefore, this study introduces a Local Graph Edge Partitioning model, which considers only the local information (i.e., a portion of a graph instead of the entire graph) during the partitioning. Considering only the local graph information is meaningful because acquiring complete information for large-scale graphs is expensive. Based on the Local Graph Edge Partitioning model, two local graph edge partitioning algorithms—Two-stage Local Partitioning and Adaptive Local Partitioning—are given. Experimental results obtained on 14 real-world graphs demonstrate that the proposed algorithms outperform rival algorithms in most tested cases. Furthermore, the proposed algorithms are proven to significantly improve the efficiency of the real graph computation system GraphX.

Download Full-text

CuSP

ACM SIGOPS Operating Systems Review ◽

10.1145/3469379.3469385 ◽

2021 ◽

Vol 55 (1) ◽

pp. 47-60

Author(s):

Loc Hoang ◽

Roshan Dathathri ◽

Gurbinder Gill ◽

Keshav Pingali

Keyword(s):

Single Machine ◽

Distributed Memory ◽

The State ◽

Main Memory ◽

Input Graph ◽

Large Graphs ◽

Graph Analytics ◽

Graph Partitions ◽

High Level ◽

Edge Partitioning

Graph analytics systems must analyze graphs with billions of vertices and edges which require several terabytes of storage. Distributed-memory clusters are often used for analyzing such large graphs since the main memory of a single machine is usually restricted to a few hundreds of gigabytes. This requires partitioning the graph among the machines in the cluster. Existing graph analytics systems use a built-in partitioner that incorporates a particular partitioning policy, but the best policy is dependent on the algorithm, input graph, and platform. Therefore, built-in partitioners are not sufficiently flexible. Stand-alone graph partitioners are available, but they too implement only a few policies. CuSP is a fast streaming edge partitioning framework which permits users to specify the desired partitioning policy at a high level of abstraction and quickly generates highquality graph partitions. For example, it can partition wdc12, the largest publicly available web-crawl graph with 4 billion vertices and 129 billion edges, in under 2 minutes for clusters with 128 machines. Our experiments show that it can produce quality partitions 6× faster on average than the state-of-theart stand-alone partitioner in the literature while supporting a wider range of partitioning policies.

Download Full-text

Taking Heuristic Based Graph Edge Partitioning One Step Ahead via OffStream Partitioning Approach

2021 IEEE 37th International Conference on Data Engineering (ICDE) ◽

10.1109/icde51399.2021.00204 ◽

2021 ◽

Author(s):

Tewodros Ayall ◽

Hancong Duan ◽

Changhong Liu ◽

Fantahun Gereme ◽

Mohammed Abegaz ◽

...

Keyword(s):

One Step ◽

Edge Partitioning

Download Full-text

Group Reassignment for Dynamic Edge Partitioning

IEEE Transactions on Parallel and Distributed Systems ◽

10.1109/tpds.2021.3069292 ◽

2021 ◽

pp. 1-1

Author(s):

He Li ◽

Hang Yuan ◽

Jianbin Huang ◽

Jiangtao Cui ◽

Xiaoke Ma ◽

...

Keyword(s):

Edge Partitioning

Download Full-text

OffStreamNG: Partial Stream Hybrid Graph Edge Partitioning Based on Neighborhood Expansion and Greedy Heuristic

Communications in Computer and Information Science - New Trends in Databases and Information Systems ◽

10.1007/978-3-030-54623-6_11 ◽

2020 ◽

pp. 118-128

Author(s):

Tewodros Ayall ◽

Hancong Duan ◽

Changhong Liu ◽

Fantahun Gereme ◽

Mesay Deleli

Keyword(s):

Greedy Heuristic ◽

Edge Partitioning

Download Full-text

RBSEP: a reassignment and buffer based streaming edge partitioning approach

Journal Of Big Data ◽

10.1186/s40537-019-0257-5 ◽

2019 ◽

Vol 6 (1) ◽

Author(s):

Monireh Taimouri ◽

Hamid Saadatfar

Keyword(s):

Graph Partitioning ◽

Input Data ◽

Large Datasets ◽

Greedy Heuristics ◽

The Internet ◽

Large Graphs ◽

New Approach ◽

The Past ◽

Edge Partitioning

Abstract In recent years, the rapid growth of the Internet has led to creation of massively large graphs. Since databases have become very large nowadays, they cannot be processed by a simple machine at an acceptable time anymore; therefore, traditional graph partitioning methods, which are often based on having a complete image of the entire graph, are not applicable to large datasets. This challenge has led to the appearance of a new approach called streaming graph partitioning. In streaming graph partitioning, a stream of input data is received by a partitioner, and partitioner decides which computational machine the data should be transferred to. Often, streaming partitioner does not have any information about the whole graph, and usually distributes the vertices based on some greedy heuristics which may not be optimal for incoming vertices. Hence, partitioner’s decision can be significantly improved if more information about the graph is utilized. In this paper, we present a new vertex-cut streaming graph partitioning approach. The proposed method uses the idea of postponing the decision for some of the edges (by means of an intelligent buffering) and corrects some of the past decisions to improve the quality of the graph partitioning. The proposed approach is evaluated using from real-world graphs. The experimental results show that the performance of the proposed method is superior in comparison with the previous HDRF method.

Download Full-text