big graph Latest Research Papers

The scale of knowledge is growing rapidly in the big data environment, and traditional knowledge organization and services face the dilemma of semantic inaccuracy and untimeliness. Taking a knowledge fusion perspective, this paper combines the precise semantics of traditional ontologies with the large-scale graph processing power and predicate-attribute expressiveness of property graphs, and presents an ontology and property graph fusion framework (OPGFF). The fusion process is divided into content layer fusion and constraint layer fusion. The result of the fusion, a knowledge representation model, is called a knowledge big graph. In addition, this paper applies the knowledge big graph model to the ownership network in China's financial field and builds a financial ownership knowledge big graph. Furthermore, this paper designs and implements six consistency inference algorithms for finding contradictory data and filling in missing data in the financial ownership knowledge big graph, five of which are completely domain agnostic. The correctness and validity of the algorithms have been experimentally verified with actual data. The OPGFF framework and the implementation method of the knowledge big graph could provide a technical reference for big data knowledge organization and services.
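To make "finding contradictory data and filling in missing data" concrete, here is a minimal sketch of one possible consistency check on an ownership network. It is not one of the paper's six algorithms, which are not described in this abstract. The function name `check_ownership`, the tuple layout, and the rule itself are assumptions for illustration: stakes in one company may not sum to more than 100 percent, and a single unknown stake can be inferred as the remainder if every shareholder is assumed to be listed.

```python
from collections import defaultdict


def check_ownership(stakes):
    """Check (owner, company, percent) edges; percent is an int or None if unknown.

    Hypothetical rule, not taken from the paper: the stakes in a company
    must not exceed 100 percent, and a single missing stake is the remainder
    (this assumes the list of shareholders is complete).
    """
    holders_of = defaultdict(list)
    for owner, company, percent in stakes:
        holders_of[company].append((owner, percent))

    contradictions = []  # companies whose known stakes exceed 100 percent
    inferred = {}        # (owner, company) -> stake filled in by the rule
    for company, holders in holders_of.items():
        known_total = sum(p for _, p in holders if p is not None)
        missing = [o for o, p in holders if p is None]
        if known_total > 100:
            contradictions.append(company)
        elif len(missing) == 1:
            inferred[(missing[0], company)] = 100 - known_total
    return contradictions, inferred


if __name__ == "__main__":
    edges = [
        ("A", "X", 60), ("B", "X", 50),    # X is over-owned
        ("A", "Y", 70), ("C", "Y", None),  # C's stake in Y is unknown
    ]
    print(check_ownership(edges))
```

Only the second half of the rule depends on the closed-world assumption; the over-ownership check holds even when the shareholder list is incomplete.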

GraphMP: I/O-Efficient Big Graph Analytics on a Single Commodity Machine

IEEE Transactions on Big Data ◽

10.1109/tbdata.2019.2908384 ◽

2020 ◽

Vol 6 (4) ◽

pp. 816-829 ◽

Cited By ~ 1

Author(s):

Peng Sun ◽

Yonggang Wen ◽

Ta Nguyen Binh Duong ◽

Xiaokui Xiao

Keyword(s):

Graph Analytics ◽

Big Graph ◽

Single Commodity

DHPV: a distributed algorithm for large-scale graph partitioning

Journal Of Big Data ◽

10.1186/s40537-020-00357-y ◽

2020 ◽

Vol 7 (1) ◽

Author(s):

Wilfried Yves Hamilton Adoni ◽

Tarik Nahhal ◽

Moez Krichen ◽

Abdeltif El byed ◽

Ismail Assayad

Keyword(s):

Graph Partitioning ◽

Distributed Algorithm ◽

Large Scale ◽

Data Exploration ◽

Significant Gain ◽

Complete Problem ◽

Big Graph ◽

Big Graphs ◽

Strongly Connected ◽

NP-Complete

Abstract: Big graphs are part of the “Not Only SQL” (NoSQL) database movement, which focuses on the relationships between data rather than on the values themselves. Data is stored in vertices, while edges model the interactions or relationships between them. Big graphs offer flexibility in handling strongly connected data. Analyzing a big graph generally involves exploring all of its vertices, which is costly in time and resources because big graphs are generally composed of millions of vertices connected through billions of edges. Consequently, graph algorithms are expensive relative to the size of the big graph and are therefore ineffective for data exploration. Partitioning the graph stands out as an efficient and less expensive alternative: the graph is split into a set of k sub-graphs in order to reduce the complexity of queries. Nevertheless, it presents many challenges because it is an NP-complete problem. In this article, we present DPHV (Distributed Placement of Hub-Vertices), an efficient parallel and distributed heuristic for large-scale graph partitioning. An application to real-world graphs demonstrates the feasibility and reliability of our method. Experiments carried out on a 10-node Spark cluster showed that the proposed methodology achieves a significant gain in terms of time and outperforms JA-BE-JA, Greedy, and DFEP.
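As an illustration of what partitioning a graph into k sub-graphs means in practice, here is a minimal single-machine sketch of a generic greedy heuristic. It is not DPHV, and not necessarily the "Greedy" baseline named in the abstract. The function names, the balance cap of ceil(n / k) vertices per part, and the use of edge cut (edges whose endpoints land in different parts) as the quality measure are assumptions for illustration.

```python
from collections import defaultdict
from math import ceil


def greedy_stream_partition(edges, k):
    """Assign each vertex to one of k parts, visiting vertices in sorted order.

    Each vertex goes to the non-full part that already holds most of its
    neighbours; ties go to the smaller part, then to the lower part index.
    """
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    capacity = ceil(len(adjacency) / k)  # assumed balance constraint
    part_of = {}
    sizes = [0] * k

    for vertex in sorted(adjacency):
        # Count the neighbours that have already been placed in each part.
        placed_neighbours = [0] * k
        for nb in adjacency[vertex]:
            if nb in part_of:
                placed_neighbours[part_of[nb]] += 1
        open_parts = [p for p in range(k) if sizes[p] < capacity]
        best = max(open_parts, key=lambda p: (placed_neighbours[p], -sizes[p], -p))
        part_of[vertex] = best
        sizes[best] += 1
    return part_of


def edge_cut(edges, part_of):
    """Number of edges whose two endpoints lie in different parts."""
    return sum(1 for u, v in edges if part_of[u] != part_of[v])


if __name__ == "__main__":
    # Two triangles joined by a single bridge edge (2, 3).
    edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
    parts = greedy_stream_partition(edges, k=2)
    print(parts, edge_cut(edges, parts))
```

The result of such a heuristic depends on the order in which vertices are visited, which is one reason exact partitioning is hard and practical systems rely on heuristics.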

DHPV: A Distributed Algorithm for Large-Scale Graph Partitioning

10.21203/rs.3.rs-30206/v2 ◽

2020 ◽

Author(s):

Wilfried Yves Hamilton Adoni ◽

Tarik Nahhal ◽

Moez Krichen ◽

Abdeltif El byed ◽

Ismail Assayad

Keyword(s):

Graph Partitioning ◽

Distributed Algorithm ◽

Large Scale ◽

Data Exploration ◽

Significant Gain ◽

Complete Problem ◽

Big Graph ◽

Big Graphs ◽

Strongly Connected ◽

NP-Complete

Multi-fuzzy-constrained graph pattern matching with big graph data

Intelligent Data Analysis ◽

10.3233/ida-194653 ◽

2020 ◽

Vol 24 (4) ◽

pp. 941-958

Author(s):

Guliu Liu ◽

Lei Li ◽

Xindong Wu

Keyword(s):

Pattern Matching ◽

Graph Pattern Matching ◽

Graph Data ◽

Graph Pattern ◽

Big Graph

DHPV: A Distributed Algorithm for Large-Scale Graph Partitioning

10.21203/rs.3.rs-30206/v1 ◽

2020 ◽

Author(s):

Wilfried Yves Hamilton Adoni ◽

Tarik Nahhal ◽

Moez Krichen ◽

Abdeltif El byed ◽

Ismail Assayad

Keyword(s):

Graph Partitioning ◽

Distributed Algorithm ◽

Large Scale ◽

Data Exploration ◽

Significant Gain ◽

Complete Problem ◽

Big Graph ◽

Big Graphs ◽

Strongly Connected ◽

NP-Complete
