Finding patterns in the degree distribution of real-world complex networks: going beyond power law

In the Target Set Selection (TSS) problem, we want to find the minimum set of individuals in a network to spread information across the entire network. This problem is NP-hard, so find good strategies to deal with it, even for a particular case, is something of interest. We introduce preprocessing rules that allow reducing the size of the input without losing the optimality of the solution when the input graph is a complex network. Such type of network has a set of topological properties that commonly occurs in graphs that model real systems. We present computational experiments with real-world complex networks and synthetic power law graphs. Our strategies do particularly well on graphs with power law degree distribution, such as several real-world complex networks. Such rules provide a notable reduction in the size of the problem and, consequently, gains in scalability.

Download Full-text

Power-law distribution of degree–degree distance: A better representation of the scale-free property of complex networks

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1918901117 ◽

2020 ◽

Vol 117 (26) ◽

pp. 14812-14818 ◽

Cited By ~ 2

Author(s):

Bin Zhou ◽

Xiangyi Meng ◽

H. Eugene Stanley

Keyword(s):

Complex Networks ◽

Complex Network ◽

Power Law ◽

Real World ◽

Preferential Attachment ◽

Statistical Tests ◽

Finite Size ◽

Distance Distribution ◽

Scale Free ◽

Degree Distance

Whether real-world complex networks are scale free or not has long been controversial. Recently, in Broido and Clauset [A. D. Broido, A. Clauset,Nat. Commun.10, 1017 (2019)], it was claimed that the degree distributions of real-world networks are rarely power law under statistical tests. Here, we attempt to address this issue by defining a fundamental property possessed by each link, the degree–degree distance, the distribution of which also shows signs of being power law by our empirical study. Surprisingly, although full-range statistical tests show that degree distributions are not often power law in real-world networks, we find that in more than half of the cases the degree–degree distance distributions can still be described by power laws. To explain these findings, we introduce a bidirectional preferential selection model where the link configuration is a randomly weighted, two-way selection process. The model does not always produce solid power-law distributions but predicts that the degree–degree distance distribution exhibits stronger power-law behavior than the degree distribution of a finite-size network, especially when the network is dense. We test the strength of our model and its predictive power by examining how real-world networks evolve into an overly dense stage and how the corresponding distributions change. We propose that being scale free is a property of a complex network that should be determined by its underlying mechanism (e.g., preferential attachment) rather than by apparent distribution statistics of finite size. We thus conclude that the degree–degree distance distribution better represents the scale-free property of a complex network.

Download Full-text

A Spatial Biosurveillance Synthetic Data Generator in R

Online Journal of Public Health Informatics ◽

10.5210/ojphi.v9i1.7583 ◽

2017 ◽

Vol 9 (1) ◽

Cited By ~ 1

Author(s):

Drew Levin ◽

Patrick Finley

Keyword(s):

Power Law ◽

Real World ◽

Degree Distribution ◽

Transportation Network ◽

Synthetic Data ◽

Data Sets ◽

Real World Data ◽

Data Set ◽

Scale Free ◽

Data Generator

ObjectiveTo develop a spatially accurate biosurveillance synthetic datagenerator for the testing, evaluation, and comparison of new outbreakdetection techniques.IntroductionDevelopment of new methods for the rapid detection of emergingdisease outbreaks is a research priority in the field of biosurveillance.Because real-world data are often proprietary in nature, scientists mustutilize synthetic data generation methods to evaluate new detectionmethodologies. Colizza et. al. have shown that epidemic spread isdependent on the airline transportation network [1], yet current datagenerators do not operate over network structures.Here we present a new spatial data generator that models thespread of contagion across a network of cities connected by airlineroutes. The generator is developed in the R programming languageand produces data compatible with the popular `surveillance’ softwarepackage.MethodsColizza et. al. demonstrate the power-law relationships betweencity population, air traffic, and degree distribution [1]. We generate atransportation network as a Chung-Lu random graph [2] that preservesthese scale-free relationships (Figure 1).First, given a power-law exponent and a desired number of cities,a probability mass function (PMF) is generated that mirrors theexpected degree distribution for the given power-law relationship.Values are then sampled from this PMF to generate an expecteddegree (number of connected cities) for each city in the network.Edges (airline connections) are added to the network probabilisticallyas described in [2]. Unconnected graph components are each joinedto the largest component using linear preferential attachment. Finally,city sizes are calculated based on an observed three-quarter power-law scaling relationship with the sampled degree distribution.Each city is represented as a customizable stochastic compartmentalSIR model. Transportation between cities is modeled similar to [2].An infection is initialized in a single random city and infection countsare recorded in each city for a fixed period of time. A consistentfraction of the modeled infection cases are recorded as daily clinicvisits. These counts are then added onto statically generated baselinedata for each city to produce a full synthetic data set. Alternatively,data sets can be generated using real-world networks, such as the onemaintained by the International Air Transport Association.ResultsDynamics such as the number of cities, degree distribution power-law exponent, traffic flow, and disease kinetics can be customized.In the presented example (Figure 2) the outbreak spreads over a 20city transportation network. Infection spreads rapidly once the morepopulated hub cities are infected. Cities that are multiple flights awayfrom the initially infected city are infected late in the process. Thegenerator is capable of creating data sets of arbitrary size, length, andconnectivity to better mirror a diverse set of observed network types.ConclusionsNew computational methods for outbreak detection andsurveillance must be compared to established approaches. Outbreakmitigation strategies require a realistic model of human transportationbehavior to best evaluate impact. These actions require test data thataccurately reflect the complexity of the real-world data they wouldbe applied to. The outbreak data generated here represents thecomplexity of modern transportation networks and are made to beeasily integrated with established software packages to allow for rapidtesting and deployment.Randomly generated scale-free transportation network with a power-lawdegree exponent ofλ=1.8. City and link sizes are scaled to reflect their weight.An example of observed daily outbreak-related clinic visits across a randomlygenerated network of 20 cities. Each city is colored by the number of flightsrequired to reach the city from the initial infection location. These generatedcounts are then added onto baseline data to create a synthetic data set forexperimentation.KeywordsSimulation; Network; Spatial; Synthetic; Data

Download Full-text

Specialization Models of Network Growth

Journal of Complex Networks ◽

10.1093/comnet/cny024 ◽

2018 ◽

Vol 7 (3) ◽

pp. 375-392 ◽

Cited By ~ 1

Author(s):

L A Bunimovich ◽

D C Smith ◽

B Z Webb

Keyword(s):

Power Law ◽

Network Topology ◽

Real World ◽

Degree Distribution ◽

Small World ◽

Hierarchical Network ◽

Network Growth ◽

Complex Tasks ◽

Small World Property ◽

Law Degree

AbstractOne of the most important features observed in real networks is that, as a network’s topology evolves so does the network’s ability to perform various complex tasks. To explain this, it has also been observed that as a network grows certain subnetworks begin to specialize the function(s) they perform. Herein, we introduce a class of models of network growth based on this notion of specialization and show that as a network is specialized using this method its topology becomes increasingly sparse, modular and hierarchical, each of which are important properties observed in real networks. This procedure is also highly flexible in that a network can be specialized over any subset of its elements. This flexibility allows those studying specific networks the ability to search for mechanisms that describe their growth. For example, we find that by randomly selecting these elements a network’s topology acquires some of the most well-known properties of real networks including the small-world property, disassortativity and a right-skewed degree distribution. Beyond this, we show how this model can be used to generate networks with real-world like clustering coefficients and power-law degree distributions, respectively. As far as the authors know, this is the first such class of models that can create an increasingly modular and hierarchical network topology with these properties.

Download Full-text

Width of a scale-free tree

Journal of Applied Probability ◽

10.1017/s0021900200000814 ◽

2005 ◽

Vol 42 (03) ◽

pp. 839-850 ◽

Cited By ~ 4

Author(s):

Zsolt Katona

Keyword(s):

Theoretical Result ◽

Power Law ◽

Real World ◽

Degree Distribution ◽

Graph Model ◽

Random Graph Model ◽

Scale Free ◽

Slight Generalization ◽

Free Tree ◽

Law Degree

Consider the random graph model of Barabási and Albert, where we add a new vertex in every step and connect it to some old vertices with probabilities proportional to their degrees. If we connect it to only one of the old vertices then this will be a tree. These graphs have been shown to have a power-law degree distribution, the same as that observed in some large real-world networks. We are interested in the width of the tree and we show that it is at the nth step; this also holds for a slight generalization of the model with another constant. We then see how this theoretical result can be applied to directory trees.

Download Full-text

Width of a scale-free tree

Journal of Applied Probability ◽

10.1239/jap/1127322031 ◽

2005 ◽

Vol 42 (3) ◽

pp. 839-850 ◽

Cited By ~ 4

Author(s):

Zsolt Katona

Keyword(s):

Theoretical Result ◽

Power Law ◽

Real World ◽

Degree Distribution ◽

Graph Model ◽

Random Graph Model ◽

Scale Free ◽

Slight Generalization ◽

Free Tree ◽

Law Degree

Consider the random graph model of Barabási and Albert, where we add a new vertex in every step and connect it to some old vertices with probabilities proportional to their degrees. If we connect it to only one of the old vertices then this will be a tree. These graphs have been shown to have a power-law degree distribution, the same as that observed in some large real-world networks. We are interested in the width of the tree and we show that it is at the nth step; this also holds for a slight generalization of the model with another constant. We then see how this theoretical result can be applied to directory trees.

Download Full-text

Analysis of E-Commerce Product Graphs

10.36227/techrxiv.12814244.v1 ◽

2020 ◽

Author(s):

Shalin Shah

Keyword(s):

Random Graph ◽

Random Graphs ◽

Power Law ◽

Real World ◽

Degree Distribution ◽

Graph Model ◽

Graph Analysis ◽

Random Graph Model ◽

Product Graphs ◽

Clustering Coefficients

Consumer behavior in retail stores gives rise to product graphs based on copurchasingor co-viewing behavior. These product graphs can be analyzed usingthe known methods of graph analysis. In this paper, we analyze the product graphat Target Corporation based on the Erd˝os-Renyi random graph model. In particular,we compute clustering coefficients of actual and random graphs, and we find thatthe clustering coefficients of actual graphs are much higher than random graphs.We conduct the analysis on the entire set of products and also on a per categorybasis and find interesting results. We also compute the degree distribution andwe find that the degree distribution is a power law as expected from real worldnetworks, contrasting with the ER random graph.

Download Full-text

Polygon-Based Hierarchical Planar Networks Based on Generalized Apollonian Construction

Physics ◽

10.3390/physics3040063 ◽

2021 ◽

Vol 3 (4) ◽

pp. 998-1014

Author(s):

Mikhail Tamm ◽

Dmitry Koval ◽

Vladimir Stadnichuk

Keyword(s):

Complex Networks ◽

Power Law ◽

Degree Distribution ◽

Small World ◽

Scale Free ◽

Model Network ◽

Suggested Procedure ◽

Similar Construction ◽

Scale Free Graphs ◽

Planar Networks

Experimentally observed complex networks are often scale-free, small-world and have an unexpectedly large number of small cycles. An Apollonian network is one notable example of a model network simultaneously having all three of these properties. This network is constructed by a deterministic procedure of consequentially splitting a triangle into smaller and smaller triangles. In this paper, a similar construction based on the consequential splitting of tetragons and other polygons with an even number of edges is presented. The suggested procedure is stochastic and results in the ensemble of planar scale-free graphs. In the limit of a large number of splittings, the degree distribution of the graph converges to a true power law with an exponent, which is smaller than three in the case of tetragons and larger than three for polygons with a larger number of edges. It is shown that it is possible to stochastically mix tetragon-based and hexagon-based constructions to obtain an ensemble of graphs with a tunable exponent of degree distribution. Other possible planar generalizations of the Apollonian procedure are also briefly discussed.

Download Full-text

Using Power-Law Degree Distribution to Accelerate PageRank

Computer Engineering and Applications Journal ◽

10.18495/comengapp.v1i2.8 ◽

2012 ◽

Vol 1 (2) ◽

pp. 63-70

Author(s):

Zhaoyan Jin ◽

Quanyuan Wu

Keyword(s):

World Wide Web ◽

Power Law ◽

Real World ◽

Degree Distribution ◽

World Wide ◽

Power Law Distribution ◽

The World ◽

Real World Applications ◽

Real World Datasets ◽

Law Degree

The PageRank vector of a network is very important, for it can reflect the importance of a Web page in the World Wide Web, or of a people in a social network. However, with the growth of the World Wide Web and social networks, it needs more and more time to compute the PageRank vector of a network. In many real-world applications, the degree and PageRank distributions of these complex networks conform to the Power-Law distribution. This paper utilizes the degree distribution of a network to initialize its PageRank vector, and presents a Power-Law degree distribution accelerating algorithm of PageRank computation. Experiments on four real-world datasets show that the proposed algorithm converges more quickly than the original PageRank algorithm.DOI:Â 10.18495/comengapp.12.063070

Download Full-text

Detecting different topologies immanent in scale-free networks with the same degree distribution

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1816842116 ◽

2019 ◽

Vol 116 (14) ◽

pp. 6701-6706 ◽

Cited By ~ 12

Author(s):

Dimitrios Tsiotas

Keyword(s):

Complex Networks ◽

Power Law ◽

Network Topology ◽

Degree Distribution ◽

Growth Process ◽

Preferential Attachment ◽

Self Similarity ◽

Scale Free ◽

Scale Free Networks ◽

Definition Of

The scale-free (SF) property is a major concept in complex networks, and it is based on the definition that an SF network has a degree distribution that follows a power-law (PL) pattern. This paper highlights that not all networks with a PL degree distribution arise through a Barabási−Albert (BA) preferential attachment growth process, a fact that, although evident from the literature, is often overlooked by many researchers. For this purpose, it is demonstrated, with simulations, that established measures of network topology do not suffice to distinguish between BA networks and other (random-like and lattice-like) SF networks with the same degree distribution. Additionally, it is examined whether an existing self-similarity metric proposed for the definition of the SF property is also capable of distinguishing different SF topologies with the same degree distribution. To contribute to this discrimination, this paper introduces a spectral metric, which is shown to be more capable of distinguishing between different SF topologies with the same degree distribution, in comparison with the existing metrics.

Download Full-text