network sampling
Recently Published Documents


TOTAL DOCUMENTS

97
(FIVE YEARS 20)

H-INDEX

19
(FIVE YEARS 1)

2021 ◽  
Vol 23 ◽  
Author(s):  
Caijun Qin

This paper proposes a novel, exploration-based network sampling algorithm called caterpillar quota walk sampling (CQWS) inspired by the caterpillar tree. Network sampling identifies a subset of nodes and edges from a network, creating an induced graph. Beginning from an initial node, exploration-based sampling algorithms grow the induced set by traversing and tracking unvisited neighboring nodes from the original network. Tunable and trainable parameters allow CQWS to maximize the sum of the degrees of the induced graph from multiple trials when sampling dense networks. A network spread model renders effective use in various applications, including tracking the spread of epidemics, visualizing information transmissions through social media, and cell-to-cell spread of neurodegenerative diseases. CQWS generates a spread model as its sample by visiting the highest-degree neighbors of previously visited nodes. For each previously visited node, a top proportion of the highest-degree neighbors fulfills a quota and branches into a new caterpillar tree. Sampling more high-degree nodes constitutes an objective among various applications. Many exploration-based sampling algorithms suffer drawbacks that limit the sum of degrees of visited nodes and thus the number of high-degree nodes visited. Furthermore, a strategy may not be adaptable to volatile degree frequencies throughout the original network architecture, which influences how deep into the original network an algorithm could sample. This paper analyzes CQWS in comparison to four other exploration-based network in tackling these two problems by sampling sparse and dense randomly generated networks.


Author(s):  
Lucas Marcondes Pavelski ◽  
Marie-Eleonore Kessaci ◽  
Myriam Delgado

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Eyal Soreq ◽  
Ines R. Violante ◽  
Richard E. Daws ◽  
Adam Hampshire

AbstractDespite a century of research, it remains unclear whether human intelligence should be studied as one dominant, several major, or many distinct abilities, and how such abilities relate to the functional organisation of the brain. Here, we combine psychometric and machine learning methods to examine in a data-driven manner how factor structure and individual variability in cognitive-task performance relate to dynamic-network connectomics. We report that 12 sub-tasks from an established intelligence test can be accurately multi-way classified (74%, chance 8.3%) based on the network states that they evoke. The proximities of the tasks in behavioural-psychometric space correlate with the similarities of their network states. Furthermore, the network states were more accurately classified for higher relative to lower performing individuals. These results suggest that the human brain uses a high-dimensional network-sampling mechanism to flexibly code for diverse cognitive tasks. Population variability in intelligence test performance relates to the fidelity of expression of these task-optimised network states.


2021 ◽  
Vol 15 (4) ◽  
pp. 1-27
Author(s):  
Nesreen K. Ahmed ◽  
Nick Duffield ◽  
Ryan A. Rossi

Temporal networks representing a stream of timestamped edges are seemingly ubiquitous in the real world. However, the massive size and continuous nature of these networks make them fundamentally challenging to analyze and leverage for descriptive and predictive modeling tasks. In this work, we propose a general framework for temporal network sampling with unbiased estimation. We develop online, single-pass sampling algorithms, and unbiased estimators for temporal network sampling. The proposed algorithms enable fast, accurate, and memory-efficient statistical estimation of temporal network patterns and properties. In addition, we propose a temporally decaying sampling algorithm with unbiased estimators for studying networks that evolve in continuous time, where the strength of links is a function of time, and the motif patterns are temporally weighted. In contrast to the prior notion of a △ t -temporal motif, the proposed formulation and algorithms for counting temporally weighted motifs are useful for forecasting tasks in networks such as predicting future links, or a future time-series variable of nodes and links. Finally, extensive experiments on a variety of temporal networks from different domains demonstrate the effectiveness of the proposed algorithms. A detailed ablation study is provided to understand the impact of the various components of the proposed framework.


2021 ◽  
Vol 15 (2) ◽  
pp. 1-21
Author(s):  
Michele Coscia

In this article, we propose a new method to perform topological network sampling. Topological network sampling is a process for extracting a subset of nodes and edges from a network, such that analyses on the sample provide results and conclusions comparable to the ones they would return if run on whole structure. We need network sampling because the largest online network datasets are accessed through low-throughput application programming interface (API) systems, rendering the collection of the whole network infeasible. Our method is inspired by the literature on network backboning, specifically the noise-corrected backbone. We select the next node to explore by following the edge we identify as the one providing the largest information gain, given the topology of the sample explored so far. We evaluate our method against the most commonly used sampling methods. We do so in a realistic framework, considering a wide array of network topologies, network analysis, and features of API systems. There is no method that can provide the best sample in all possible scenarios, thus in our results section, we show the cases in which our method performs best and the cases in which it performs worst. Overall, the noise-corrected network sampling performs well: it has the best rank average among the tested methods across a wide range of applications.


Author(s):  
Elizabeth M. Y. Lee ◽  
Thomas Ludwig ◽  
Boyuan Yu ◽  
Aayush R. Singh ◽  
François Gygi ◽  
...  

Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1905
Author(s):  
Jaekoo Lee ◽  
MyungKeun Yoon ◽  
Song Noh

Recently, researchers have paid attention to many types of huge networks such as the Internet of Things, sensor networks, social networks, and traffic networks because of their untapped potential for theoretical and practical outcomes. A major obstacle in studying large-scale networks is that their size tends to increase exponentially. In addition, access to large network databases is limited for security or physical connection reasons. In this paper, we propose a novel sampling method that works effectively for large-scale networks. The proposed approach makes multiple heterogeneous Markov chains by adjusting random-walk traits on the given network to explore the target space efficiently. This approach provides better unbiased sampling results with reduced asymptotic variance within reasonable execution time than previous random-walk-based sampling approaches. We perform various experiments on large networks databases obtained from synthesis to real–world applications. The results demonstrate that the proposed method outperforms existing network sampling methods.


2021 ◽  
Vol 7 (1) ◽  
pp. 205630512098447
Author(s):  
Felix Victor Münch ◽  
Ben Thies ◽  
Cornelius Puschmann ◽  
Axel Bruns

Twitter continuously tightens the access to its data via the publicly accessible, cost-free standard APIs. This especially applies to the follow network. In light of this, we successfully modified a network sampling method to work efficiently with the Twitter standard API in order to retrieve the most central and influential accounts of a language-based Twitter follow network: the German Twittersphere. We provide evidence that the method is able to approximate a set of the top 1% to 10% of influential accounts in the German Twittersphere in terms of activity, follower numbers, coverage, and reach. Furthermore, we demonstrate the usefulness of these data by presenting the first overview of topical communities within the German Twittersphere and their network structure. The presented data mining method opens up further avenues of enquiry, such as the collection and comparison of language-based Twitterspheres other than the German one, its further development for the collection of follow networks around certain topics or accounts of interest, and its application to other online social networks and platforms in conjunction with concepts such as agenda setting and opinion leadership.


Sign in / Sign up

Export Citation Format

Share Document