scholarly journals Mutual Information for the Stochastic Block Model by the Adaptive Interpolation Method

Author(s):  
Jean Barbier ◽  
Chun Lam Chan ◽  
Nicolas Macris
Author(s):  
Clément Luneau ◽  
Jean Barbier ◽  
Nicolas Macris

Abstract We consider a statistical model for finite-rank symmetric tensor factorization and prove a single-letter variational expression for its asymptotic mutual information when the tensor is of even order. The proof applies the adaptive interpolation method originally invented for rank-one factorization. Here we show how to extend the adaptive interpolation to finite-rank and even-order tensors. This requires new non-trivial ideas with respect to the current analysis in the literature. We also underline where the proof falls short when dealing with odd-order tensors.


2014 ◽  
Vol 24 (11) ◽  
pp. 2699-2709 ◽  
Author(s):  
Bian-Fang CHAI ◽  
Jian YU ◽  
Cai-Yan JIA ◽  
Jing-Hong WANG

2020 ◽  
Vol 34 (04) ◽  
pp. 3641-3648 ◽  
Author(s):  
Eli Chien ◽  
Antonia Tulino ◽  
Jaime Llorca

The geometric block model is a recently proposed generative model for random graphs that is able to capture the inherent geometric properties of many community detection problems, providing more accurate characterizations of practical community structures compared with the popular stochastic block model. Galhotra et al. recently proposed a motif-counting algorithm for unsupervised community detection in the geometric block model that is proved to be near-optimal. They also characterized the regimes of the model parameters for which the proposed algorithm can achieve exact recovery. In this work, we initiate the study of active learning in the geometric block model. That is, we are interested in the problem of exactly recovering the community structure of random graphs following the geometric block model under arbitrary model parameters, by possibly querying the labels of a limited number of chosen nodes. We propose two active learning algorithms that combine the use of motif-counting with two different label query policies. Our main contribution is to show that sampling the labels of a vanishingly small fraction of nodes (sub-linear in the total number of nodes) is sufficient to achieve exact recovery in the regimes under which the state-of-the-art unsupervised method fails. We validate the superior performance of our algorithms via numerical simulations on both real and synthetic datasets.


Sign in / Sign up

Export Citation Format

Share Document