mixed membership models
Recently Published Documents


TOTAL DOCUMENTS

29
(FIVE YEARS 2)

H-INDEX

6
(FIVE YEARS 0)

Symmetry ◽  
2021 ◽  
Vol 13 (7) ◽  
pp. 1167
Author(s):  
Barry M. Dillon ◽  
Darius A. Faroughy ◽  
Jernej F. Kamenik ◽  
Manuel Szewc

We summarize our recent work on how to infer on jet formation processes directly from substructure data using generative statistical models. We recount in detail how to cast jet substructure observables’ measurements in terms of Bayesian mixed membership models, in particular Latent Dirichlet Allocation. Using a mixed sample of QCD and boosted tt¯ jet events and focusing on the primary Lund plane observable basis for event measurements, we show how using educated priors on the latent distributions allows to infer on the underlying physical processes in a semi-supervised way.


Stat ◽  
2016 ◽  
Vol 5 (1) ◽  
pp. 57-69 ◽  
Author(s):  
Seppo Virtanen ◽  
Mattias Rost ◽  
Alistair Morrison ◽  
Matthew Chalmers ◽  
Mark Girolami

2015 ◽  
Author(s):  
Aaron A. Behr ◽  
Katherine Z. Liu ◽  
Gracie Liu-Fang ◽  
Priyanka Nakka ◽  
Sohini Ramachandran

Abstract1MotivationA series of methods in population genetics use multilocus genotype data to assign individuals membership in latent clusters. These methods belong to a broad class of mixed-membership models, such as latent Dirichlet allocation used to analyze text corpora. Inference from mixed-membership models can produce different output matrices when repeatedly applied to the same inputs, and the number of latent clusters is a parameter that is often varied in the analysis pipeline. For these reasons, quantifying, visualizing, and annotating the output from mixed-membership models are bottlenecks for investigators across multiple disciplines from ecology to text data mining.2ResultsWe introduce pong, a network-graphical approach for analyzing and visualizing membership in latent clusters with a native D3.js interactive visualization. pong leverages efficient algorithms for solving the Assignment Problem to dramatically reduce runtime while increasing accuracy compared to other methods that process output from mixed-membership models. We apply pong to 225,705 unlinked genome-wide single-nucleotide variants from 2,426 unrelated individuals in the 1000 Genomes Project, and identify previously overlooked aspects of global human population structure. We show that pong outpaces current solutions by more than an order of magnitude in runtime while providing a customizable and interactive visualization of population structure that is more accurate than those produced by current tools.3Availabilitypong is freely available and can be installed using the Python package management system pip. pong’s source code is available at https://github.com/abehr/[email protected],[email protected]


Sign in / Sign up

Export Citation Format

Share Document