Experimental evaluation of latent variable models for dimensionality reduction

Summary The spread of drug resistance amongst clinically-important bacteria is a serious, and growing, problem [1]. However, the analysis of entire genomes requires considerable computational effort, usually including the assembly of the genome and subsequent identification of genes known to be important in pathology. An alternative approach is to use computational algorithms to identify genomic differences between pathogenic and non-pathogenic bacteria, even without knowing the biological meaning of those differences. To overcome this problem, a range of techniques for dimensionality reduction have been developed. One such approach is known as latent-variable models [2]. In latent-variable models dimensionality reduction is achieved by representing a high-dimensional data by a few hidden or latent variables, which are not directly observed but inferred from the observed variables present in the model. Probabilistic Latent Semantic Indexing (PLSA) is an extention of LSA [3]. PLSA is based on a mixture decomposition derived from a latent class model. The main objective of the algorithm, as in LSA, is to represent high-dimensional co-occurrence information in a lower-dimensional way in order to discover the hidden semantic structure of the data using a probabilistic framework. In this work we applied the PLSA approach to analyse the common genomic features in methicillin resistant Staphylococcus aureus, using tokens derived from amino acid sequences rather than DNA. We characterised genome-scale amino acid sequences in terms of their components, and then investigated the relationships between genomes and tokens and the phenotypes they generated. As a control we used the non-pathogenic model Gram-positive bacterium Bacillus subtilis.

Download Full-text

Dimensionality reduction of electropalatographic data using latent variable models

Speech Communication ◽

10.1016/s0167-6393(98)00059-4 ◽

1998 ◽

Vol 26 (4) ◽

pp. 259-282 ◽

Cited By ~ 9

Author(s):

Miguel Á. Carreira-Perpiñán ◽

Steve Renals

Keyword(s):

Dimensionality Reduction ◽

Latent Variable ◽

Latent Variable Models

Download Full-text

Thin Plate Spline Latent Variable Models for dimensionality reduction

The 2012 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2012.6252514 ◽

2012 ◽

Author(s):

Xinwei Jiang ◽

Junbin Gao ◽

Daming Shi ◽

Tianjiang Wang

Keyword(s):

Dimensionality Reduction ◽

Thin Plate ◽

Latent Variable ◽

Latent Variable Models ◽

Thin Plate Spline

Download Full-text

Supplemental Material for Information-Theoretic Latent Distribution Modeling: Distinguishing Discrete and Continuous Latent Variable Models

Psychological Methods ◽

10.1037/1082-989x.11.3.228.supp ◽

2006 ◽

Keyword(s):

Latent Variable ◽

Latent Variable Models ◽

Information Theoretic ◽

Distribution Modeling ◽

Latent Distribution

Download Full-text

Right-Wing Authoritarians Aren't Very Funny: RWA, Personality, and Creative Humor Production

10.31234/osf.io/rwpgn ◽

2020 ◽

Author(s):

Paul Silvia ◽

Alexander P. Christensen ◽

Katherine N. Cotter

Keyword(s):

Young Adults ◽

Latent Variable ◽

Latent Variable Models ◽

Openness To Experience ◽

Rasch Models ◽

Right Wing ◽

Right Wing Authoritarianism ◽

Humor Appreciation ◽

Negative Effect

Right-wing authoritarianism (RWA) has well-known links with humor appreciation, such as enjoying jokes that target deviant groups, but less is known about RWA and creative humor production—coming up with funny ideas oneself. A sample of 186 young adults completed a measure of RWA, the HEXACO-100, and 3 humor production tasks that involved writing funny cartoon captions, creating humorous definitions for quirky concepts, and completing joke stems with punchlines. The humor responses were scored by 8 raters and analyzed with many-facet Rasch models. Latent variable models found that RWA had a large, significant effect on humor production (β = -.47 [-.65, -.30], p < .001): responses created by people high in RWA were rated as much less funny. RWA’s negative effect on humor was smaller but still significant (β = -.25 [-.49, -.01], p = .044) after controlling for Openness to Experience (β = .39 [.20, .59], p < .001) and Conscientiousness (β = -.21 [-.41, -.02], p = .029). Taken together, the findings suggest that people high in RWA just aren’t very funny.

Download Full-text

Looking behind the choice of organic: A cross-country analysis applying Integrated Choice and Latent Variable Models

Appetite ◽

10.1016/j.appet.2021.105591 ◽

2021 ◽

pp. 105591

Author(s):

Ching-Hua Yeh ◽

Monika Hartmann ◽

Matthew Gorton ◽

Barbara Tocco ◽

Virginie Amilien ◽

...

Keyword(s):

Latent Variable ◽

Latent Variable Models ◽

Country Analysis ◽

Cross Country

Download Full-text

Evaluating the Viability of Neurocognition as a Transdiagnostic Construct Using Both Latent Variable Models and Network Analysis

Research on Child and Adolescent Psychopathology ◽

10.1007/s10802-021-00770-8 ◽

2021 ◽

Author(s):

Hana-May Eadeh ◽

Kristian E. Markon ◽

Joel T. Nigg ◽

Molly A. Nikolas

Keyword(s):

Network Analysis ◽

Latent Variable ◽

Latent Variable Models

Download Full-text

Efficient Dimensionality Reduction Methods in Reservoir History Matching

Energies ◽

10.3390/en14113137 ◽

2021 ◽

Vol 14 (11) ◽

pp. 3137

Author(s):

Amine Tadjer ◽

Reider B. Bratvold ◽

Remus G. Hanea

Keyword(s):

Data Assimilation ◽

Dimensionality Reduction ◽

Gaussian Process ◽

Latent Variable ◽

History Matching ◽

Production Performance ◽

Latent Variable Model ◽

Variable Model ◽

Multiple Data ◽

Ensemble Smoother

Production forecasting is the basis for decision making in the oil and gas industry, and can be quite challenging, especially in terms of complex geological modeling of the subsurface. To help solve this problem, assisted history matching built on ensemble-based analysis such as the ensemble smoother and ensemble Kalman filter is useful in estimating models that preserve geological realism and have predictive capabilities. These methods tend, however, to be computationally demanding, as they require a large ensemble size for stable convergence. In this paper, we propose a novel method of uncertainty quantification and reservoir model calibration with much-reduced computation time. This approach is based on a sequential combination of nonlinear dimensionality reduction techniques: t-distributed stochastic neighbor embedding or the Gaussian process latent variable model and clustering K-means, along with the data assimilation method ensemble smoother with multiple data assimilation. The cluster analysis with t-distributed stochastic neighbor embedding and Gaussian process latent variable model is used to reduce the number of initial geostatistical realizations and select a set of optimal reservoir models that have similar production performance to the reference model. We then apply ensemble smoother with multiple data assimilation for providing reliable assimilation results. Experimental results based on the Brugge field case data verify the efficiency of the proposed approach.

Download Full-text

Gaussian Processes Proxy Model with Latent Variable Models and Variogram-Based Sensitivity Analysis for Assisted History Matching

Energies ◽

10.3390/en13174290 ◽

2020 ◽

Vol 13 (17) ◽

pp. 4290

Author(s):

Dongmei Zhang ◽

Yuyang Zhang ◽

Bohou Jiang ◽

Xinwei Jiang ◽

Zhijiang Kang

Keyword(s):

Sensitivity Analysis ◽

Gaussian Processes ◽

Latent Variable ◽

History Matching ◽

Latent Variable Models ◽

High Dimensional ◽

Model Parameters ◽

Variable Model ◽

Assisted History Matching ◽

Proxy Models

Reservoir history matching is a well-known inverse problem for production prediction where enormous uncertain reservoir parameters of a reservoir numerical model are optimized by minimizing the misfit between the simulated and history production data. Gaussian Process (GP) has shown promising performance for assisted history matching due to the efficient nonparametric and nonlinear model with few model parameters to be tuned automatically. Recently introduced Gaussian Processes proxy models and Variogram Analysis of Response Surface-based sensitivity analysis (GP-VARS) uses forward and inverse Gaussian Processes (GP) based proxy models with the VARS-based sensitivity analysis to optimize the high-dimensional reservoir parameters. However, the inverse GP solution (GPIS) in GP-VARS are unsatisfactory especially for enormous reservoir parameters where the mapping from low-dimensional misfits to high-dimensional uncertain reservoir parameters could be poorly modeled by GP. To improve the performance of GP-VARS, in this paper we propose the Gaussian Processes proxy models with Latent Variable Models and VARS-based sensitivity analysis (GPLVM-VARS) where Gaussian Processes Latent Variable Model (GPLVM)-based inverse solution (GPLVMIS) instead of GP-based GPIS is provided with the inputs and outputs of GPIS reversed. The experimental results demonstrate the effectiveness of the proposed GPLVM-VARS in terms of accuracy and complexity. The source code of the proposed GPLVM-VARS is available at https://github.com/XinweiJiang/GPLVM-VARS.

Download Full-text