Robust Bilinear Probabilistic Principal Component Analysis

Principal component analysis (PCA) is one of the most popular tools in multivariate exploratory data analysis. Its probabilistic version (PPCA), based on maximum likelihood estimation, provides a probabilistic approach to dimension reduction. Recently, the bilinear PPCA (BPPCA) model, which assumes that the noise terms follow matrix variate Gaussian distributions, has been introduced to deal directly with two-dimensional (2-D) data such as images, preserving their matrix structure and avoiding the curse of dimensionality. However, the Gaussian assumption does not always hold in real-life applications, where data sets may contain outliers. To make BPPCA robust to outliers, in this paper we propose a robust BPPCA model under the assumption that the noise terms follow matrix variate t distributions. The alternating expectation conditional maximization (AECM) algorithm is used to estimate the model parameters. Numerical examples on several synthetic and publicly available data sets are presented to demonstrate the superiority of the proposed model in feature extraction, classification and outlier detection.
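As background for the maximum likelihood formulation mentioned in this abstract, the following is a minimal sketch of standard (vector-valued, Gaussian) PPCA using the well-known closed-form estimates. It is not the bilinear or robust model proposed in the paper; the function name `ppca_ml` and the synthetic data are assumptions made purely for illustration.

```python
import numpy as np

def ppca_ml(X, q):
    """Closed-form maximum likelihood PPCA for the rows of X (requires q < d)."""
    n, d = X.shape
    mu = X.mean(axis=0)
    # Sample covariance with the 1/n normalisation used by the ML estimate.
    S = (X - mu).T @ (X - mu) / n
    eigvals, eigvecs = np.linalg.eigh(S)          # eigenvalues in ascending order
    eigvals, eigvecs = eigvals[::-1], eigvecs[:, ::-1]
    # Noise variance: average of the d - q discarded eigenvalues.
    sigma2 = eigvals[q:].mean()
    # Loading matrix, defined up to an arbitrary rotation (taken as identity).
    W = eigvecs[:, :q] * np.sqrt(np.maximum(eigvals[:q] - sigma2, 0.0))
    return mu, W, sigma2

# Hypothetical synthetic data: 2 latent factors observed in 5 dimensions.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 2))
mixing = rng.normal(size=(2, 5))
X = latent @ mixing + 0.1 * rng.normal(size=(500, 5))

mu, W, sigma2 = ppca_ml(X, q=2)
print(W.shape, sigma2)
```

The fitted covariance `W @ W.T + sigma2 * I` matches the sample covariance on its leading `q` eigenvalues. Replacing the Gaussian noise with a heavy-tailed t distribution, as the abstract describes, no longer admits this closed form, which is why an iterative scheme such as AECM is needed there.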

Image Quality Measurement by Probabilistic Principal Component Analysis

2020 6th International Symposium on System and Software Reliability (ISSSR) ◽

10.1109/isssr51244.2020.00031 ◽

2020 ◽

Author(s):

Hua-Wen Chang ◽

Kai Chen ◽

Xiao-Dong Bi ◽

Ming-Hui Wang

Keyword(s):

Principal Component Analysis ◽

Image Quality ◽

Quality Measurement ◽

Principal Component ◽

Component Analysis ◽

Probabilistic Principal Component Analysis

Learning robust manipulation tasks involving contact using trajectory parameterized probabilistic principal component analysis

2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros45743.2020.9364328 ◽

2020 ◽

Author(s):

Cristian Vergara Perico ◽

Joris de Schutter ◽

Erwin Aertbelien

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Probabilistic Principal Component Analysis

Probabilistic principal component analysis with expectation maximization (PPCA-EM) facilitates volume classification and estimates the missing data

Journal of Structural Biology ◽

10.1016/j.jsb.2010.04.002 ◽

2010 ◽

Vol 171 (1) ◽

pp. 18-30 ◽

Cited By ~ 34

Author(s):

Lingbo Yu ◽

Robert R. Snapp ◽

Teresa Ruiz ◽

Michael Radermacher

Keyword(s):

Principal Component Analysis ◽

Missing Data ◽

Expectation Maximization ◽

Principal Component ◽

Component Analysis ◽

Probabilistic Principal Component Analysis

3D face dense reconstruction based on sparse points using probabilistic principal component analysis

Multimedia Tools and Applications ◽

10.1007/s11042-021-11707-0 ◽

2021 ◽

Author(s):

Xiaoxiao Xie ◽

Xingce Wang ◽

Zhongke Wu

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

3D Face ◽

Dense Reconstruction ◽

Probabilistic Principal Component Analysis

Dimensionality and Its Reduction

Statistics, Data Mining, and Machine Learning in Astronomy ◽

10.23943/princeton/9780691151687.003.0007 ◽

2014 ◽

Author(s):

Andrew J. Connolly ◽

Jacob T. VanderPlas ◽

Alexander Gray ◽

...

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Reduction Technique ◽

High Dimensional ◽

Data Sets ◽

Data Set ◽

Gaussian Distributions ◽

Dimensionality Reduction Technique ◽

Alternative Techniques ◽

New Generation

With the dramatic increase in data available from a new generation of astronomical telescopes and instruments, many analyses must address the question of the complexity as well as size of the data set. This chapter deals with how we can learn which measurements, properties, or combinations thereof carry the most information within a data set. It describes techniques that are related to concepts discussed when describing Gaussian distributions, density estimation, and the concepts of information content. The chapter begins with an exploration of the problems posed by high-dimensional data. It then describes the data sets used in this chapter, and introduces perhaps the most important and widely used dimensionality reduction technique, principal component analysis (PCA). The remainder of the chapter discusses several alternative techniques which address some of the weaknesses of PCA.
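To make the PCA step described in this abstract concrete, here is a minimal sketch of PCA computed from the singular value decomposition of the centred data matrix. The function name `pca_svd` and the synthetic data are illustrative assumptions and are not taken from the chapter.

```python
import numpy as np

def pca_svd(X, n_components):
    """PCA of the rows of X via SVD of the mean-centred data."""
    Xc = X - X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by decreasing variance.
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]
    scores = Xc @ components.T                    # projected coordinates
    explained_ratio = (s ** 2 / np.sum(s ** 2))[:n_components]
    return scores, components, explained_ratio

# Hypothetical data: 200 samples, 6 measurements driven by 2 latent factors.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2)) @ rng.normal(size=(2, 6))
X += 0.05 * rng.normal(size=(200, 6))

scores, components, explained_ratio = pca_svd(X, n_components=2)
print(scores.shape, explained_ratio)
```

Because the data here are generated from two factors plus small noise, the first two components should account for nearly all of the variance.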

Principal Component Analysis of Hydrological Data

Handbook of Research on Hydroinformatics ◽

10.4018/978-1-61520-907-1.ch018 ◽

2010 ◽

pp. 364-388

Author(s):

Petr Praus

Keyword(s):

Water Quality ◽

Principal Component Analysis ◽

Drinking Water ◽

Ground Water ◽

Principal Component ◽

Component Analysis ◽

Data Sets ◽

Hydrological Data ◽

First Case

In this chapter the principles and applications of principal component analysis (PCA) applied to hydrological data are presented. Four case studies show how PCA can be used to obtain information about a wastewater treatment process and about drinking water quality in a city network, and to find similarities in data sets of ground water quality results and water-related images. In the first case study, the composition of raw and cleaned wastewater was characterised and its temporal changes were displayed. In the second case study, drinking water samples were divided into clusters consistent with their sampling localities. In case study III, similar samples of ground water were recognised by calculating the cosine similarity and the Euclidean and Manhattan distances. In case study IV, 32 water-related images were transformed into a large image matrix whose dimensionality was reduced by PCA. The images were clustered using the PCA scatter plots.
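The three similarity measures named for case study III can be sketched as follows. The two sample vectors are made-up values for illustration only and do not come from the chapter's ground water data.

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine of the angle between a and b (1.0 means same direction)."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def euclidean_distance(a, b):
    """Straight-line (L2) distance between a and b."""
    return float(np.linalg.norm(a - b))

def manhattan_distance(a, b):
    """Sum of absolute coordinate differences (L1 distance)."""
    return float(np.abs(a - b).sum())

# Hypothetical samples: b has the same composition as a at twice the level.
a = np.array([1.0, 2.0, 3.0])
b = np.array([2.0, 4.0, 6.0])

print(cosine_similarity(a, b))    # direction only, ignores magnitude
print(euclidean_distance(a, b))
print(manhattan_distance(a, b))
```

Cosine similarity ignores overall magnitude, so two samples with proportional compositions look identical under it, while the Euclidean and Manhattan distances still separate them.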

Soft sensor design and fault detection using Bayesian network and probabilistic principal component analysis

Journal of Advanced Manufacturing and Processing ◽

10.1002/amp2.10027 ◽

2019 ◽

Vol 1 (4) ◽

Cited By ~ 2

Author(s):

Ahad Mohammadi ◽

Reza Zarghami ◽

Dimitri Lefebvre ◽

Shahab Golshan ◽

Navid Mostoufi

Keyword(s):

Principal Component Analysis ◽

Fault Detection ◽

Bayesian Network ◽

Principal Component ◽

Component Analysis ◽

Soft Sensor ◽

Sensor Design ◽

Probabilistic Principal Component Analysis

Appearance Based Generic Object Modeling and Recognition Using Probabilistic Principal Component Analysis

Lecture Notes in Computer Science - Pattern Recognition ◽

10.1007/3-540-45783-6_13 ◽

2002 ◽

pp. 100-108 ◽

Cited By ~ 1

Author(s):

Christopher Drexler ◽

Frank Mattern ◽

Joachim Denzler

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Object Modeling ◽

Probabilistic Principal Component Analysis

Differentially Expressed Genes Extracted by the Tensor Robust Principal Component Analysis (TRPCA) Method

Complexity ◽

10.1155/2019/6136245 ◽

2019 ◽

Vol 2019 ◽

pp. 1-13 ◽

Cited By ~ 1

Author(s):

Yue Hu ◽

Jin-Xing Liu ◽

Ying-Lian Gao ◽

Sheng-Jun Li ◽

Juan Wang

Keyword(s):

Principal Component Analysis ◽

Differentially Expressed Genes ◽

Principal Component ◽

Component Analysis ◽

Differentially Expressed ◽

Low Rank ◽

Cancer Gene ◽

Sequencing Data ◽

Robust Principal Component Analysis ◽

The Matrix

In the big data era, sequencing technology has produced a large amount of biological sequencing data. Different views of the cancer genome data provide sufficient complementary information to explore genetic activity. The identification of differentially expressed genes from multiview cancer gene data is of great importance in cancer diagnosis and treatment. In this paper, we propose a novel method for identifying differentially expressed genes based on tensor robust principal component analysis (TRPCA), which extends the matrix method to the processing of multiway data. The method proceeds as follows. First, multiview data containing cancer gene expression data from different sources are prepared. Second, the original tensor is decomposed into a sum of a low-rank tensor and a sparse tensor using TRPCA. Third, the differentially expressed genes are treated as sparse perturbed signals and identified from the sparse tensor. Fourth, the differentially expressed genes are evaluated using Gene Ontology and Gene Cards tools. The validity of the TRPCA method was tested using two sets of multiview data. The experimental results showed that our method outperforms the representative methods in both efficiency and accuracy.
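The low-rank plus sparse decomposition at the core of this abstract can be illustrated with the matrix case that TRPCA extends. The sketch below is a commonly used alternating scheme for matrix robust PCA (principal component pursuit with an augmented Lagrangian), not the tensor algorithm of the paper; the function names, the default choices of `lam` and `mu`, and the synthetic data are all assumptions for illustration.

```python
import numpy as np

def soft_threshold(X, tau):
    """Elementwise shrinkage towards zero by tau."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)

def svd_threshold(X, tau):
    """Shrink the singular values of X by tau (promotes low rank)."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U * soft_threshold(s, tau)) @ Vt

def robust_pca(M, max_iter=1000, tol=1e-7):
    """Split M into low-rank L and sparse S with M ~ L + S."""
    M = np.asarray(M, dtype=float)
    n1, n2 = M.shape
    lam = 1.0 / np.sqrt(max(n1, n2))          # assumed sparsity weight
    mu = n1 * n2 / (4.0 * np.abs(M).sum())    # assumed penalty parameter
    S = np.zeros_like(M)
    Y = np.zeros_like(M)                      # Lagrange multiplier
    for _ in range(max_iter):
        L = svd_threshold(M - S + Y / mu, 1.0 / mu)
        S = soft_threshold(M - L + Y / mu, lam / mu)
        residual = M - L - S
        Y = Y + mu * residual
        if np.linalg.norm(residual) <= tol * np.linalg.norm(M):
            break
    return L, S

# Hypothetical data: rank-2 matrix with 5% of entries grossly corrupted.
rng = np.random.default_rng(0)
L0 = rng.normal(size=(50, 2)) @ rng.normal(size=(2, 50))
S0 = np.zeros((50, 50))
mask = rng.random((50, 50)) < 0.05
S0[mask] = 10.0 * rng.choice([-1.0, 1.0], size=mask.sum())
M = L0 + S0

L, S = robust_pca(M)
print(np.linalg.norm(L - L0) / np.linalg.norm(L0))
```

In the gene expression setting described above, the large entries of the sparse part would play the role of the sparse perturbed signals from which differentially expressed genes are identified.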

A Study of Effectiveness of Principal Component Analysis on Different Data Sets

2017 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) ◽

10.1109/iccic.2017.8524329 ◽

2017 ◽

Author(s):

Mukti Krishnan ◽

Dipankar Dutta

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Data Sets
