Dimensionality reduction approach for high dimensional text documents

Abstract: Emerging single-cell technologies profile multiple types of molecules within individual cells. A fundamental step in the analysis of the produced high-dimensional data is their visualization using dimensionality reduction techniques such as t-SNE and UMAP. We introduce j-SNE and j-UMAP as their natural generalizations to the joint visualization of multimodal omics data. Our approach automatically learns the relative contribution of each modality to a concise representation of cellular identity that promotes discriminative features but suppresses noise. On eight datasets, j-SNE and j-UMAP produce unified embeddings that better agree with known cell types and that harmonize RNA and protein velocity landscapes.

PCA-KL: a parametric dimensionality reduction approach for unsupervised metric learning

Advances in Data Analysis and Classification, 2021

DOI: 10.1007/s11634-020-00434-3

Author(s): Alexandre L. M. Levada

Keyword(s): Dimensionality Reduction, Metric Learning, Reduction Approach

An adaptive and efficient dimensionality reduction algorithm for high-dimensional indexing

Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405), 2004

DOI: 10.1109/icde.2003.1260784

Cited By ~ 22

Author(s): H. Jin, B.C. Ooi, H.T. Shen, C. Yu, Ao Ying Zhou

Keyword(s): Dimensionality Reduction, High Dimensional, Reduction Algorithm, High Dimensional Indexing

Parallel Framework for Dimensionality Reduction of Large-Scale Datasets

Scientific Programming, 2015, Vol 2015, pp. 1-12

DOI: 10.1155/2015/180214

Cited By ~ 3

Author(s): Sai Kiranmayee Samudrala, Jaroslaw Zola, Srinivas Aluru, Baskar Ganapathysubramanian

Keyword(s): Dimensionality Reduction, Organic Solar Cells, Large Scale, Parallel Implementation, High Dimensional Data, Real Life, Processing Parameters, High Dimensional, Morphology Evolution, Reduction Techniques

Dimensionality reduction refers to a set of mathematical techniques used to reduce the complexity of the original high-dimensional data while preserving its selected properties. Improvements in simulation strategies and experimental data collection methods are resulting in a deluge of heterogeneous and high-dimensional data, which often makes dimensionality reduction the only viable way to gain qualitative and quantitative understanding of the data. However, existing dimensionality reduction software often does not scale to datasets arising in real-life applications, which may consist of thousands of points with millions of dimensions. In this paper, we propose a parallel framework for dimensionality reduction of large-scale data. We identify key components underlying the spectral dimensionality reduction techniques and propose their efficient parallel implementation. We show that the resulting framework can be used to process datasets consisting of millions of points when executed on a 16,000-core cluster, which is beyond the reach of currently available methods. To further demonstrate the applicability of our framework, we perform dimensionality reduction of 75,000 images representing morphology evolution during the manufacturing of organic solar cells, in order to identify how processing parameters affect morphology evolution.

Spectrus: A Dimensionality Reduction Approach for Identifying Dynamical Domains in Protein Complexes from Limited Structural Datasets

Biophysical Journal, 2016, Vol 110 (3), pp. 54a

DOI: 10.1016/j.bpj.2015.11.355

Author(s): Luca Ponzoni, Guido Polles, Vincenzo Carnevale, Cristian Micheletti

Keyword(s): Dimensionality Reduction, Protein Complexes, Reduction Approach
