dimension reduction Latest Research Papers

Robust Inference of Conditional Average Treatment Effects Using Dimension Reduction

Statistica Sinica ◽

10.5705/ss.202020.0409 ◽

2023 ◽

Author(s):

Ming-Yueh Huang ◽

Shu Yang

Keyword(s):

Dimension Reduction ◽

Treatment Effects ◽

Robust Inference ◽

Conditional Average ◽

Average Treatment ◽

Average Treatment Effects

A Minimum Discrepancy Approach With Fourier Transform in Sufficient Dimension Reduction

Statistica Sinica ◽

10.5705/ss.202020.0312 ◽

2023 ◽

Author(s):

Jiaying Weng ◽

Xiangrong Yin

Keyword(s):

Fourier Transform ◽

Dimension Reduction ◽

Sufficient Dimension Reduction ◽

Minimum Discrepancy

Distributed Sufficient Dimension Reduction for Heterogeneous Massive Data

Statistica Sinica ◽

10.5705/ss.202021.0031 ◽

2023 ◽

Author(s):

Kelin Xu ◽

Liping Zhu ◽

Jianqing Fan

Keyword(s):

Dimension Reduction ◽

Massive Data ◽

Sufficient Dimension Reduction

Dictionary learning allows model-free pseudotime estimation of transcriptomic data

BMC Genomics ◽

10.1186/s12864-021-08276-9 ◽

2022 ◽

Vol 23 (1) ◽

Author(s):

Mona Rams ◽

Tim O.F. Conrad

Keyword(s):

Dimension Reduction ◽

Dictionary Learning ◽

Real World ◽

Estimation Methods ◽

Dynamic Processes ◽

Dimensional Representation ◽

Transcriptomic Data ◽

Model Free ◽

Real World Datasets ◽

Low Dimensional

Abstract Background Pseudotime estimation from dynamic single-cell transcriptomic data enables characterisation and understanding of the underlying processes, for example developmental processes. Various pseudotime estimation methods have been proposed during the last years. Typically, these methods start with a dimension reduction step because the low-dimensional representation is usually easier to analyse. Approaches such as PCA, ICA or t-SNE belong to the most widely used methods for dimension reduction in pseudotime estimation methods. However, these methods usually make assumptions on the derived dimensions, which can result in important dataset properties being missed. In this paper, we suggest a new dictionary learning based approach, dynDLT, for dimension reduction and pseudotime estimation of dynamic transcriptomic data. Dictionary learning is a matrix factorisation approach that does not restrict the dependence of the derived dimensions. To evaluate the performance, we conduct a large simulation study and analyse 8 real-world datasets. Results The simulation studies reveal that firstly, dynDLT preserves the simulated patterns in low-dimension and the pseudotimes can be derived from the low-dimensional representation. Secondly, the results show that dynDLT is suitable for the detection of genes exhibiting the simulated dynamic patterns, thereby facilitating the interpretation of the compressed representation and thus the dynamic processes. For the real-world data analysis, we select datasets with samples that are taken at different time points throughout an experiment. The pseudotimes found by dynDLT have high correlations with the experimental times. We compare the results to other approaches used in pseudotime estimation, or those that are method-wise closely connected to dictionary learning: ICA, NMF, PCA, t-SNE, and UMAP. DynDLT has the best overall performance for the simulated and real-world datasets. Conclusions We introduce dynDLT, a method that is suitable for pseudotime estimation. Its main advantages are: (1) It presents a model-free approach, meaning that it does not restrict the dependence of the derived dimensions; (2) Genes that are relevant in the detected dynamic processes can be identified from the dictionary matrix; (3) By a restriction of the dictionary entries to positive values, the dictionary atoms are highly interpretable.

Model Averaging Based Sufficient Dimension Reduction

Stat ◽

10.1002/sta4.458 ◽

2022 ◽

Author(s):

Min Cai ◽

Ruige Zhuang ◽

Zhou Yu ◽

Ping Wu

Keyword(s):

Dimension Reduction ◽

Model Averaging ◽

Sufficient Dimension Reduction

Explainable t-SNE for single-cell RNA-seq data analysis

10.1101/2022.01.12.476084 ◽

2022 ◽

Author(s):

Henry Han ◽

Tianyu Zhang ◽

Mary Lauren Benton ◽

Chun Li ◽

Juan Wang ◽

...

Keyword(s):

Gene Expression ◽

Machine Learning ◽

Data Analysis ◽

Dimension Reduction ◽

Single Cell ◽

Method Development ◽

Robustness Analysis ◽

High Dimensional ◽

Machine Learning Method ◽

Learning Method

Single-cell RNA (scRNA-seq) sequencing technologies trigger the study of individual cell gene expression and reveal the diversity within cell populations. To measure cell-to-cell similarity based on their transcription and gene expression, many dimension reduction methods are employed to retrieve the corresponding low-dimensional embeddings of input scRNA-seq data to conduct clustering. However, the methods lack explainability and may not perform well with scRNA-seq data because they are often migrated from other fields and not customized for high-dimensional sparse scRNA-seq data. In this study, we propose an explainable t-SNE: cell-driven t-SNE (c-TSNE) that fuses the cell differences reflected from biologically meaningful distance metrics for input scRNA-seq data. Our study shows that the proposed method not only enhances the interpretation of the original t-SNE visualization for scRNA-seq data but also demonstrates favorable single cell segregation performance on benchmark datasets compared to the state-of-the-art peers. The robustness analysis shows that the proposed cell-driven t-SNE demonstrates robustness to dropout and noise in dimension reduction and clustering. It provides a novel and practical way to investigate the interpretability of t-SNE in scRNA-seq data analysis. Unlike the general assumption that the explainanbility of a machine learning method needs to compromise with the learning efficiency, the proposed explainable t-SNE improves both clustering efficiency and explainanbility in scRNA-seq analysis. More importantly, our work suggests that widely used t-SNE can be easily misused in the existing scRNA-seq analysis, because its default Euclidean distance can bring biases or meaningless results in cell difference evaluation for high-dimensional sparse scRNA-seq data. To the best of our knowledge, it is the first explainable t-SNE proposed in scRNA-seq analysis and will inspire other explainable machine learning method development in the field.

Sufficient dimension reduction in the presence of controlling variables

Science China Mathematics ◽

10.1007/s11425-020-1824-8 ◽

2022 ◽

Author(s):

Guoliang Fan ◽

Liping Zhu

Keyword(s):

Dimension Reduction ◽

Sufficient Dimension Reduction ◽

Controlling Variables

An Anomaly-Based IDS Framework Using Centroid-Based Classification

Symmetry ◽

10.3390/sym14010105 ◽

2022 ◽

Vol 14 (1) ◽

pp. 105

Author(s):

Iuon-Chang Lin ◽

Ching-Chun Chang ◽

Chih-Hsiang Peng

Keyword(s):

Information Systems ◽

False Alarm ◽

Dimension Reduction ◽

False Alarm Rate ◽

Detection Rate ◽

The Internet ◽

Urgent Problem ◽

Accuracy Rate

Botnet is an urgent problem that will reduce the security and availability of the network. When the bot master launches attacks to certain victims, the infected users are awakened, and attacks start according to the commands from the bot master. Via Botnet, DDoS is an attack whose purpose is to paralyze the victim’s service. In all kinds of DDoS, SYN flood is still a problem that reduces security and availability. To enhance the security of the Internet, IDS is proposed to detect attacks and protect the server. In this paper, the concept of centroid-based classification is used to enhance performance of the framework. An anomaly-based IDS framework which combines K-means and KNN is proposed to detect SYN flood. Dimension reduction is designed to achieve visualization, and weights can adjust the occupancy ratio of each sub-feature. Therefore, this framework is also suitable for use on the modern symmetry or asymmetry architecture of information systems. With the detection by the framework proposed in this paper, the detection rate is 96.8 percent, the accuracy rate is 97.3 percent, and the false alarm rate is 1.37 percent.

Programming with equadratures: an open-source package for uncertainty quantification, dimension reduction, and optimisation

10.2514/6.2022-2108 ◽

2022 ◽

Author(s):

Ashley D. Scillitoe ◽

Chun Yui Wong ◽

James C. Gross ◽

Irene Virdis ◽

Bryn N. Ubald ◽

...

Keyword(s):

Dimension Reduction ◽

Uncertainty Quantification ◽

Open Source

Regularizing invertible neural networks for airfoil design through dimension reduction

10.2514/6.2022-1098 ◽

2022 ◽

Author(s):

Andrew Glaws ◽

Jeffrey Hokanson ◽

Ryan King ◽

Ganesh Vijayakumar

Keyword(s):

Neural Networks ◽

Dimension Reduction

dimension reduction
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Robust Inference of Conditional Average Treatment Effects Using Dimension Reduction

A Minimum Discrepancy Approach With Fourier Transform in Sufficient Dimension Reduction

Distributed Sufficient Dimension Reduction for Heterogeneous Massive Data

Dictionary learning allows model-free pseudotime estimation of transcriptomic data

Model Averaging Based Sufficient Dimension Reduction

Explainable t-SNE for single-cell RNA-seq data analysis

Sufficient dimension reduction in the presence of controlling variables

An Anomaly-Based IDS Framework Using Centroid-Based Classification

Programming with equadratures: an open-source package for uncertainty quantification, dimension reduction, and optimisation

Regularizing invertible neural networks for airfoil design through dimension reduction

Export Citation Format

dimension reductionRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Robust Inference of Conditional Average Treatment Effects Using Dimension Reduction

A Minimum Discrepancy Approach With Fourier Transform in Sufficient Dimension Reduction

Distributed Sufficient Dimension Reduction for Heterogeneous Massive Data

Dictionary learning allows model-free pseudotime estimation of transcriptomic data

Model Averaging Based Sufficient Dimension Reduction

Explainable t-SNE for single-cell RNA-seq data analysis

Sufficient dimension reduction in the presence of controlling variables

An Anomaly-Based IDS Framework Using Centroid-Based Classification

Programming with equadratures: an open-source package for uncertainty quantification, dimension reduction, and optimisation

Regularizing invertible neural networks for airfoil design through dimension reduction

dimension reduction
Recently Published Documents