Ensemble Feature Selection for Single Cell Chromatin Conformation Analysis

2021 ◽  
Author(s):  
Amirreza Rouhi ◽  
Luca Nanni ◽  
Arif Canakoglu ◽  
Pietro Pinoli ◽  
Stefano Ceri
2018 ◽  
Vol 35 (16) ◽  
pp. 2865-2867 ◽  
Author(s):  
Tallulah S Andrews ◽  
Martin Hemberg

Abstract Motivation Most genomes contain thousands of genes, but for most functional responses, only a subset of those genes are relevant. To facilitate many single-cell RNASeq (scRNASeq) analyses the set of genes is often reduced through feature selection, i.e. by removing genes only subject to technical noise. Results We present M3Drop, an R package that implements popular existing feature selection methods and two novel methods which take advantage of the prevalence of zeros (dropouts) in scRNASeq data to identify features. We show these new methods outperform existing methods on simulated and real datasets. Availability and implementation M3Drop is freely available on github as an R package and is compatible with other popular scRNASeq tools: https://github.com/tallulandrews/M3Drop. Supplementary information Supplementary data are available at Bioinformatics online.


Genes ◽  
2020 ◽  
Vol 12 (1) ◽  
pp. 28
Author(s):  
Shruti Gupta ◽  
Ajay Kumar Verma ◽  
Shandar Ahmad

Single-cell transcriptomics data, when combined with in situ hybridization patterns of specific genes, can help in recovering the spatial information lost during cell isolation. Dialogue for Reverse Engineering Assessments and Methods (DREAM) consortium conducted a crowd-sourced competition known as DREAM Single Cell Transcriptomics Challenge (SCTC) to predict the masked locations of single cells from a set of 60, 40 and 20 genes out of 84 in situ gene patterns known in Drosophila embryo. We applied a genetic algorithm (GA) to predict the most important genes that carry positional and proximity information of the single-cell origins, in combination with the base distance mapping algorithm DistMap. Resulting gene selection was found to perform well and was ranked among top 10 in two of the three sub-challenges. However, the details of the method did not make it to the main challenge publication, due to an intricate aggregation ranking. In this work, we discuss the detailed implementation of GA and its post-challenge parameterization, with a view to identify potential areas where GA-based approaches of gene-set selection for topological association prediction may be improved, to be more effective. We believe this work provides additional insights into the feature-selection strategies and their relevance to single-cell similarity prediction and will form a strong addendum to the recently published work from the consortium.


Author(s):  
Shaoheng Liang ◽  
Vakul Mohanty ◽  
Jinzhuang Dou ◽  
Qi Miao ◽  
Yuefan Huang ◽  
...  

2016 ◽  
Author(s):  
Tallulah S. Andrews ◽  
Martin Hemberg

AbstractFeatures selection is a key step in many single-cell RNASeq (scRNASeq) analyses. Feature selection is intended to preserve biologically relevant information while removing genes only subject to technical noise. As it is frequently performed prior to dimensionality reduction, clustering and pseudotime analyses, feature selection can have a major impact on the results. Several different approaches have been proposed for unsupervised feature selection from unprocessed single-cell expression matrices, most based upon identifying highly variable genes in the dataset. We present two methods which take advantage of the prevalence of zeros (dropouts) in scRNASeq data to identify features. We show that dropout-based feature selection outperforms variance-based feature selection for multiple applications of single-cell RNASeq.


Author(s):  
Shaoheng Liang ◽  
Vakul Mohanty ◽  
Jinzhuang Dou ◽  
Qi Miao ◽  
Yuefan Huang ◽  
...  

2019 ◽  
pp. 389
Author(s):  
زينب عبدالأمير ◽  
علياء كريم عبدالحسن

Sign in / Sign up

Export Citation Format

Share Document