DigestiFlow - reproducible demultiplexing for the single cell era

Digestiflow: from BCL to FASTQ with ease

10.7287/peerj.preprints.27717v4 ◽

2019 ◽

Author(s):

Manuel Holtgrewe ◽

Mikko Nieminen ◽

Clemens Messerschmidt ◽

Dieter Beule

Keyword(s):

Quality Control ◽

Software Package ◽

Flow Cell ◽

Automated Extraction ◽

Sequencing Data ◽

Report Generation ◽

Raw Data ◽

Cell Sample ◽

Cell Data

Management of raw sequencing data and its preprocessing (conversion into sequences and demultiplexing) remains a challenging topic for groups running sequencing devices. They face many challenges in such efforts, and solutions range from manual management of spreadsheets to very complex and customized LIMS handling much more than just raw sequencing data. In this manuscript, we describe the software package DigestiFlow, which focuses on the management of Illumina flow cell sample sheets and raw data. It allows for automated extraction of information from flow cell data and management of sample sheets. Furthermore, it allows for the automated and reproducible conversion of Illumina base calls to sequences and their demultiplexing using bcl2fastq and Picard Tools, followed by quality control report generation.

Digestiflow: from BCL to FASTQ with ease

Bioinformatics ◽

10.1093/bioinformatics/btz850 ◽

2019 ◽

Author(s):

Manuel Holtgrewe ◽

Clemens Messerschmidt ◽

Mikko Nieminen ◽

Dieter Beule

Keyword(s):

Software Package ◽

Flow Cell ◽

Supplementary Information ◽

Software Components ◽

Sequencing Data ◽

Report Generation ◽

Raw Data ◽

Client Software ◽

Cell Sample ◽

Cell Data

Summary: Management of raw sequencing data and its preprocessing (conversion into sequences and demultiplexing) remains a challenging topic for groups running sequencing devices. They face many challenges in such efforts, and solutions range from manual management of spreadsheets to very complex and customized LIMS handling much more than just raw sequencing data. In this manuscript, we describe the software package DigestiFlow, which focuses on the management of Illumina flow cell sample sheets and raw data. It allows for automated extraction of information from flow cell data and management of sample sheets. Furthermore, it allows for the automated and reproducible conversion of Illumina base calls to sequences and their demultiplexing using bcl2fastq and Picard Tools, followed by quality control report generation. Availability and Implementation: The software is available under the MIT license at https://github.com/bihealth/digestiflow-server. The client software components are available via Bioconda. Supplementary information: Supplementary data are available at Bioinformatics online.

Digestiflow: from BCL to FASTQ with ease

10.7287/peerj.preprints.27717 ◽

2019 ◽

Author(s):

Manuel Holtgrewe ◽

Mikko Nieminen ◽

Clemens Messerschmidt ◽

Dieter Beule

Keyword(s):

Quality Control ◽

Software Package ◽

Flow Cell ◽

Automated Extraction ◽

Sequencing Data ◽

Report Generation ◽

Raw Data ◽

Cell Sample ◽

Cell Data

Management of raw sequencing data and its preprocessing (conversion into sequences and demultiplexing) remains a challenging topic for groups running sequencing devices. They face many challenges in such efforts, and solutions range from manual management of spreadsheets to very complex and customized LIMS handling much more than just raw sequencing data. In this manuscript, we describe the software package DigestiFlow, which focuses on the management of Illumina flow cell sample sheets and raw data. It allows for automated extraction of information from flow cell data and management of sample sheets. Furthermore, it allows for the automated and reproducible conversion of Illumina base calls to sequences and their demultiplexing using bcl2fastq and Picard Tools, followed by quality control report generation.

Connectome: computation and visualization of cell-cell signaling topologies in single-cell systems data

10.1101/2021.01.21.427529 ◽

2021 ◽

Author(s):

Micha Sam Brickman Raredon ◽

Junchen Yang ◽

James Garritano ◽

Meng Wang ◽

Dan Kushnir ◽

...

Keyword(s):

Cell Signaling ◽

Single Cell ◽

Rna Sequencing ◽

Software Package ◽

Sequencing Data ◽

Single Cell Rna Sequencing ◽

Rapid Calculation ◽

Connectivity Patterns ◽

Network Topologies ◽

Cell Cell

Single-cell RNA-sequencing data can revolutionize our understanding of the patterns of cell-cell and ligand-receptor connectivity that influence the function of tissues and organs. However, the quantification and visualization of these patterns are major computational and epistemological challenges. Here, we present Connectome, a software package for R that facilitates rapid calculation and interactive exploration of cell-cell signaling network topologies contained in single-cell RNA-sequencing data. Connectome can be used with any reference set of known ligand-receptor mechanisms. It has built-in functionality to facilitate differential and comparative connectomics, in which complete mechanistic networks are quantitatively compared between systems. Connectome includes computational and graphical tools designed to analyze and explore cell-cell connectivity patterns across disparate single-cell datasets. We present approaches to quantify these topologies and discuss some of the biological theory leading to their design.

Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors

Algorithms for Molecular Biology ◽

10.1186/s13015-021-00194-5 ◽

2021 ◽

Vol 16 (1) ◽

Author(s):

Leah L. Weber ◽

Mohammed El-Kebir

Keyword(s):

Dna Sequencing ◽

Single Cell ◽

Evolutionary Process ◽

Treatment Decision ◽

Real Data ◽

Current Data ◽

Fast Method ◽

Sequencing Data ◽

Evolutionary Trajectory ◽

Cancer Types

Background: Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, classifying a tumor’s evolutionary process as either linear or branched and understanding which cancer types and which patients have each of these trajectories could provide useful insights for both clinicians and researchers. While comprehensive cancer phylogeny inference from single-cell DNA sequencing data is challenging due to limitations of current sequencing technology and the complexity of the resulting problem, current data might provide sufficient signal to accurately classify a tumor’s evolutionary history as either linear or branched. Results: We introduce the Linear Perfect Phylogeny Flipping (LPPF) problem as a means of testing two alternative hypotheses for the pattern of evolution, and we prove that this problem is NP-hard. We develop Phyolin, which uses constraint programming to solve the LPPF problem. Through both in silico experiments and real data application, we demonstrate the performance of our method, which outperforms a competing machine learning approach. Conclusion: Phyolin is an accurate, easy-to-use and fast method for classifying an evolutionary trajectory as linear or branched given a tumor’s single-cell DNA sequencing data.

Mixed Distribution Models Based on Single-Cell RNA Sequencing Data

Interdisciplinary Sciences Computational Life Sciences ◽

10.1007/s12539-021-00427-6 ◽

2021 ◽

Author(s):

Min Wu ◽

Junhua Xu ◽

Tao Ding ◽

Jie Gao

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Sequencing Data ◽

Distribution Models ◽

Mixed Distribution ◽

Single Cell Rna Sequencing

IMMU-27. SINGLE CELL RNA-SEQUENCING IDENTIFIES NOVEL BONE MARROW DERIVED MYELOID CELLS IN GLIOBLASTOMA ASSOCIATED WITH TUMOR AGGRESSION

Neuro-Oncology ◽

10.1093/neuonc/noaa215.457 ◽

2020 ◽

Vol 22 (Supplement_2) ◽

pp. ii110-ii110

Author(s):

Christina Jackson ◽

Christopher Cherry ◽

Sadhana Bom ◽

Hao Zhang ◽

John Choi ◽

...

Keyword(s):

Bone Marrow ◽

Single Cell ◽

Tumor Cells ◽

Rna Sequencing ◽

Metabolic Pathways ◽

Myeloid Cells ◽

Tumor Grade ◽

Sequencing Data ◽

Single Cell Rna Sequencing ◽

Two Populations

BACKGROUND: Glioma-associated myeloid cells (GAMs) can be induced to adopt an immunosuppressive phenotype that can lead to inhibition of anti-tumor responses in glioblastoma (GBM). Understanding the composition and phenotypes of GAMs is essential to modulating the myeloid compartment as a therapeutic adjunct to improve anti-tumor immune response. METHODS: We performed single-cell RNA-sequencing (sc-RNAseq) of 435,400 myeloid and tumor cells to identify transcriptomic and phenotypic differences in GAMs across glioma grades. We further correlated the heterogeneity of the GAM landscape with tumor cell transcriptomics to investigate interactions between GAMs and tumor cells. RESULTS: sc-RNAseq revealed a diverse landscape of myeloid-lineage cells in gliomas, with an increasing preponderance of bone marrow-derived myeloid cells (BMDMs) with increasing tumor grade. We identified two populations of BMDMs unique to GBMs: Mac-1 and Mac-2. Mac-1 demonstrates upregulation of an immature myeloid gene signature and altered metabolic pathways. Mac-2 is characterized by expression of the scavenger receptor MARCO. Pseudotime and RNA velocity analysis revealed the ability of Mac-1 to transition and differentiate to Mac-2 and other GAM subtypes. We further found that the presence of these two populations of BMDMs is associated with the presence of tumor cells with stem cell and mesenchymal features. Bulk RNA-sequencing data demonstrate that gene signatures of these populations are associated with worse survival in GBM. CONCLUSION: We used sc-RNAseq to identify a novel population of immature BMDMs that is associated with higher glioma grades. This population exhibited altered metabolic pathways and stem-like potential to differentiate into other GAM populations, including GAMs with upregulation of immunosuppressive pathways. Our results elucidate unique interactions between BMDMs and GBM tumor cells that potentially drive GBM progression and the more aggressive mesenchymal subtype. Our discovery of these novel BMDMs has implications for new therapeutic targets to improve the efficacy of immune-based therapies in GBM.

Software Benchmark—Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data

Microbiology Research ◽

10.3390/microbiolres12020022 ◽

2021 ◽

Vol 12 (2) ◽

pp. 317-334

Author(s):

Omar Alaqeeli ◽

Li Xing ◽

Xuekui Zhang

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Classification Tree ◽

Area Under The Curve ◽

Data Sets ◽

Sequencing Data ◽

Single Cell Rna Sequencing ◽

Tree Algorithms ◽

R Packages

The classification tree is a widely used machine learning method. It has multiple implementations as R packages: rpart, ctree, evtree, tree and C5.0. The details of these implementations are not the same, and hence their performance differs from one application to another. We are interested in their performance in the classification of cells using single-cell RNA-sequencing data. In this paper, we conducted a benchmark study using 22 single-cell RNA-sequencing data sets. Using cross-validation, we compared the packages’ prediction performance based on their Precision, Recall, F1-score, and Area Under the Curve (AUC). We also compared the Complexity and Run-time of these R packages. Our study shows that rpart and evtree have the best Precision; evtree is the best in Recall, F1-score and AUC; C5.0 prefers more complex trees; and tree is consistently much faster than the others, although its complexity is often higher than theirs.

Single-cell transcriptomic analysis of mIHC images via antigen mapping

Science Advances ◽

10.1126/sciadv.abc5464 ◽

2021 ◽

Vol 7 (10) ◽

pp. eabc5464

Author(s):

Kiya W. Govek ◽

Emma C. Troisi ◽

Zhen Miao ◽

Rachael G. Aubin ◽

Steven Woodhouse ◽

...

Keyword(s):

Single Cell ◽

Spatial Patterns ◽

Cell Types ◽

Level Of Detail ◽

Cell Populations ◽

Sequencing Data ◽

Spatially Resolved ◽

Murine Spleen ◽

Single Cell Rna Sequencing ◽

Antibody Panel

Highly multiplexed immunohistochemistry (mIHC) enables the staining and quantification of dozens of antigens in a tissue section with single-cell resolution. However, annotating cell populations that differ little in the profiled antigens or for which the antibody panel does not include specific markers is challenging. To overcome this obstacle, we have developed an approach for enriching mIHC images with single-cell RNA sequencing data, building upon recent experimental procedures for augmenting single-cell transcriptomes with concurrent antigen measurements. Spatially-resolved Transcriptomics via Epitope Anchoring (STvEA) performs transcriptome-guided annotation of highly multiplexed cytometry datasets. It increases the level of detail in histological analyses by enabling the systematic annotation of nuanced cell populations, spatial patterns of transcription, and interactions between cell types. We demonstrate the utility of STvEA by uncovering the architecture of poorly characterized cell types in the murine spleen using published cytometry and mIHC data of this organ.

Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data

Biometrics ◽

10.1111/biom.13457 ◽

2021 ◽

Author(s):

Zhen Yang ◽

Yen‐Yi Ho

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Count Data ◽

Sequencing Data ◽

Dynamic Correlation ◽

Single Cell Rna Sequencing
