high throughput data Latest Research Papers

Collection of high-throughput data has become prevalent in biology. Large datasets allow the use of statistical constructs such as binning and linear regression to quantify relationships between variables and hypothesize underlying biological mechanisms based on it. We discuss several such examples in relation to single-cell data and cellular growth. In particular, we show instances where what appears to be ordinary use of these statistical methods leads to incorrect conclusions such as growth being non-exponential as opposed to exponential and vice versa. We propose that the data analysis and its interpretation should be done in the context of a generative model, if possible. In this way, the statistical methods can be validated either analytically or against synthetic data generated via the use of the model, leading to a consistent method for inferring biological mechanisms from data. On applying the validated methods of data analysis to infer cellular growth on our experimental data, we find the growth of length in E. coli to be non-exponential. Our analysis shows that in the later stages of the cell cycle the growth rate is faster than exponential.

Download Full-text

Unraveling molecular mechanism underlying biomaterial and stem cells interaction during cell fate commitment using high throughput data analysis

Gene ◽

10.1016/j.gene.2021.146111 ◽

2021 ◽

pp. 146111

Author(s):

Erfan Sharifi ◽

Niusha Khazaei ◽

Nicholas W. Kieran ◽

Sahel Jahangiri Esfahani ◽

Abdulshakour Mohammadnia ◽

...

Keyword(s):

Stem Cells ◽

Data Analysis ◽

Molecular Mechanism ◽

Cell Fate ◽

High Throughput ◽

High Throughput Data ◽

High Throughput Data Analysis

Download Full-text

High Throughput Data-Based, Toxicity Pathway-Oriented Development of a Quantitative Adverse Outcome Pathway Network Linking AHR Activation to Lung Damages

Journal of Hazardous Materials ◽

10.1016/j.jhazmat.2021.128041 ◽

2021 ◽

pp. 128041

Author(s):

Yuan Jin ◽

Guangshuai Qi ◽

Yingqing Shou ◽

Daochuan Li ◽

Yuzhen Liu ◽

...

Keyword(s):

High Throughput ◽

Adverse Outcome ◽

Adverse Outcome Pathway ◽

Pathway Network ◽

High Throughput Data ◽

Toxicity Pathway

Download Full-text

Clipper: p-value-free FDR control on high-throughput data from two conditions

Genome Biology ◽

10.1186/s13059-021-02506-9 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Xinzhou Ge ◽

Yiling Elaine Chen ◽

Dongyuan Song ◽

MeiLu McDermott ◽

Kyla Woyshner ◽

...

Keyword(s):

Data Analysis ◽

High Throughput ◽

Biological Data ◽

P Value ◽

High Throughput Data ◽

Statistical Framework ◽

Large Numbers ◽

Biological Data Analysis ◽

General Statistical ◽

Genomic Regions

AbstractHigh-throughput biological data analysis commonly involves identifying features such as genes, genomic regions, and proteins, whose values differ between two conditions, from numerous features measured simultaneously. The most widely used criterion to ensure the analysis reliability is the false discovery rate (FDR), which is primarily controlled based on p-values. However, obtaining valid p-values relies on either reasonable assumptions of data distribution or large numbers of replicates under both conditions. Clipper is a general statistical framework for FDR control without relying on p-values or specific data distributions. Clipper outperforms existing methods for a broad range of applications in high-throughput data analysis.

Download Full-text

Clinical Value for Diagnosis and Prognosis of Signal Sequence Receptor 1 (SSR1) and Its Potential Mechanism in Hepatocellular Carcinoma: A Comprehensive Study Based on High-Throughput Data Analysis

International Journal of General Medicine ◽

10.2147/ijgm.s336725 ◽

2021 ◽

Vol Volume 14 ◽

pp. 7435-7451

Author(s):

Liang Chen ◽

Yunhua Lin ◽

Guoqing Liu ◽

Rubin Xu ◽

Yiming Hu ◽

...

Keyword(s):

Hepatocellular Carcinoma ◽

Data Analysis ◽

High Throughput ◽

Signal Sequence ◽

Potential Mechanism ◽

Clinical Value ◽

High Throughput Data ◽

Diagnosis And Prognosis ◽

High Throughput Data Analysis ◽

Comprehensive Study

Download Full-text

Dr. Sim: Similarity Learning for Transcriptional Phenotypic Drug discovery

10.1101/2021.09.23.461458 ◽

2021 ◽

Author(s):

Zhiting Wei ◽

Sheng Zhu ◽

Xiaohan Chen ◽

Chenyu Zhu ◽

Bin Duan ◽

...

Keyword(s):

Drug Discovery ◽

High Throughput ◽

Great Success ◽

Similarity Learning ◽

Integrated Network ◽

High Throughput Data ◽

Drug Mechanism ◽

Perturbation Data

Transcriptional phenotypic drug discovery has achieved great success, and various compound perturbation-based data resources, such as Connectivity Map (CMap) and Library of Integrated Network-Based Cellular Signatures (LINCS), have been presented. Computational strategies fully mining these resources for phenotypic drug discovery have been proposed, and among them, a fundamental issue is to define the proper similarity between the transcriptional profiles to elucidate the drug mechanism of actions and identify new drug indications. Traditionally, this similarity has been defined in an unsupervised way, and due to the high dimensionality and the existence of high noise in those high-throughput data, it lacks robustness with limited performance. In our study, we present Dr. Sim, which is a general learning-based framework that automatically infers similarity measurement rather than being manually designed and can be used to characterize transcriptional phenotypic profiles for drug discovery with generalized good performance. We evaluated Dr. Sim on comprehensively publicly available in vitro and in vivo datasets in drug annotation and repositioning using high-throughput transcriptional perturbation data and indicated that Dr. Sim significantly outperforms the existing methods and is proved to be a conceptual improvement by learning transcriptional similarity to facilitate the broad utility of high-throughput transcriptional perturbation data for phenotypic drug discovery. The source code and usage of Dr. Sim is available at https://github.com/bm2-lab/DrSim/.

Download Full-text

Novel Approach Combining Transcriptional and Evolutionary Signatures to Identify New Multiciliation Genes

Genes ◽

10.3390/genes12091452 ◽

2021 ◽

Vol 12 (9) ◽

pp. 1452

Author(s):

Audrey Defosset ◽

Dorine Merlat ◽

Laetitia Poidevin ◽

Yannis Nevers ◽

Arnaud Kress ◽

...

Keyword(s):

Large Scale ◽

Protein Interaction Networks ◽

Biological Processes ◽

High Throughput Data ◽

Novel Approach ◽

Phylogenetic Profiles ◽

Multiciliated Cells ◽

Complex Process ◽

The Brain ◽

Large Scale Screening

Multiciliogenesis is a complex process that allows the generation of hundreds of motile cilia on the surface of specialized cells, to create fluid flow across epithelial surfaces. Dysfunction of human multiciliated cells is associated with diseases of the brain, airway and reproductive tracts. Despite recent efforts to characterize the transcriptional events responsible for the differentiation of multiciliated cells, a lot of actors remain to be identified. In this work, we capitalize on the ever-growing quantity of high-throughput data to search for new candidate genes involved in multiciliation. After performing a large-scale screening using 10 transcriptomics datasets dedicated to multiciliation, we established a specific evolutionary signature involving Otomorpha fish to use as a criterion to select the most likely targets. Combining both approaches highlighted a list of 114 potential multiciliated candidates. We characterized these genes first by generating protein interaction networks, which showed various clusters of ciliated and multiciliated genes, and then by computing phylogenetic profiles. In the end, we selected 11 poorly characterized genes that seem like particularly promising multiciliated candidates. By combining functional and comparative genomics methods, we developed a novel type of approach to study biological processes and identify new promising candidates linked to that process.

Download Full-text

Improve consensus partitioning via a hierarchical procedure

10.1101/2021.09.03.458844 ◽

2021 ◽

Author(s):

Zuguang Gu ◽

Daniel Huebschmann

Keyword(s):

Dna Methylation ◽

Data Analysis ◽

High Throughput ◽

R Package ◽

Complete Analysis ◽

High Throughput Data ◽

Unsupervised Method ◽

Large Numbers ◽

New Strategy ◽

High Throughput Data Analysis

Consensus partitioning is an unsupervised method widely used in high throughput data analysis for revealing subgroups and assigns stability for the classification. However, standard consensus partitioning procedures are weak to identify large numbers of stable subgroups. There are two main issues. 1. Subgroups with small differences are difficult to separate if they are simultaneously detected with subgroups with large differences. And 2. stability of classification generally decreases as the number of subgroups increases. In this work, we proposed a new strategy to solve these two issues by applying consensus partitionings in a hierarchical procedure. We demonstrated hierarchical consensus partitioning can be efficient to reveal more subgroups. We also tested the performance of hierarchical consensus partitioning on revealing a great number of subgroups with a DNA methylation dataset. The hierarchical consensus partitioning is implemented in the R package cola with comprehensive functionality for analysis and visualizations. It can also automate the analysis only with a minimum of two lines of code, which generates a detailed HTML report containing the complete analysis.

Download Full-text

high throughput data
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Automatic organ-level point cloud segmentation of maize shoots by integrating high-throughput data acquisition and deep learning

A chemistry‐inspired neural network kinetic model for oxidative coupling of methane from high‐throughput data

Distinguishing different modes of growth using single-cell data

Unraveling molecular mechanism underlying biomaterial and stem cells interaction during cell fate commitment using high throughput data analysis

High Throughput Data-Based, Toxicity Pathway-Oriented Development of a Quantitative Adverse Outcome Pathway Network Linking AHR Activation to Lung Damages

Clipper: p-value-free FDR control on high-throughput data from two conditions

Clinical Value for Diagnosis and Prognosis of Signal Sequence Receptor 1 (SSR1) and Its Potential Mechanism in Hepatocellular Carcinoma: A Comprehensive Study Based on High-Throughput Data Analysis

Dr. Sim: Similarity Learning for Transcriptional Phenotypic Drug discovery

Novel Approach Combining Transcriptional and Evolutionary Signatures to Identify New Multiciliation Genes

Improve consensus partitioning via a hierarchical procedure

Export Citation Format

high throughput dataRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Automatic organ-level point cloud segmentation of maize shoots by integrating high-throughput data acquisition and deep learning

A chemistry‐inspired neural network kinetic model for oxidative coupling of methane from high‐throughput data

Distinguishing different modes of growth using single-cell data

Unraveling molecular mechanism underlying biomaterial and stem cells interaction during cell fate commitment using high throughput data analysis

High Throughput Data-Based, Toxicity Pathway-Oriented Development of a Quantitative Adverse Outcome Pathway Network Linking AHR Activation to Lung Damages

Clipper: p-value-free FDR control on high-throughput data from two conditions

Clinical Value for Diagnosis and Prognosis of Signal Sequence Receptor 1 (SSR1) and Its Potential Mechanism in Hepatocellular Carcinoma: A Comprehensive Study Based on High-Throughput Data Analysis

Dr. Sim: Similarity Learning for Transcriptional Phenotypic Drug discovery

Novel Approach Combining Transcriptional and Evolutionary Signatures to Identify New Multiciliation Genes

Improve consensus partitioning via a hierarchical procedure

high throughput data
Recently Published Documents