FlyClockbase: Importance of Biological Model Curation for Analyzing Variability in the Circadian Clock of Drosophila melanogaster by Integrating Time Series from 25 Years of Research

AbstractBiological model curation provides new insights by integrating biological knowledge-fragments, assessing their uncertainty, and analyzing the reliability of potential interpretations. Here we integrate published results about circadian clocks in Drosophila melanogaster while exploring economies of scale in biological model curation. Clocks govern rhythms of gene-expression that impact fitness, health, cancer, memory, mental functions, and more. Human clock insights have been repeatedly pioneered in flies. Flies simplify investigating complex gene regulatory networks, which express proteins cyclically using environmentally entrained interlocking feedback loops that act as clocks. Simulations could simplify research further. We found that very few computational models test their quality directly against experimentally observed time series scattered in the literature. We designed FlyClockbase for integrating such scattered data to enable robust efficient access for biologists and modelers. To this end we have been defining data structures that simplify the construction and maintenance of Versioned Biological Information Resources (VBIRs) that prioritize simplicity, openness, and therefore maintainability. We aim to simplify the preservation of more raw data and relevant annotations from experiments in order to multiply the long-term value of wet-lab datasets for modelers interested in meta-analyses, parameter estimates, and hypothesis testing. Currently FlyClockbase contains over 400 wildtype time series of core circadian components systematically curated from 86 studies published between 1990 and 2015. Using FlyClockbase, we show that PERIOD protein amount peak time variance unexpectedly exceeds that of TIMELESS. We hypothesize that PERIOD’s exceedingly more complex phosphorylation rules are responsible. Variances of daily event times are easily confounded by errors. We improved result reliability by a human error analysis of our data handling; this revealed significance-degrading outliers, possibly violating a presumed absence of wildtype heterogeneity or lab evolution. Separate analyses revealed elevated stochasticity in PCR-based peak time variances; yet our reported core difference in peak time variances appears robust. Our study demonstrates how biological model curation enhances the understanding of circadian clocks. It also highlights diverse broader challenges that are likely to become recurrent themes if models in molecular systems biology aim to integrate ‘all relevant knowledge’. We developed a trans-disciplinary workflow, which demonstrates the importance of developing compilers for VBIRs with a more biology-friendly logic that is likely to greatly simplify biological model curation. Curation-limited grand challenges, including personalizing medicine, critically depend on such progress if they are indeed to integrate ‘all relevant knowledge’.General Article SummaryCircadian clocks impact health and fitness by controlling daily rhythms of gene-expression through complex gene-regulatory networks. Deciphering how they work requires experimentally tracking changes in amounts of clock components. We designed FlyClockbase to simplify data-access for biologists and modelers, curating over 400 time series observed in wildtype fruit flies from 25 years of clock research. Substantial biological model curation was essential for identifying differences in peak time variance of the clock-proteins ‘PERIOD’ and ‘TIMELESS’, which probably stem from differences in phosphorylation-network complexity.We repeatedly encountered systemic limitations of contemporary data analysis strategies in our work on circadian clocks. Thus, we used it as an opportunity for composing a panoramic view of the broader challenges in biological model curation, which are likely to increase as biologists aim to integrate all existing expertise in order to address diverse grand challenges. We developed and tested a trans-disciplinary research workflow, which enables biologists and compiler-architects to define biology-friendly compilers for efficiently constructing and maintaining Versioned Biological Information Resources (VBIRs). We report insights gleaned from our practical clock research that are essential for defining a VBIRs infrastructure, which improves the efficiency of biological model curation to the point where it can be democratized.Statement of data availabilityStabilizing Versioned Variant of this file: QQv1r4_2017m07d14_LionBefore final publication FlyClockbase will be at https://github.com/FlyClockbase For review purposes FlyClockbase QQv1r4 will be provided as a zip-archive in the uploaded Supplemental Material; it is also available upon request from L. Loewe.AbbreviationsTable 1: Molecular core clock componentsTable 2: Concepts for organizing FlyClockbaseSupplemental MaterialAppendix: Supplemental Text and Tables (32 pages included in this file, QQv1v4)Supplemental Statistical Analysis (87 pages not included in this file, QQv1v4)R-Script zip file (>12K lines not included in this file, QQv1v4)FlyClockbase zip file (available upon request, QQv1v4)

Download Full-text

Inference of Gene Regulatory Networks Using Time Sliding Comparison and Transcriptional Lagging Time from Time Series Gene Expression Profiles

2007 IEEE 7th International Symposium on BioInformatics and BioEngineering ◽

10.1109/bibe.2007.4375684 ◽

2007 ◽

Author(s):

Sheehyun Kim ◽

Dongsup Kim

Keyword(s):

Gene Expression ◽

Time Series ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Time Series Gene Expression ◽

Gene Regulatory

Download Full-text

Infer Gene Regulatory Networks from Time Series Data with Probabilistic Model Checking

2015 IEEE/ACM 3rd FME Workshop on Formal Methods in Software Engineering ◽

10.1109/formalise.2015.12 ◽

2015 ◽

Cited By ~ 9

Author(s):

Michele Ceccarelli ◽

Luigi Cerulo ◽

Giuseppe De Ruvo ◽

Vittoria Nardone ◽

Antonella Santone

Keyword(s):

Time Series ◽

Model Checking ◽

Gene Regulatory Networks ◽

Probabilistic Model ◽

Regulatory Networks ◽

Time Series Data ◽

Series Data ◽

Probabilistic Model Checking ◽

Gene Regulatory

Download Full-text

Performance evaluation of the time-delayed dynamic Bayesian network approach to inferring gene regulatory networks from time series microarray data

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology - BCB '10 ◽

10.1145/1854776.1854859 ◽

2010 ◽

Author(s):

Haoni Li ◽

Peng Li ◽

Chaoyang Zhang ◽

Nan Wang ◽

Ping Gong ◽

...

Keyword(s):

Time Series ◽

Performance Evaluation ◽

Bayesian Network ◽

Gene Regulatory Networks ◽

Microarray Data ◽

Regulatory Networks ◽

Dynamic Bayesian Network ◽

Network Approach ◽

Gene Regulatory ◽

Time Series Microarray

Download Full-text

Inference of gene regulatory networks based on nonlinear ordinary differential equations

Bioinformatics ◽

10.1093/bioinformatics/btaa032 ◽

2020 ◽

Vol 36 (19) ◽

pp. 4885-4893 ◽

Cited By ~ 2

Author(s):

Baoshan Ma ◽

Mingkun Fang ◽

Xiangtian Jiao

Keyword(s):

Gene Expression ◽

Time Series ◽

Steady State ◽

Differential Equations ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Time Series Data ◽

Series Data ◽

State Data ◽

Gene Regulatory

Abstract Motivation Gene regulatory networks (GRNs) capture the regulatory interactions between genes, resulting from the fundamental biological process of transcription and translation. In some cases, the topology of GRNs is not known, and has to be inferred from gene expression data. Most of the existing GRNs reconstruction algorithms are either applied to time-series data or steady-state data. Although time-series data include more information about the system dynamics, steady-state data imply stability of the underlying regulatory networks. Results In this article, we propose a method for inferring GRNs from time-series and steady-state data jointly. We make use of a non-linear ordinary differential equations framework to model dynamic gene regulation and an importance measurement strategy to infer all putative regulatory links efficiently. The proposed method is evaluated extensively on the artificial DREAM4 dataset and two real gene expression datasets of yeast and Escherichia coli. Based on public benchmark datasets, the proposed method outperforms other popular inference algorithms in terms of overall score. By comparing the performance on the datasets with different scales, the results show that our method still keeps good robustness and accuracy at a low computational complexity. Availability and implementation The proposed method is written in the Python language, and is available at: https://github.com/lab319/GRNs_nonlinear_ODEs Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Reconstruction of extended Petri nets from time series data and its application to signal transduction and to gene regulatory networks

BMC Systems Biology ◽

10.1186/1752-0509-5-113 ◽

2011 ◽

Vol 5 (1) ◽

pp. 113 ◽

Cited By ~ 19

Author(s):

Markus Durzinsky ◽

Annegret Wagler ◽

Wolfgang Marwan

Keyword(s):

Signal Transduction ◽

Time Series ◽

Petri Nets ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Time Series Data ◽

Series Data ◽

Extended Petri Nets ◽

Gene Regulatory

Download Full-text

MODELING NONLINEAR GENE REGULATORY NETWORKS FROM TIME SERIES GENE EXPRESSION DATA

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720008003746 ◽

2008 ◽

Vol 06 (05) ◽

pp. 961-979 ◽

Cited By ~ 15

Author(s):

ANDRÉ FUJITA ◽

JOÃO RICARDO SATO ◽

HUMBERTO MIGUEL GARAY-MALPARTIDA ◽

MARI CLEIDE SOGAYAR ◽

CARLOS EDUARDO FERREIRA ◽

...

Keyword(s):

Gene Expression ◽

Time Series ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Expression Profiles ◽

A Priori ◽

Molecular Networks ◽

Biological Knowledge ◽

Vector Autoregressive ◽

Gene Regulatory

In cells, molecular networks such as gene regulatory networks are the basis of biological complexity. Therefore, gene regulatory networks have become the core of research in systems biology. Understanding the processes underlying the several extracellular regulators, signal transduction, protein–protein interactions, and differential gene expression processes requires detailed molecular description of the protein and gene networks involved. To understand better these complex molecular networks and to infer new regulatory associations, we propose a statistical method based on vector autoregressive models and Granger causality to estimate nonlinear gene regulatory networks from time series microarray data. Most of the models available in the literature assume linearity in the inference of gene connections; moreover, these models do not infer directionality in these connections. Thus, a priori biological knowledge is required. However, in pathological cases, no a priori biological information is available. To overcome these problems, we present the nonlinear vector autoregressive (NVAR) model. We have applied the NVAR model to estimate nonlinear gene regulatory networks based entirely on gene expression profiles obtained from DNA microarray experiments. We show the results obtained by NVAR through several simulations and by the construction of three actual gene regulatory networks (p53, NF-κB, and c-Myc) for HeLa cells.

Download Full-text

Inference of gene regulatory networks from time-series microarray data

Proceedings of the 8th IEEE International NEWCAS Conference 2010 ◽

10.1109/newcas.2010.5603729 ◽

2010 ◽

Cited By ~ 1

Author(s):

Ola ElBakry ◽

M. Omair Ahmad ◽

M. N. S. Swamy

Keyword(s):

Time Series ◽

Gene Regulatory Networks ◽

Microarray Data ◽

Regulatory Networks ◽

Gene Regulatory ◽

Time Series Microarray

Download Full-text

Inference of Gene Regulatory Networks from Time Series Expression Data: A Data Mining Approach

Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06) ◽

10.1109/icdmw.2006.99 ◽

2006 ◽

Cited By ~ 1

Author(s):

Patrick H. Ma ◽

Keith C. Chan

Keyword(s):

Data Mining ◽

Time Series ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Expression Data ◽

Data Mining Approach ◽

Series Expression ◽

Gene Regulatory

Download Full-text

State of the Art of Fuzzy Methods for Gene Regulatory Networks Inference

The Scientific World JOURNAL ◽

10.1155/2015/148010 ◽

2015 ◽

Vol 2015 ◽

pp. 1-11 ◽

Cited By ~ 5

Author(s):

Tuqyah Abdullah Al Qazlan ◽

Aboubekeur Hamdi-Cherif ◽

Chafia Kara-Mohamed

Keyword(s):

Gene Regulatory Networks ◽

Regulatory Networks ◽

Ad Hoc ◽

Cellular Level ◽

Biological Information ◽

Cellular Functions ◽

Fuzzy Methods ◽

Gene Regulatory ◽

Living Organisms ◽

Different Levels

To address one of the most challenging issues at the cellular level, this paper surveys the fuzzy methods used in gene regulatory networks (GRNs) inference. GRNs represent causal relationships between genes that have a direct influence, trough protein production, on the life and the development of living organisms and provide a useful contribution to the understanding of the cellular functions as well as the mechanisms of diseases. Fuzzy systems are based on handling imprecise knowledge, such as biological information. They provide viable computational tools for inferring GRNs from gene expression data, thus contributing to the discovery of gene interactions responsible for specific diseases and/orad hoccorrecting therapies. Increasing computational power and high throughput technologies have provided powerful means to manage these challenging digital ecosystems at different levels from cell to society globally. The main aim of this paper is to report, present, and discuss the main contributions of this multidisciplinary field in a coherent and structured framework.

Download Full-text

Inference of gene regulatory networks from time series by Tsallis entropy

BMC Systems Biology ◽

10.1186/1752-0509-5-61 ◽

2011 ◽

Vol 5 (1) ◽

Cited By ~ 25

Author(s):

Fabrício Martins Lopes ◽

Evaldo A de Oliveira ◽

Roberto M Cesar

Keyword(s):

Time Series ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Tsallis Entropy ◽

Gene Regulatory

Download Full-text