EFMlrs: a Python package for elementary flux mode enumeration via lexicographic reverse search

COBREXA.jl is a Julia package for scalable, high-performance constraint-based reconstruction and analysis of very large-scale biological models. Its primary purpose is to facilitate the integration of modern high performance computing environments with the processing and analysis of large-scale metabolic models of challenging complexity. We report the architecture of the package, and demonstrate its scalability by benchmarking the task distribution overhead incurred when performing analyses of many variants of multi-organism community models simultaneously.

Download Full-text

CGAT-core: a python framework for building scalable, reproducible computational biology workflows

F1000Research ◽

10.12688/f1000research.18674.1 ◽

2019 ◽

Vol 8 ◽

pp. 377 ◽

Cited By ~ 2

Author(s):

Adam P. Cribbs ◽

Sebastian Luna-Valero ◽

Charlotte George ◽

Ian M. Sudbery ◽

Antonio J. Berlanga-Taylor ◽

...

Keyword(s):

Computational Biology ◽

High Performance Computing ◽

High Performance ◽

Large Data ◽

Database Integration ◽

Scientific Rigour ◽

Rapid Construction ◽

Rnaseq Data ◽

Performance Computing ◽

Python Package

In the genomics era computational biologists regularly need to process, analyse and integrate large and complex biomedical datasets. Analysis inevitably involves multiple dependent steps, resulting in complex pipelines or workflows, often with several branches. Large data volumes mean that processing needs to be quick and efficient and scientific rigour requires that analysis be consistent and fully reproducible. We have developed CGAT-core, a python package for the rapid construction of complex computational workflows. CGAT-core seamlessly handles parallelisation across high performance computing clusters, integration of Conda environments, full parameterisation, database integration and logging. To illustrate our workflow framework, we present a pipeline for the analysis of RNAseq data using pseudo-alignment.

Download Full-text

CGAT-core: a python framework for building scalable, reproducible computational biology workflows

10.1101/581009 ◽

2019 ◽

Author(s):

Adam Cribbs ◽

Sebastian Luna-Valero ◽

Charlotte George ◽

Ian M Sudbery ◽

Antonio J Berlanga-Taylor ◽

...

Keyword(s):

Computational Biology ◽

High Performance Computing ◽

High Performance ◽

Large Data ◽

Database Integration ◽

Scientific Rigour ◽

Rapid Construction ◽

Rnaseq Data ◽

Performance Computing ◽

Python Package

In the genomics era computational biologists regularly need to process, analyse and integrate large and complex biomedical datasets. Analysis inevitably involves multiple dependent steps, resulting in complex pipelines or workflows, often with several branches. Large data volumes mean that processing needs to be quick and efficient and scientific rigour requires that analysis be consistent and fully reproducible. We have developed CGAT-core, a python package for the rapid construction of complex computational workflows. CGAT-core seamlessly handles parallelisation across high performance computing clusters, integration of Conda environments, full parameterisation, database integration and logging. To illustrate our workflow framework, we present a pipeline for the analysis of RNAseq data using pseudo-alignment.

Download Full-text

CGAT-core: a python framework for building scalable, reproducible computational biology workflows

F1000Research ◽

10.12688/f1000research.18674.2 ◽

2019 ◽

Vol 8 ◽

pp. 377 ◽

Cited By ~ 3

Author(s):

Adam P. Cribbs ◽

Sebastian Luna-Valero ◽

Charlotte George ◽

Ian M. Sudbery ◽

Antonio J. Berlanga-Taylor ◽

...

Keyword(s):

Computational Biology ◽

High Performance Computing ◽

High Performance ◽

Large Data ◽

Database Integration ◽

Scientific Rigour ◽

Rapid Construction ◽

Rnaseq Data ◽

Performance Computing ◽

Python Package

In the genomics era computational biologists regularly need to process, analyse and integrate large and complex biomedical datasets. Analysis inevitably involves multiple dependent steps, resulting in complex pipelines or workflows, often with several branches. Large data volumes mean that processing needs to be quick and efficient and scientific rigour requires that analysis be consistent and fully reproducible. We have developed CGAT-core, a python package for the rapid construction of complex computational workflows. CGAT-core seamlessly handles parallelisation across high performance computing clusters, integration of Conda environments, full parameterisation, database integration and logging. To illustrate our workflow framework, we present a pipeline for the analysis of RNAseq data using pseudo-alignment.

Download Full-text

ipyrad: Interactive assembly and analysis of RADseq datasets

Bioinformatics ◽

10.1093/bioinformatics/btz966 ◽

2020 ◽

Vol 36 (8) ◽

pp. 2592-2594 ◽

Cited By ~ 44

Author(s):

Deren A R Eaton ◽

Isaac Overcast

Keyword(s):

Open Source ◽

High Performance ◽

De Novo ◽

Application Programming Interface ◽

Analysis Tools ◽

Source Program ◽

Application Programming ◽

Downstream Analysis ◽

Performance Computing ◽

Python Package

Abstract Summary ipyrad is a free and open source tool for assembling and analyzing restriction site-associated DNA sequence datasets using de novo and/or reference-based approaches. It is designed to be massively scalable to hundreds of taxa and thousands of samples, and can be efficiently parallelized on high performance computing clusters. It is available both as a command line interface and as a Python package with an application programming interface, the latter of which can be used interactively to write complex, reproducible scripts and implement a suite of downstream analysis tools. Availability and implementation ipyrad is a free and open source program written in Python. Source code is available from the GitHub repository (https://github.com/dereneaton/ipyrad/), and Linux and MacOS installs are distributed through the conda package manager. Complete documentation, including numerous tutorials, and Jupyter notebooks demonstrating example assemblies and applications of downstream analysis tools are available online: https://ipyrad.readthedocs.io/.

Download Full-text

Application of high-performance computing to the reconstruction, analysis, and optimization of genome-scale metabolic models

Journal of Physics Conference Series ◽

10.1088/1742-6596/180/1/012025 ◽

2009 ◽

Vol 180 ◽

pp. 012025 ◽

Cited By ~ 8

Author(s):

Christopher S Henry ◽

Fangfang Xia ◽

Rick Stevens

Keyword(s):

High Performance Computing ◽

High Performance ◽

Metabolic Models ◽

Genome Scale ◽

Performance Computing

Download Full-text

Simulation of Multilayer Shallow Water Fluid Flow Using Lattice Boltzmann Modeling and High Performance Computing

World Environmental and Water Resources Congress 2009 ◽

10.1061/41036(342)282 ◽

2009 ◽

Author(s):

K. R. Tubbs ◽

F. T. -C. Tsai

Keyword(s):

Fluid Flow ◽

Shallow Water ◽

High Performance Computing ◽

Lattice Boltzmann ◽

High Performance ◽

Lattice Boltzmann Modeling ◽

Performance Computing

Download Full-text

High performance computing on graphics processing units

Pollack Periodica ◽

10.1556/pollack.3.2008.2.3 ◽

2008 ◽

Vol 3 (2) ◽

pp. 27-34 ◽

Cited By ~ 2

Author(s):

Balázs Tukora ◽

Tibor Szalay

Keyword(s):

High Performance Computing ◽

Graphics Processing Units ◽

High Performance ◽

Graphics Processing ◽

Performance Computing

Download Full-text

The Recent Revolution in High Performance Computing

MRS Bulletin ◽

10.1557/s0883769400034096 ◽

1997 ◽

Vol 22 (10) ◽

pp. 5-6

Author(s):

Horst D. Simon

Keyword(s):

High Performance Computing ◽

High Performance ◽

New Technologies ◽

New Technology ◽

Parallel Architecture ◽

Time Frame ◽

Good News ◽

Computing Industry ◽

Time And Energy ◽

Performance Computing

Recent events in the high-performance computing industry have concerned scientists and the general public regarding a crisis or a lack of leadership in the field. That concern is understandable considering the industry's history from 1993 to 1996. Cray Research, the historic leader in supercomputing technology, was unable to survive financially as an independent company and was acquired by Silicon Graphics. Two ambitious new companies that introduced new technologies in the late 1980s and early 1990s—Thinking Machines and Kendall Square Research—were commercial failures and went out of business. And Intel, which introduced its Paragon supercomputer in 1994, discontinued production only two years later.During the same time frame, scientists who had finished the laborious task of writing scientific codes to run on vector parallel supercomputers learned that those codes would have to be rewritten if they were to run on the next-generation, highly parallel architecture. Scientists who are not yet involved in high-performance computing are understandably hesitant about committing their time and energy to such an apparently unstable enterprise.However, beneath the commercial chaos of the last several years, a technological revolution has been occurring. The good news is that the revolution is over, leading to five to ten years of predictable stability, steady improvements in system performance, and increased productivity for scientific applications. It is time for scientists who were sitting on the fence to jump in and reap the benefits of the new technology.

Download Full-text

High Performance Computing in Parallel Electromagnetics Simulation Code suite ACE3P

2020 International Applied Computational Electromagnetics Society Symposium (ACES) ◽

10.23919/aces49320.2020.9196167 ◽

2020 ◽

Author(s):

Lixin Ge ◽

Zenghai Li ◽

Cho-Kuen Ng ◽

Liling Xiao

Keyword(s):

High Performance Computing ◽

High Performance ◽

Simulation Code ◽

Performance Computing

Download Full-text