workflow manager
Recently Published Documents


TOTAL DOCUMENTS

16
(FIVE YEARS 6)

H-INDEX

2
(FIVE YEARS 1)

2021 ◽  
Author(s):  
Nathalie Lehmann ◽  
Sandrine Perrin ◽  
Claire Wallon ◽  
Xavier Bauquet ◽  
Vivien Deshaies ◽  
...  

Motivation: Core sequencing facilities produce huge amounts of sequencing data that need to be analysed with automated workflows to ensure reproducibility and traceability. Eoulsan is a versatile open-source workflow engine meeting the needs of core facilities, by automating the analysis of a large number of samples. Its core design separates the description of the workflow from the actual commands to be run. This originality simplifies its usage as the user does not need to handle code, while ensuring reproducibility. Eoulsan was initially developed for bulk RNA-seq data, but the transcriptomics applications have recently widened with the advent of long-read sequencing and single-cell technologies, calling for the development of new workflows. Result: We present Eoulsan 2, a major update that (i) enhances the workflow manager itself, (ii) facilitates the development of new modules, and (iii) expands its applications to long reads RNA-seq (Oxford Nanopore Technologies) and scRNA-seq (Smart-seq2 and 10x Genomics). The workflow manager has been rewritten, with support for execution on a larger choice of computational infrastructure (workstations, Hadoop clusters, and various job schedulers for cluster usage). Eoulsan now facilitates the development of new modules, by reusing wrappers developed for the Galaxy platform, with support for container images (Docker or Singularity) packaging tools to execute. Finally, Eoulsan natively integrates novel modules for bulk RNA-seq, as well as others specifically designed for processing long read RNA-seq and scRNA-seq. Eoulsan 2 is distributed with ready-to-use workflows and companion tutorials. Availability and implementation: Eoulsan is implemented in Java, supported on Linux systems and distributed under the LGPL and CeCILL-C licenses at: http://outils.genomique.biologie.ens.fr/eoulsan/. The source code and sample workflows are available on GitHub: https://github.com/GenomicParisCentre/eoulsan. A GitHub repository for modules using the Galaxy tool XML syntax is further provided at: https://github.com/GenomicParisCentre/eoulsan-tools


2020 ◽  
Author(s):  
Mike Marquet ◽  
Martin Hölzer ◽  
Mathias W. Pletz ◽  
Adrian Viehweger ◽  
Oliwia Makarewicz ◽  
...  

AbstractPhages are among the most abundant and diverse biological entities on earth. Identification from sequence data is a crucial first step to understand their impact on the environment. A variety of bacteriophage identification tools have been developed over the years. They differ in algorithmic approach, results and ease of use. We, therefore, developed “What the Phage” (WtP), an easy-to-use and parallel multitool approach for phage identification combined with an annotation and classification downstream strategy, thus, supporting the user’s decision-making process when the phage identification tools are not in agreement to each other. WtP is reproducible and scales to thousands of datasets through the use of a workflow manager (Nextflow). WtP is freely available under a GPL-3.0 license (https://github.com/replikation/What_the_Phage).


2019 ◽  
Author(s):  
Nathan Sheffield

Reproducible computing environments are required for reproducible analysis and are also useful for interactive exploratory analysis. In the past, scientific computing environments have been managed with package managers or with virtual machines. More recently, modern workflow managers have incorporated linux container technology to improve reproducibility. These existing solutions solve some of the challenges of managing reproducible computing environments, but they remain limited: System-wide environments and native package managers lack the portability, efficiency, and robustness of modern containers, while container-aware workflows are specialized and incapable of interactive computing. Here, I introduce *bulker*, an approach that combines the advantages of virtual machines, native package managers, and container-aware workflow managers. Bulker creates and distributes complete environments, like virtual machines, but with re-usable modular components, like a native package manager. It uses individually containerized tools like a container-aware workflow manager, but also allows these environments to be activated interactively and distributed independently of a particular workflow. Bulker is thus a more general approach to portable and reproducible computing environments.


2019 ◽  
Vol 15 (7) ◽  
pp. 155014771986220
Author(s):  
Youngkuk Kim ◽  
Siwoon Son ◽  
Yang-Sae Moon

In this article, we address dynamic workflow management for sampling and filtering data streams in Apache Storm. As many sensors generate data streams continuously, we often use sampling to choose some representative data or filtering to remove unnecessary data. Apache Storm is a real-time distributed processing platform suitable for handling large data streams. Storm, however, must stop the entire work when it changes the input data structure or processing algorithm as it needs to modify, redistribute, and restart the programs. In addition, for effective data processing, we often use Storm with Kafka and databases, but it is difficult to use these platforms in an integrated manner. In this article, we derive the problems when applying sampling and filtering algorithms to Storm and propose a dynamic workflow management model that solves these problems. First, we present the concept of a plan consisting of input, processing, and output modules of a data stream. Second, we propose Storm Plan Manager, which can operate Storm, Kafka, and database as a single integrated system. Storm Plan Manager is an integrated workflow manager that dynamically controls sampling and filtering of data streams through plans. Third, as a key feature, Storm Plan Manager provides a Web client interface to visually create, execute, and monitor plans. In this article, we show the usefulness of the proposed Storm Plan Manager by presenting its design, implementation, and experimental results in order.


2019 ◽  
Vol 35 (19) ◽  
pp. 3875-3876 ◽  
Author(s):  
Jan Kožusznik ◽  
Petr Bainar ◽  
Jana Klímová ◽  
Michal Krumnikl ◽  
Pavel Moravec ◽  
...  

Abstract Summary Here we introduce a Fiji plugin utilizing the HPC-as-a-Service concept, significantly mitigating the challenges life scientists face when delegating complex data-intensive processing workflows to HPC clusters. We demonstrate on a common Selective Plane Illumination Microscopy image processing task that execution of a Fiji workflow on a remote supercomputer leads to improved turnaround time despite the data transfer overhead. The plugin allows the end users to conveniently transfer image data to remote HPC resources, manage pipeline jobs and visualize processed results directly from the Fiji graphical user interface. Availability and implementation The code is distributed free and open source under the MIT license. Source code: https://github.com/fiji-hpc/hpc-workflow-manager/, documentation: https://imagej.net/SPIM_Workflow_Manager_For_HPC. Supplementary information Supplementary data are available at Bioinformatics online.


2019 ◽  
pp. 263-278
Author(s):  
Vlad Catrinescu ◽  
Trevor Seward
Keyword(s):  

2018 ◽  
Vol 24 (S2) ◽  
pp. 126-127
Author(s):  
Mirna Lerotic ◽  
Ian McNulty ◽  
Martin V. Holt

2018 ◽  
Vol 175 ◽  
pp. 09009 ◽  
Author(s):  
Venkitesh Ayyar ◽  
Daniel C. Hackett ◽  
William I. Jay ◽  
Ethan T. Neil

The process of generating ensembles of gauge configurations (and measuring various observables over them) can be tedious and error-prone when done “by hand”. In practice, most of this procedure can be automated with the use of a workflow manager. We discuss how this automation can be accomplished using Taxi, a minimal Python-based workflow manager built for generating lattice data. We present a case study demonstrating this technology.


2016 ◽  
pp. 203-215
Author(s):  
Vlad Catrinescu ◽  
Trevor Seward
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document