scholarly journals rTASSEL: an R interface to TASSEL for association mapping of complex traits

2020 ◽  
Author(s):  
Brandon Monier ◽  
Terry M. Casstevens ◽  
Edward S. Buckler

AbstractThe need for efficient tools and applications for analyzing genomic diversity is essential for any genetics research program. One such tool, TASSEL (Trait Analysis by aSSociation, Evolution and Linkage), provides many core methods for genomic analyses. Despite its efficiency, TASSEL has limited means for reproducible research and interacting with other analytical tools. Here we present an R package rTASSEL, a front-end to connect to a variety of highly used TASSEL methods and analytical tools. The goal of this package is to create a unified scripting workflow that exploits the analytical prowess of TASSEL in conjunction with R’s popular data handling and parsing capabilities without ever having the user to switch between these two environments.

2021 ◽  
Author(s):  
Rebecca J Bengtsson ◽  
Adam J Simpkin ◽  
Caisey V Pulford ◽  
Ross Low ◽  
David A Rasko ◽  
...  

Shigella spp. are the leading bacterial cause of severe childhood diarrhoea in low- and middle- income countries (LMIC), are increasingly antimicrobial resistant and have no licensed vaccine. We performed genomic analyses of 1246 systematically collected shigellae from seven LMIC to inform control and identify factors that could limit the effectiveness of current approaches. We found that S. sonnei contributes ≥20-fold more disease than other Shigella species relative to its genomic diversity and highlight existing diversity and adaptative capacity among S. flexneri that may generate vaccine escape variants in <6 months. Furthermore, we show convergent evolution of resistance against the current recommended antimicrobial among shigellae. This demonstrates the urgent need to integrate existing genomic diversity into vaccine and treatment plans for Shigella, and other pathogens.


Author(s):  
Yang Hai ◽  
Yalu Wen

Abstract Motivation Accurate disease risk prediction is essential for precision medicine. Existing models either assume that diseases are caused by groups of predictors with small-to-moderate effects or a few isolated predictors with large effects. Their performance can be sensitive to the underlying disease mechanisms, which are usually unknown in advance. Results We developed a Bayesian linear mixed model (BLMM), where genetic effects were modelled using a hybrid of the sparsity regression and linear mixed model with multiple random effects. The parameters in BLMM were inferred through a computationally efficient variational Bayes algorithm. The proposed method can resemble the shape of the true effect size distributions, captures the predictive effects from both common and rare variants, and is robust against various disease models. Through extensive simulations and the application to a whole-genome sequencing dataset obtained from the Alzheimer’s Disease Neuroimaging Initiatives, we have demonstrated that BLMM has better prediction performance than existing methods and can detect variables and/or genetic regions that are predictive. Availability The R-package is available at https://github.com/yhai943/BLMM Supplementary information Supplementary data are available at Bioinformatics online.


2020 ◽  
Vol 8 (11) ◽  
pp. 1679
Author(s):  
Valentina Méndez ◽  
Miryam Valenzuela ◽  
Francisco Salvà-Serra ◽  
Daniel Jaén-Luchoro ◽  
Ximena Besoain ◽  
...  

The genus Clavibacter has been associated largely with plant diseases. The aims of this study were to characterize the genomes and the virulence factors of Chilean C. michiganensis subsp. michiganensis strains VL527, MSF322 and OP3, and to define their phylogenomic positions within the species, Clavibacter michiganensis. VL527 and MSF322 genomes possess 3,396,632 and 3,399,199 bp, respectively, with a pCM2-like plasmid in strain VL527, with pCM1- and pCM2-like plasmids in strain MSF322. OP3 genome is composed of a chromosome and three plasmids (including pCM1- and pCM2-like plasmids) of 3,466,104 bp. Genomic analyses confirmed the phylogenetic relationships of the Chilean strains among C.michiganensis subsp. michiganensis and showed their low genomic diversity. Different virulence levels in tomato plants were observable. Phylogenetic analyses of the virulence factors revealed that the pelA1 gene (chp/tomA region)—that grouped Chilean strains in three distinct clusters—and proteases and hydrolases encoding genes, exclusive for each of the Chilean strains, may be involved in these observed virulence levels. Based on genomic similarity (ANIm) analyses, a proposal to combine and reclassify C. michiganensis subsp. phaseoli and subsp. chilensis at the species level, as C. phaseoli sp. nov., as well as to reclassify C. michiganensis subsp. californiensis as the species C. californiensis sp. nov. may be justified.


2019 ◽  
Author(s):  
Marcin Tabaka ◽  
Joshua Gould ◽  
Aviv Regev

AbstractWe present scSVA (single-cell Scalable Visualization and Analytics), a lightweight R package for interactive two- and three-dimensional visualization and exploration of massive single-cell omics data. Building in part of methods originally developed for astronomy datasets, scSVA is memory efficient for more than hundreds of millions of cells, can be run locally or in a cloud, and generates high-quality figures. In particular, we introduce a numerically efficient method for single-cell data embedding in 3D which combines an optimized implementation of diffusion maps with a 3D force-directed layout, enabling generation of 3D data visualizations at the scale of a million cells. To facilitate reproducible research, scSVA supports interactive analytics in a cloud with containerized tools. scSVA is available online at https://github.com/klarman-cell-observatory/scSVA.


2021 ◽  
Author(s):  
Chin Jian Yang ◽  
Rodney N. Edmondson ◽  
Hans-Peter Piepho ◽  
Wayne Powell ◽  
Ian Mackay

AbstractMultiparental advanced generation inter-cross (MAGIC) populations are valuable crop resources with a wide array of research uses including genetic mapping of complex traits, management of genetic resources and breeding of new varieties. Multiple founders are crossed to create a rich mosaic of highly recombined founder genomes in the MAGIC recombinant inbred lines (RILs). Many variations of MAGIC population designs exist; however, a large proportion of the currently available populations have been created empirically and based on similar designs. In our evaluations of five MAGIC populations, we found that the choice of designs has a large impact on the recombination landscape in the RILs. The most popular design used in many MAGIC populations has been shown to have a bias in recombinant haplotypes and low level of unique recombinant haplotypes, and therefore is not recommended. To address this problem and provide a remedy for the future, we have developed the “magicdesign” R package for creating and testing any MAGIC population design via simulation. A Shiny app version of the package is available as well. Our “magicdesign” package provides a unifying tool and a framework for creativity and innovation in MAGIC population designs. For example, using this package, we demonstrate that MAGIC population designs can be found which are very effective in creating haplotype diversity without the requirement for very large crossing programmes. Further, we show that interspersing cycles of crossing with cycles of selfing is effective in increasing haplotype diversity. These approaches are applicable in species which are hard to cross or in which resources are limited.


2018 ◽  
Vol 4 (9) ◽  
Author(s):  
Amy Fleshman ◽  
Kristin Mullins ◽  
Jason Sahl ◽  
Crystal Hepp ◽  
Nathan Nieto ◽  
...  

F1000Research ◽  
2019 ◽  
Vol 8 ◽  
pp. 1749 ◽  
Author(s):  
John D. Blischak ◽  
Peter Carbonetto ◽  
Matthew Stephens

Making scientific analyses reproducible, well documented, and easily shareable is crucial to maximizing their impact and ensuring that others can build on them. However, accomplishing these goals is not easy, requiring careful attention to organization, workflow, and familiarity with tools that are not a regular part of every scientist's toolbox. We have developed an R package, workflowr, to help all scientists, regardless of background, overcome these challenges. Workflowr aims to instill a particular "workflow" — a sequence of steps to be repeated and integrated into research practice — that helps make projects more reproducible and accessible.This workflow integrates four key elements: (1) version control (via Git); (2) literate programming (via R Markdown); (3) automatic checks and safeguards that improve code reproducibility; and (4) sharing code and results via a browsable website. These features exploit powerful existing tools, whose mastery would take considerable study. However, the workflowr interface is simple enough that novice users can quickly enjoy its many benefits. By simply following the workflowr "workflow", R users can create projects whose results, figures, and development history are easily accessible on a static website — thereby conveniently shareable with collaborators by sending them a URL — and accompanied by source code and reproducibility safeguards. The workflowr R package is open source and available on CRAN, with full documentation and source code available at https://github.com/jdblischak/workflowr.


Author(s):  
Sarah Percy

From China’s claims in the South China Sea and its dispute with Japan over Senkaku Island to Canada’s concerns over melting sea ice in the Northwest Passage and how best to secure its Arctic region, maritime security issues rest at the heart of many core strategic disputes. Maritime insecurity can also take unconventional forms, stemming from criminal threats at sea that can have an impact upon national security. Both challenges have a long history, and both intersect the crucial issues or rules, order, and ungoverned spaces. this chapter examines this nexus of challenges, providing the analytical tools needed to forecast what is and is not likely to change. The chapter concludes by considering the relationship between continuity, change, and contingency in the future of maritime security.


2017 ◽  
Author(s):  
Dennis van Muijen ◽  
Ram K. Basnet ◽  
Nathalie N.J. Dek ◽  
Chris Maliepaard ◽  
Evert W. Gutteling

AbstractMotivationWhere standalone tools for genetic map visualisation, consensus map construction, and Marey map analysis offer useful analyses for research and breeding, no single tool combines the three. Manual data curation is part of each of these analyses, which is difficult to standardize and consequently error prone.ResultsMapfuser provides a high-level interface for common analyses in breeding programs and quantitative genetics. Combined with interactive visualisations and automated quality control, mapfuser provides a standardized and flexible toolbox and is available as Shiny app for biologists. Reproducible research in the R package is facilitated by storage of raw data, function parameters, and results in an R object. In the shiny app a rmarkdown report is available.AvailabilityMapfuser is available as R package at https://github.com/dmuijen/mapfuser under the GPL-3 License and is available for public use as Shiny application at https://plantbreeding.shinyapps.io/mapfuser. Documentation for the R package is available as package vignette. Shiny app documentation is integrated in the [email protected]


Author(s):  
Chin Jian Yang ◽  
Rodney N Edmondson ◽  
Hans-Peter Piepho ◽  
Wayne Powell ◽  
Ian Mackay

Abstract Multiparental advanced generation inter-cross (MAGIC) populations are valuable crop resources with a wide array of research uses including genetic mapping of complex traits, management of genetic resources and breeding of new varieties. Multiple founders are crossed to create a rich mosaic of highly recombined founder genomes in the MAGIC recombinant inbred lines (RILs). Many variations of MAGIC population designs exist; however, a large proportion of the currently available populations have been created empirically and based on similar designs. In our evaluations of five MAGIC populations, we found that the choice of designs has a large impact on the recombination landscape in the RILs. The most popular design used in many MAGIC populations has been shown to have a bias in recombinant haplotypes and low level of unique recombinant haplotypes, and therefore is not recommended. To address this problem and provide a remedy for the future, we have developed the “magicdesign” R package for creating and testing any MAGIC population design via simulation. A Shiny app version of the package is available as well. Our “magicdesign” package provides a unifying tool and a framework for creativity and innovation in MAGIC population designs. For example, using this package, we demonstrate that MAGIC population designs can be found which are very effective in creating haplotype diversity without the requirement for very large crossing programs. Further, we show that interspersing cycles of crossing with cycles of selfing is effective in increasing haplotype diversity. These approaches are applicable in species which are hard to cross or in which resources are limited.


Sign in / Sign up

Export Citation Format

Share Document