scholarly journals The ATCC Genome Portal: Microbial Genome Reference Standards with Data Provenance

2021 ◽  
Vol 10 (47) ◽  
Author(s):  
Briana Benton ◽  
Stephen King ◽  
Samuel R. Greenfield ◽  
Nikhita Puthuveetil ◽  
Amy L. Reese ◽  
...  

Lack of data provenance negatively impacts scientific reproducibility and the reliability of genomic data. The ATCC Genome Portal ( https://genomes.atcc.org ) addresses this by providing data provenance information for microbial whole-genome assemblies originating from authenticated biological materials. To date, we have sequenced 1,579 complete genomes, including 466 type strains and 1,156 novel genomes.

2018 ◽  
Vol 6 (7) ◽  
Author(s):  
Joseph W. Saelens ◽  
Dalia Lau-Bonilla ◽  
Anneliese Moller ◽  
Ana M. Xet-Mull ◽  
Narda Medina ◽  
...  

ABSTRACT Whole-genome sequencing has resulted in new insights into the phylogeography of Mycobacterium tuberculosis . However, only limited genomic data are available from M. tuberculosis strains in Guatemala. Here we report 16 complete genomes of clinical strains belonging to the Euro-American lineage 4, the most common lineage found in Guatemala and Central America.


2021 ◽  
Vol 52 (1) ◽  
Author(s):  
Jaewon Lim ◽  
Hong-Tae Park ◽  
Seyoung Ko ◽  
Hyun-Eui Park ◽  
Gyumin Lee ◽  
...  

AbstractMycobacterium avium subsp. paratuberculosis (MAP) is a causative agent of Johne’s disease, which is a chronic granulomatous enteropathy in ruminants. Determining the genetic diversity of MAP is necessary to understand the epidemiology and biology of MAP, as well as establishing disease control strategies. In the present study, whole genome-based alignment and comparative analysis were performed using 40 publicly available MAP genomes, including newly sequenced Korean isolates. First, whole genome-based alignment was employed to identify new genomic structures in MAP genomes. Second, the genomic diversity of the MAP population was described by pangenome analysis. A phylogenetic tree based on the core genome and pangenome showed that the MAP was differentiated into two major types (C- and S-type), which was in keeping with the findings of previous studies. However, B-type strains were discriminated from C-type strains. Finally, functional analysis of the pangenome was performed using three virulence factor databases (i.e., PATRIC, VFDB, and Victors) to predict the phenotypic diversity of MAP in terms of pathogenicity. Based on the results of the pangenome analysis, we developed a real-time PCR technique to distinguish among S-, B- and C-type strains. In conclusion, the results of our study suggest that the phenotypic differences between MAP strains can be explained by their genetic polymorphisms. These results may help to elucidate the diversity of MAP, extending from genomic features to phenotypic traits.


2021 ◽  
pp. 1-12
Author(s):  
Holly Etchegary ◽  
Daryl Pullman ◽  
Charlene Simmonds ◽  
Zoha Rabie ◽  
Proton Rahman

<b><i>Introduction:</i></b> The growth of global sequencing initiatives and commercial genomic test offerings suggests the public will increasingly be confronted with decisions about sequencing. Understanding public attitudes can assist efforts to integrate sequencing into care and inform the development of public education and outreach strategies. <b><i>Methods:</i></b> A 48-item online survey was advertised on Facebook in Eastern Canada and hosted on SurveyMonkey in late 2018. The survey measured public interest in whole genome sequencing and attitudes toward various aspects of sequencing using vignettes, scaled, and open-ended items. <b><i>Results:</i></b> While interest in sequencing was high, critical attitudes were observed. In particular, items measuring features of patient control and choice regarding genomic data were strongly endorsed by respondents. Majority wanted to specify upfront how their data could be used, retain the ability to withdraw their sample at a later date, sign a written consent form, and speak to a genetic counselor prior to sequencing. Concerns about privacy and unauthorized access to data were frequently observed. Education level was the sociodemographic variable most often related to attitude statements such that those with higher levels of education generally displayed more critical attitudes. <b><i>Conclusions:</i></b> Attitudes identified here could be used to inform the development of implementation strategies for genomic medicine. Findings suggest health systems must address patient concerns about privacy, consent practices, and the strong desire to control what happens to their genomic data through public outreach and education. Specific oversight procedures and policies that are clearly communicated to the public will be required.


Genes ◽  
2021 ◽  
Vol 12 (1) ◽  
pp. 124
Author(s):  
Alessio Iannucci ◽  
Alexey I. Makunin ◽  
Artem P. Lisachov ◽  
Claudio Ciofi ◽  
Roscoe Stanyon ◽  
...  

The study of vertebrate genome evolution is currently facing a revolution, brought about by next generation sequencing technologies that allow researchers to produce nearly complete and error-free genome assemblies. Novel approaches however do not always provide a direct link with information on vertebrate genome evolution gained from cytogenetic approaches. It is useful to preserve and link cytogenetic data with novel genomic discoveries. Sequencing of DNA from single isolated chromosomes (ChromSeq) is an elegant approach to determine the chromosome content and assign genome assemblies to chromosomes, thus bridging the gap between cytogenetics and genomics. The aim of this paper is to describe how ChromSeq can support the study of vertebrate genome evolution and how it can help link cytogenetic and genomic data. We show key examples of ChromSeq application in the refinement of vertebrate genome assemblies and in the study of vertebrate chromosome and karyotype evolution. We also provide a general overview of the approach and a concrete example of genome refinement using this method in the species Anolis carolinensis.


Author(s):  
Anton Michlmayr ◽  
Florian Rosenberg ◽  
Philipp Leitner ◽  
Schahram Dustdar

In general, provenance describes the origin and well-documented history of a given object. This notion has been applied in information systems, mainly to provide data provenance of scientific workflows. Similar to this, provenance in Service-oriented Computing has also focused on data provenance. However, the authors argue that in service-centric systems the origin and history of services is equally important. This paper presents an approach that addresses service provenance. The authors show how service provenance information can be collected and retrieved, and how security mechanisms guarantee integrity and access to this information, while also providing user-specific views on provenance. Finally, the paper gives a performance evaluation of the authors’ approach, which has been integrated into the VRESCo Web service runtime environment.


2020 ◽  
Author(s):  
Kyle Fletcher ◽  
Lin Zhang ◽  
Juliana Gil ◽  
Rongkui Han ◽  
Keri Cavanaugh ◽  
...  

AbstractBackgroundGenetic maps are an important resource for validation of genome assemblies, trait discovery, and breeding. Next generation sequencing has enabled production of high-density genetic maps constructed with 10,000s of markers. Most current approaches require a genome assembly to identify markers. Our Assembly Free Linkage Analysis Pipeline (AFLAP) removes this requirement by using uniquely segregating k-mers as markers to rapidly construct a genotype table and perform subsequent linkage analysis. This avoids potential biases including preferential read alignment and variant calling.ResultsThe performance of AFLAP was determined in simulations and contrasted to a conventional workflow. We tested AFLAP using 100 F2 individuals of Arabidopsis thaliana, sequenced to low coverage. Genetic maps generated using k-mers contained over 130,000 markers that were concordant with the genomic assembly. The utility of AFLAP was then demonstrated by generating an accurate genetic map using genotyping-by-sequencing data of 235 recombinant inbred lines of Lactuca spp. AFLAP was then applied to 83 F1 individuals of the oomycete Bremia lactucae, sequenced to >5x coverage. The genetic map contained over 90,000 markers ordered in 19 large linkage groups. This genetic map was used to fragment, order, orient, and scaffold the genome, resulting in a much-improved reference assembly.ConclusionsAFLAP can be used to generate high density linkage maps and improve genome assemblies of any organism when a mapping population is available using whole genome sequencing or genotyping-by-sequencing data. Genetic maps produced for B. lactucae were accurately aligned to the genome and guided significant improvements of the reference assembly.


Author(s):  
Camille Bourgaux ◽  
Ana Ozaki ◽  
Rafael Penaloza ◽  
Livia Predoiu

We address the problem of handling provenance information in ELHr ontologies. We consider a setting recently introduced for ontology-based data access, based on semirings and extending classical data provenance, in which ontology axioms are annotated with provenance tokens. A consequence inherits the provenance of the axioms involved in deriving it, yielding a provenance polynomial as an annotation. We analyse the semantics for the ELHr case and show that the presence of conjunctions poses various difficulties for handling provenance, some of which are mitigated by assuming multiplicative idempotency of the semiring. Under this assumption, we study three problems: ontology completion with provenance, computing the set of relevant axioms for a consequence, and query answering.


2014 ◽  
Vol 52 (1) ◽  
pp. 36-47 ◽  
Author(s):  
JN Morris ◽  
TS Crawford ◽  
A Jeffs ◽  
PA Stockwell ◽  
JJ Eaton-Rye ◽  
...  
Keyword(s):  

2018 ◽  
Vol 6 (4) ◽  
Author(s):  
Lindsey A. Moser ◽  
Lauren M. Oldfield ◽  
Nadia Fedorova ◽  
Vinita Puri ◽  
Susmita Shrivastava ◽  
...  

ABSTRACT We report 26 complete genomes of Zika virus (ZIKV) isolated after passaging the Zika virus strain FLR in mosquito (C6/36) and mammalian (Vero) cell lines. The consensus ZIKV genomes we recovered show greater than 99% nucleotide identify with each other and with the FLR strain used as input.


2013 ◽  
Vol 10 (6) ◽  
pp. 563-569 ◽  
Author(s):  
Chen-Shan Chin ◽  
David H Alexander ◽  
Patrick Marks ◽  
Aaron A Klammer ◽  
James Drake ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document