Whole Genome Sequencing Assembly and Annotation of Halobacillus Trueperi S61 Isolated From the Qarhan Salt Lake

Mapping Intimacies ◽

10.21203/rs.3.rs-1188620/v1 ◽

2022 ◽

Author(s):

Wei Li ◽

Shuo Shen ◽

Jian Wang

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Protein Function ◽

Salt Lake ◽

Gc Content ◽

Whole Genome ◽

Biological Processes ◽

Circular Chromosome ◽

Advanced Analysis ◽

Qarhan Salt Lake

Abstract Background: Halophilic microbial as prospective resources of biotechnology due to the advantages of flexible survivability. Qarhan Salt Lake is the second largest Salt Lake in the world which contains rich-unique extremophiles and deserved in-depth exploration. Results: Present study first time isolated novel strain Halobacillus trueperi S61 from Qarhan Salt Lake and performed whole-genome sequencing through combined third-generation PacBio and second-generation Illumina technology. The whole genome of Halobacillus trueperi S61 identified 57549 total reads and consists a complete circular chromosome of 4047887 bp with 43.86% GC content without gaps. Total number of 139 non-coding RNA (included 86 tRNA, 30 rRNA and 23 sRNA), 16 gene islands with 260275 bp and two prophages (with 82682 length) were predicted. In addition, the whole genome of Halobacillus trueperi S61 summarized basic annotation for 3982 protein-coding genes, 3980, 3667, 2998 and 2303 unigenes were annotated with Nr, Swissport, KOG and KEGG database. Combined with advanced analysis, 561 carbohydrate enzymes and 4416 pathogen host interactions related genes were identified. The protein function of Halobacillus trueperi S61 was mainly focus on biological processes, and the protein function was mainly distributed in gene transcription and amino acids, and carbohydrates metabolism. Conclusions: The complete whole genome sequence assembly and annotation of novel strain Halobacillus trueperi S61 isolated from Qarhan Salt Lake mainly focus on protein biological processes and antibiotic resistance, provides a potential resource for biotechnology.

Get full-text (via PubEx)

Characterization of multidrug-resistant Acinetobacter baumannii strain ATCC BAA1605 using whole-genome sequencing

BMC Research Notes ◽

10.1186/s13104-021-05493-z ◽

2021 ◽

Vol 14 (1) ◽

Author(s):

Kah Ern Ten ◽

Muhammad Zarul Hanifah Md Zoqratt ◽

Qasim Ayub ◽

Hock Siew Tan

Keyword(s):

Whole Genome Sequencing ◽

Acinetobacter Baumannii ◽

Genome Sequencing ◽

Gc Content ◽

Multidrug Resistant ◽

Genomic Islands ◽

Whole Genome ◽

Strain Atcc ◽

Circular Chromosome ◽

Genome Data

Abstract Objective The nosocomial pathogen, Acinetobacter baumannii, has acquired clinical significance due to its ability to persist in hospital settings and survive antibiotic treatment, which eventually resulted in the rapid spread of this bacterium with antimicrobial resistance (AMR) phenotypes. This study used a multidrug-resistant A. baumannii (strain ATCC BAA1605) as a model to study the genomic features of this pathogen. Results One circular chromosome and one circular plasmid were discovered in the complete genome of A. baumannii ATCC BAA1605 using whole-genome sequencing. The chromosome is 4,039,171 bp long with a GC content of 39.24%. Many AMR genes, which confer resistance to major classes of antibiotics (beta-lactams, aminoglycosides, tetracycline, sulphonamides), were found on the chromosome. Two genomic islands were predicted on the chromosome, one of which (Genomic Island 1) contains a cluster of AMR genes and mobile elements, suggesting the possibility of horizontal gene transfer. A subtype I-F CRISPR-Cas system was also identified on the chromosome of A. baumannii ATCC BAA1605. This study provides valuable genome data that can be used as a reference for future studies on A. baumannii. The genome of A. baumannii ATCC BAA1605 has been deposited at GenBank under accession no. CP058625 and CP058626.

Get full-text (via PubEx)

Evaluating coverage bias in next-generation sequencing of Escherichia coli

PLoS ONE ◽

10.1371/journal.pone.0253440 ◽

2021 ◽

Vol 16 (6) ◽

pp. e0253440

Author(s):

Samantha Gunasekera ◽

Sam Abraham ◽

Marc Stegger ◽

Stanley Pang ◽

Penghao Wang ◽

...

Keyword(s):

Escherichia Coli ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

De Novo ◽

Fragment Size ◽

Gc Content ◽

Main Concern ◽

Whole Genome ◽

Assembly Quality ◽

Coverage Bias

Whole-genome sequencing is essential to many facets of infectious disease research. However, technical limitations such as bias in coverage and tagmentation, and difficulties characterising genomic regions with extreme GC content have created significant obstacles in its use. Illumina has claimed that the recently released DNA Prep library preparation kit, formerly known as Nextera Flex, overcomes some of these limitations. This study aimed to assess bias in coverage, tagmentation, GC content, average fragment size distribution, and de novo assembly quality using both the Nextera XT and DNA Prep kits from Illumina. When performing whole-genome sequencing on Escherichia coli and where coverage bias is the main concern, the DNA Prep kit may provide higher quality results; though de novo assembly quality, tagmentation bias and GC content related bias are unlikely to improve. Based on these results, laboratories with existing workflows based on Nextera XT would see minor benefits in transitioning to the DNA Prep kit if they were primarily studying organisms with neutral GC content.

Get full-text (via PubEx)

Whole-Genome Sequencing of Mycobacterium tilburgii Strain MEPHI

Microbiology Resource Announcements ◽

10.1128/mra.00933-19 ◽

2019 ◽

Vol 8 (40) ◽

Author(s):

Jamal Saad ◽

Michel Drancourt ◽

Margaret M. Hannan ◽

Patrick J. Stapleton ◽

Simon Grandjean Lapierre

Keyword(s):

Whole Genome Sequencing ◽

Clinical Isolate ◽

Genome Sequencing ◽

Gc Content ◽

Whole Genome ◽

Content Type ◽

Phylogenetic Placement ◽

Mycobacterium Simiae

Mycobacterium tilburgii is a fastidious mycobacterium which has previously been reported to cause severe disseminated infections. Genome sequencing of the M. tilburgii MEPHI clinical isolate yielded 3.14 Mb, with 66.3% GC content, and confirmed phylogenetic placement within the Mycobacterium simiae complex.

Get full-text (via PubEx)

Fine mapping of genomic regions associated with female fertility in Nellore beef cattle based on sequence variants from segregating sires

Journal of Animal Science and Biotechnology ◽

10.1186/s40104-019-0403-0 ◽

2019 ◽

Vol 10 (1) ◽

Author(s):

Gerson A. Oliveira Júnior ◽

Daniel J. A. Santos ◽

Aline S. M. Cesar ◽

Solomon A. Boison ◽

Ricardo V. Ventura ◽

...

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Fine Mapping ◽

Reproductive Hormones ◽

Sequence Variants ◽

Whole Genome ◽

Biological Processes ◽

Snp Chip ◽

Mesh Terms ◽

Genomic Regions

Abstract Background Impaired fertility in cattle limits the efficiency of livestock production systems. Unraveling the genetic architecture of fertility traits would facilitate their improvement by selection. In this study, we characterized SNP chip haplotypes at QTL blocks then used whole-genome sequencing to fine map genomic regions associated with reproduction in a population of Nellore (Bos indicus) heifers. Methods The dataset comprised of 1337 heifers genotyped using a GeneSeek® Genomic Profiler panel (74677 SNPs), representing the daughters from 78 sires. After performing marker quality control, 64800 SNPs were retained. Haplotypes carried by each sire at six previously identified QTL on BTAs 5, 14 and 18 for heifer pregnancy and BTAs 8, 11 and 22 for antral follicle count were constructed using findhap software. The significance of the contrasts between the effects of every two paternally-inherited haplotype alleles were used to identify sires that were heterozygous at each QTL. Whole-genome sequencing data localized to the haplotypes from six sires and 20 other ancestors were used to identify sequence variants that were concordant with the haplotype contrasts. Enrichment analyses were applied to these variants using KEGG and MeSH libraries. Results A total of six (BTA 5), six (BTA 14) and five (BTA 18) sires were heterozygous for heifer pregnancy QTL whereas six (BTA 8), fourteen (BTA 11), and five (BTA 22) sires were heterozygous for number of antral follicles’ QTL. Due to inadequate representation of many haplotype alleles in the sequenced animals, fine mapping analysis could only be reliably performed for the QTL on BTA 5 and 14, which had 641 and 3733 concordant candidate sequence variants, respectively. The KEGG “Circadian rhythm” and “Neurotrophin signaling pathway” were significantly associated with the genes in the QTL on BTA 5 whereas 32 MeSH terms were associated with the QTL on BTA 14. Among the concordant sequence variants, 0.2% and 0.3% were classified as missense variants for BTAs 5 and 14, respectively, highlighting the genes MTERF2, RTMB, ENSBTAG00000037306 (miRNA), ENSBTAG00000040351, PRKDC, and RGS20. The potential causal mutations found in the present study were associated with biological processes such as oocyte maturation, embryo development, placenta development and response to reproductive hormones. Conclusions The identification of heterozygous sires by positionally phasing SNP chip data and contrasting haplotype effects for previously detected QTL can be used for fine mapping to identify potential causal mutations and candidate genes. Genomic variants on genes MTERF2, RTBC, miRNA ENSBTAG00000037306, ENSBTAG00000040351, PRKDC, and RGS20, which are known to have influence on reproductive biological processes, were detected.

Get full-text (via PubEx)

Validation and Implementation of CLIA-Compliant Whole Genome Sequencing (WGS) in Public Health Laboratory

10.1101/107003 ◽

2017 ◽

Author(s):

Varvara K. Kozyreva ◽

Chau-Linda Truong ◽

Alexander L. Greninger ◽

John Crandall ◽

Rituparna Mukhopadhyay ◽

...

Keyword(s):

Public Health ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Gc Content ◽

End Users ◽

Whole Genome ◽

Base Calling ◽

Specificity And Sensitivity ◽

Wide Range ◽

Performance Specifications

AbstractBackgroundPublic health microbiology laboratories (PHL) are at the cusp of unprecedented improvements in pathogen identification, antibiotic resistance detection, and outbreak investigation by using whole genome sequencing (WGS). However, considerable challenges remain due to the lack of common standards.Objectives1) Establish the performance specifications of WGS applications used in PHL to conform with CLIA (Clinical Laboratory Improvements Act) guidelines for laboratory developed tests (LDT), 2) Develop quality assurance (QA) and quality control (QC) measures, 3) Establish reporting language for end users with or without WGS expertise, 4) Create a validation set of microorganisms to be used for future validations of WGS platforms and multi-laboratory comparisons and, 5) Create modular templates for the validation of different sequencing platforms.MethodsMiSeq Sequencer and Illumina chemistry (Illumina, Inc.) were used to generate genomes for 34 bacterial isolates with genome sizes from 1.8 to 4.7 Mb and wide range of GC content (32.1%-66.1%). A customized CLCbio Genomics Workbench - shell script bioinformatics pipeline was used for the data analysis.ResultsWe developed a validation panel comprising ten Enterobacteriaceae isolates, five gram-positive cocci, five gram-negative non-fermenting species, nine Mycobacterium tuberculosis, and five miscellaneous bacteria; the set represented typical workflow in the PHL. The accuracy of MiSeq platform for individual base calling was >99.9% with similar results shown for reproducibility/repeatability of genome-wide base calling. The accuracy of phylogenetic analysis was 100%. The specificity and sensitivity inferred from MLST and genotyping tests were 100%. A test report format was developed for the end users with and without WGS knowledge.ConclusionWGS was validated for routine use in PHL according to CLIA guidelines for LDTs. The validation panel, sequencing analytics, and raw sequences will be available for future multi-laboratory comparisons of WGS in PHL. Additionally, the WGS performance specifications and modular validation template are likely to be adaptable for the validation of other platforms and reagents kits.

Get full-text (via PubEx)

Safety Evaluation and Whole Genome Sequencing of Aspergillus japonicas PJ01 Reveal Its Potential to Degrade Citrus Segments in Juice Processing

Foods ◽

10.3390/foods10081736 ◽

2021 ◽

Vol 10 (8) ◽

pp. 1736

Author(s):

Yujiao Qian ◽

Zhipeng Gao ◽

Jieyi Wang ◽

Chen Wang ◽

Gaoyang Li ◽

...

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Gene Annotation ◽

Gc Content ◽

Crude Enzyme ◽

Control Group ◽

Whole Genome ◽

Enzyme Solution ◽

Juice Processing ◽

Crude Enzyme Solution

Aspergillus japonicas PJ01 (A. japonicas PJ01) is a strain isolated from the rotten branches. In previ-ous studies, it was shown that it can produce complex enzymes to degrade polysaccharide com-ponents. In this study, we evaluated the safety of its crude enzyme solution. Acute oral toxicity, subchronic toxicity, micronucleus and sperm malformation tests all validated the high biologi-cal safety for the crude enzymes. Secondly, we carried out the citrus segment degradation ex-periment of crude enzyme solution. Compared with the control group, the crude enzyme solu-tion of A. japonicas PJ01 can completely degrade the segments in 50 min, which provides the basis for enzymatic peeling during juice processing. The whole genome sequencing showed that the genome of A. japonicus PJ01 has a GC content of 51.37% with a size of 36204647 bp, and encoded 10070 genes. GO, COG, KEGG and CAZy databases were used in gene annotation analyses. Pathway enrichment showed many genes related to carbohydrate metabolism, rich in genes re-lated to pectinase, xylanase and carboxylcellulase. Therefore, the complex enzyme produced by A. japonicus PJ01 can be used in gizzard juice processing to achieve efficient enzymatic decapsu-lation.

Get full-text (via PubEx)

Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

Nature Communications ◽

10.1038/s41467-020-18151-y ◽

2020 ◽

Vol 11 (1) ◽

Author(s):

Matthew H. Bailey ◽

◽

William U. Meyerson ◽

Lewis Jonathan Dursi ◽

Liang-Bo Wang ◽

...

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Gc Content ◽

Cancer Genome ◽

The Cancer Genome Atlas ◽

Whole Genome Sequencing Data ◽

Whole Genome ◽

Sequencing Data ◽

Whole Exome ◽

Cancer Genome Atlas

AbstractThe Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts.

Get full-text (via PubEx)

Genome Sequence of Bacillus velezensis 5RB, an Overproducer of 2,3-Butanediol

Microbiology Resource Announcements ◽

10.1128/mra.01475-18 ◽

2019 ◽

Vol 8 (1) ◽

Author(s):

Penka Petrova ◽

Petya Velikova ◽

Kaloyan Petrov

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Genome Sequence ◽

Whole Genome ◽

Bacillus Velezensis ◽

Circular Chromosome ◽

Content Type

Bacillus velezensis 5RB is capable of producing large amounts of 2,3-butanediol. Whole-genome sequencing revealed that the strain contains one circular chromosome of 3.91 Mbp without plasmids.

Get full-text (via PubEx)

Squamate Chromosome Size and GC Content Assessed by Flow Karyotyping

Cytogenetic and Genome Research ◽

10.1159/000497265 ◽

2019 ◽

Vol 157 (1-2) ◽

pp. 46-52 ◽

Cited By ~ 1

Author(s):

Fumio Kasai ◽

Patricia C.M. O'Brien ◽

Malcolm A. Ferguson-Smith

Keyword(s):

Gene Mapping ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Chromosome Painting ◽

Karyotype Evolution ◽

Gc Content ◽

Whole Genome ◽

Chromosome Size ◽

The Relationship ◽

Reptilian Species

Chromosome homologies in reptiles have been investigated extensively by gene mapping and chromosome painting. Relative chromosome size can be estimated roughly from conventional karyotypes, but chromosome GC content cannot be evaluated by any of these approaches. However, GC content can be obtained by whole-genome sequencing, although complete data are available only for a limited number of reptilian species. Chromosomes can be characterized by size and GC content in bivariate flow karyotypes, in which the distribution of peaks represents the differences. We have analysed flow karyotypes from 9 representative squamate species and show chromosome profiles for each species based on the relationship between size and GC content. Our results reveal that the GC content of macrochromosomes is invariable in the 9 species. A higher GC content was found in microchromosomes, similar to profiles previously determined in crocodile, turtle, and chicken. The findings suggest that karyotype evolution in reptiles is characterized by unique features of chromosome GC content.

Get full-text (via PubEx)

Utility of Whole Genome Sequencing in diagnosing complex disorders: lesson from renal tubular disorders

Endocrine Abstracts ◽

10.1530/endoabs.59.p052 ◽

2018 ◽

Author(s):

Mark Stevenson ◽

Alistair T Pagnamenta ◽

Heather G Mack ◽

Judith A Savige ◽

Kate E Lines ◽

...

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome ◽

Complex Disorders ◽

Tubular Disorders ◽

Renal Tubular Disorders ◽

Renal Tubular

Get full-text (via PubEx)