scholarly journals Development of Genome-wide Simple Sequence Repeat Markers from Whole-genome Sequence of Mungbean (Vigna radiata)

Author(s):  
Kanimoli Mathivathana Mayalagu ◽  
Karthikeyan Adhimoolam ◽  
Jagadeeshselvam Nallathambi ◽  
Veera Ranjani Rajagopalan ◽  
Madhumitha Balasubramanian ◽  
...  

Background: Mungbean is an important pulse crop and it is mainly cultivated in Asia for human consumption. Its small genome and diploid nature make it a well-suited model organism among legume crops. Thus, cost-effective, reliable and highly polymorphic molecular markers distributing the whole genome are needed for diversity, mapping and functional genomics studies in this model species. Methods: The whole-genome sequence of mungbean was obtained and used as a source of identification of simple sequence repeats (SSR). The sequence reads were aligned and SSRs detection was performed using the Phobos plugin tandem repeat finder in the Geneious software program. A total of 12 mungbean genotypes were selected to validate the newly developed SSR markers. Result: In the present study, a total of 12, 49,774 and 11, 86, 386 perfect and imperfect SSR repeats were identified from the mungbean genome. The tri-repeats were the most abundant (26.10%), followed by hexa (20.82%), penta (20.45%), tetra (17.65%) and di-repeats (14.95%). We designed 1330 SSR primers based on the genomic sequence of flanking perfect SSRs (Di and tri-repeats). Among them, 50 SSR primers uniformly distributed across the 11 mungbean chromosomes were selected and used to validate 12 mungbean genotypes. The newly developed genomic SSR markers generated in the present study are a valuable genomic resource for the mungbean breeding programs.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Gehendra Bhattarai ◽  
Ainong Shi ◽  
Devi R. Kandel ◽  
Nora Solís-Gracia ◽  
Jorge Alberto da Silva ◽  
...  

AbstractThe availability of well-assembled genome sequences and reduced sequencing costs have enabled the resequencing of many additional accessions in several crops, thus facilitating the rapid discovery and development of simple sequence repeat (SSR) markers. Although the genome sequence of inbred spinach line Sp75 is available, previous efforts have resulted in a limited number of useful SSR markers. Identification of additional polymorphic SSR markers will support genetics and breeding research in spinach. This study aimed to use the available genomic resources to mine and catalog a large number of polymorphic SSR markers. A search for SSR loci on six chromosome sequences of spinach line Sp75 using GMATA identified a total of 42,155 loci with repeat motifs of two to six nucleotides in the Sp75 reference genome. Whole-genome sequences (30x) of additional 21 accessions were aligned against the chromosome sequences of the reference genome and in silico genotyped using the HipSTR program by comparing and counting repeat numbers variation across the SSR loci among the accessions. The HipSTR program generated SSR genotype data were filtered for monomorphic and high missing loci, and a final set of the 5986 polymorphic SSR loci were identified. The polymorphic SSR loci were present at a density of 12.9 SSRs/Mb and were physically mapped. Out of 36 randomly selected SSR loci for validation, two failed to amplify, while the remaining were all polymorphic in a set of 48 spinach accessions from 34 countries. Genetic diversity analysis performed using the SSRs allele score data on the 48 spinach accessions showed three main population groups. This strategy to mine and develop polymorphic SSR markers by a comparative analysis of the genome sequences of multiple accessions and computational genotyping of the candidate SSR loci eliminates the need for laborious experimental screening. Our approach increased the efficiency of discovering a large set of novel polymorphic SSR markers, as demonstrated in this report.


2017 ◽  
Vol 7 (1) ◽  
Author(s):  
Lynsey K. Whitacre ◽  
Jesse L. Hoff ◽  
Robert D. Schnabel ◽  
Sara Albarella ◽  
Francesca Ciotola ◽  
...  

2019 ◽  
Author(s):  
Xin Zhou ◽  
Lu Zhang ◽  
Ziming Weng ◽  
David L. Dill ◽  
Arend Sidow

AbstractVariant discovery in personal, whole genome sequence data is critical for uncovering the genetic contributions to health and disease. We introduce a new approach, Aquila, that uses linked-read data for generating a high quality diploid genome assembly, from which it then comprehensively detects and phases personal genetic variation. Assemblies cover >95% of the human reference genome, with over 98% in a diploid state. Thus, the assemblies support detection and accurate genotyping of the most prevalent types of human genetic variation, including single nucleotide polymorphisms (SNPs), small insertions and deletions (small indels), and structural variants (SVs), in all but the most difficult regions. All heterozygous variants are phased in blocks that can approach arm-level length. The final output of Aquila is a diploid and phased personal genome sequence, and a phased VCF file that also contains homozygous and a few unphased heterozygous variants. Aquila represents a cost-effective evolution of whole-genome reconstruction that can be applied to cohorts for variation discovery or association studies, or to single individuals with rare phenotypes that could be caused by SVs or compound heterozygosity.


2012 ◽  
Vol 10 (06) ◽  
pp. 1250016 ◽  
Author(s):  
MADHAVI K. GANAPATHIRAJU ◽  
ASIA D. MITCHELL ◽  
MOHAMED THAHIR ◽  
KAMIYA MOTWANI ◽  
SESHAN ANANTHASUBRAMANIAN

Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.


2019 ◽  
Vol 8 (32) ◽  
Author(s):  
Sofia B. Mohamed ◽  
Mohamed Hassan ◽  
Abdalla Munir ◽  
Sumaya Kambal ◽  
Nusiba I. Abdalla ◽  
...  

Acinetobacter baumannii has emerged as an important pathogen leading to multiple nosocomial outbreaks. Here, we describe the genomic sequence of a multidrug-resistant Acinetobacter baumannii sequence type 164 (ST164) isolate from a hospital patient in Sudan. To our knowledge, this is the first reported draft genome of an A. baumannii strain isolated from Sudan.


2020 ◽  
Vol 49 (D1) ◽  
pp. D644-D650 ◽  
Author(s):  
Ye Feng ◽  
Shengmei Zou ◽  
Hangfei Chen ◽  
Yunsong Yu ◽  
Zhi Ruan

Abstract An increasing prevalence of hospital acquired infections and foodborne illnesses caused by pathogenic and multidrug-resistant bacteria has stimulated a pressing need for benchtop computational techniques to rapidly and accurately classify bacteria from genomic sequence data, and based on that, to trace the source of infection. BacWGSTdb (http://bacdb.org/BacWGSTdb) is a free publicly accessible database we have developed for bacterial whole-genome sequence typing and source tracking. This database incorporates extensive resources for bacterial genome sequencing data and the corresponding metadata, combined with specialized bioinformatics tools that enable the systematic characterization of the bacterial isolates recovered from infections. Here, we present BacWGSTdb 2.0, which encompasses several major updates, including (i) the integration of the core genome multi-locus sequence typing (cgMLST) approach, which is highly scalable and appropriate for typing isolates belonging to different lineages; (ii) the addition of a multiple genome analysis module that can process dozens of user uploaded sequences in a batch mode; (iii) a new source tracking module for comparing user uploaded plasmid sequences to those deposited in the public databases; (iv) the number of species encompassed in BacWGSTdb 2.0 has increased from 9 to 20, which represents bacterial pathogens of medical importance; (v) a newly designed, user-friendly interface and a set of visualization tools for providing a convenient platform for users are also included. Overall, the updated BacWGSTdb 2.0 bears great utility in continuing to provide users, including epidemiologists, clinicians and bench scientists, with a one-stop solution to bacterial genome sequence analysis.


2017 ◽  
Vol 5 (24) ◽  
Author(s):  
William G. Miller ◽  
Emma Yee ◽  
Joana Revez ◽  
James L. Bono ◽  
Mirko Rossi

ABSTRACT Campylobacter cuniculorum is a thermotolerant species isolated from farmed rabbits (Oryctolagus cuniculus). Although C. cuniculorum is highly prevalent in rabbits farmed for human consumption, the pathogenicity of this organism in humans is still unknown. This study describes the whole-genome sequence of the C. cuniculorum type strain LMG 24588 (=CCUG 56289T).


2018 ◽  
Vol 143 (5) ◽  
pp. 347-362 ◽  
Author(s):  
Gehendra Bhattarai ◽  
Shawn A. Mehlenbacher

From the genome sequence of hazelnut (Corylus avellana), 192 new polymorphic simple sequence repeat (SSR) markers were developed, characterized, and used to investigate genetic diversity in 50 accessions. Next-generation sequencing allows inexpensive sequencing of plant genomes and transcriptomes, and efficient development of polymorphic SSR markers, also known as microsatellite markers, at low cost. A search of the genome sequence of ‘Jefferson’ hazelnut identified 9094 fragments with long repeat motifs of 4, 5, or 6 base pairs (bp), from which polymorphic SSR markers were developed. The repeat regions in the ‘Jefferson’ genome were used as references to which genomic sequence reads of seven additional cultivars were aligned in silico. Visual inspection for variation in repeat number among the aligned reads identified 246 as polymorphic, for which primer pairs were designed. Polymerase chain reaction (PCR) amplification followed by agarose gel separation indicated polymorphism at 195 loci, for which fluorescent forward primers were used to amplify the DNA of 50 hazelnut accessions. Amplicons were post-PCR multiplexed for capillary electrophoresis, and allele sizes were determined for 50 accessions. After eliminating three, 192 were confirmed as polymorphic, and 169 showed only one or two alleles in each of the 50 cultivars, as expected in a diploid. At these 169 SSRs, a total of 843 alleles were found, for an average of 4.99 and a range of 2 to 17 alleles per locus. The mean observed heterozygosity, expected heterozygosity, polymorphism information content, and the frequency of null alleles were 0.51, 0.53, 0.47, and 0.03, respectively. An additional 25 primer pairs produced more than two bands in some accessions with an average of 6.8 alleles. The UPGMA dendrogram revealed a wide genetic diversity and clustered the 50 accessions according to their geographic origin. Of the new SSRs, 132 loci were placed on the linkage map. These new markers will be useful for diversity and parentage studies, cultivar fingerprinting, marker-assisted selection, and aligning the linkage map with scaffolds of the genome sequence.


Sign in / Sign up

Export Citation Format

Share Document