Bacterial artificial chromosome clones randomly selected for sequencing reveal genomic differences between soybean cultivars

2018 ◽  
Vol 69 (2) ◽  
pp. 131
Author(s):  
Tingting He ◽  
Longshu Yang ◽  
Xianlong Ding ◽  
Linfeng Chen ◽  
Yanwei Li ◽  
...  

This study pioneered the use of multiple technologies to combine the bacterial artificial chromosome (BAC) pooling strategy with high-throughput next- and third-generation sequencing technologies to analyse genomic difference. To understand the genetic background of the Chinese soybean cultivar N23601, we built a BAC library and sequenced 10 randomly selected clones followed by de novo assembly. Comparative analysis was conducted against the reference genome of Glycine max var. Williams 82 (2.0). Therefore, our result is an assessment of the reference genome. Our results revealed that 3517 single nucleotide polymorphisms (SNPs) and 662 insertion–deletions (InDels) occurred in ~1.2 Mb of the genomic region and that four of the 10 BAC clones contained 15 large structural variations (72 887 bp) compared with the reference genome. Gene annotation of the reference genome showed that Glyma.18g181000 was missing from the corresponding position of the 10 BAC clones. Additionally, there may be a problem with the assembly of some positions of the reference genome. Several gap regions in the reference genome could be supplemented by using the complete sequence of the 10 BAC clones. We believe that accurate and complete BAC sequence is a valuable resource that contributes to the completeness of the reference genome.

Genome ◽  
2004 ◽  
Vol 47 (2) ◽  
pp. 239-245 ◽  
Author(s):  
Yaping Qian ◽  
Li Jin ◽  
Bing Su

The large-insert genomic DNA library is a critical resource for genome-wide genetic dissection of target species. We constructed a high-redundancy bacterial artificial chromosome (BAC) library of a New World monkey species, the black-handed spider monkey (Ateles geoffroyi). A total of 193 152 BAC clones were generated in this library. The average insert size of the BAC clones was estimated to be 184.6 kb with the small inserts (50-100 kb) accounting for less than 3% and the non-recombinant clones only 1.2%. Assuming a similar genome size with humans, the spider monkey BAC library has about 11× genome coverage. In addition, by end sequencing of randomly selected BAC clones, we generated 367 sequence tags for the library. When blasted against human genome, they showed a good correlation between the number of hit clones and the size of the chromosomes, an indication of unbiased chromosomal distribution of the library. This black-handed spider monkey BAC library would serve as a valuable resource in comparative genomic study and large-scale genome sequencing of nonhuman primates.Key words: black-handed spider monkeys, Ateles geoffroyi, BAC library.


1998 ◽  
Vol 66 (5) ◽  
pp. 2221-2229 ◽  
Author(s):  
Roland Brosch ◽  
Stephen V. Gordon ◽  
Alain Billault ◽  
Thierry Garnier ◽  
Karin Eiglmeier ◽  
...  

ABSTRACT The bacterial artificial chromosome (BAC) cloning system is capable of stably propagating large, complex DNA inserts in Escherichia coli. As part of the Mycobacterium tuberculosis H37Rv genome sequencing project, a BAC library was constructed in the pBeloBAC11 vector and used for genome mapping, confirmation of sequence assembly, and sequencing. The library contains about 5,000 BAC clones, with inserts ranging in size from 25 to 104 kb, representing theoretically a 70-fold coverage of the M. tuberculosisgenome (4.4 Mb). A total of 840 sequences from the T7 and SP6 termini of 420 BACs were determined and compared to those of a partial genomic database. These sequences showed excellent correlation between the estimated sizes and positions of the BAC clones and the sizes and positions of previously sequenced cosmids and the resulting contigs. Many BAC clones represent linking clones between sequenced cosmids, allowing full coverage of the H37Rv chromosome, and they are now being shotgun sequenced in the framework of the H37Rv sequencing project. Also, no chimeric, deleted, or rearranged BAC clones were detected, which was of major importance for the correct mapping and assembly of the H37Rv sequence. The minimal overlapping set contains 68 unique BAC clones and spans the whole H37Rv chromosome with the exception of a single gap of ∼150 kb. As a postgenomic application, the canonical BAC set was used in a comparative study to reveal chromosomal polymorphisms between M. tuberculosis, M. bovis, and M. bovis BCG Pasteur, and a novel 12.7-kb segment present in M. tuberculosis but absent from M. bovis and M. bovis BCG was characterized. This region contains a set of genes whose products show low similarity to proteins involved in polysaccharide biosynthesis. The H37Rv BAC library therefore provides us with a powerful tool both for the generation and confirmation of sequence data as well as for comparative genomics and other postgenomic applications. It represents a major resource for present and future M. tuberculosis research projects.


2012 ◽  
Vol 2012 ◽  
pp. 1-7 ◽  
Author(s):  
Peter J. Maughan ◽  
Scott M. Smith ◽  
Joshua A. Raney

Bacterial artificial chromosome (BAC) libraries are critical for identifying full-length genomic sequences, correlating genetic and physical maps, and comparative genomics. Here we describe the utilization of the Fluidigm access array genotyping system in conjunction with KASPar genotyping technology to identify individual BAC clones corresponding to specific single-nucleotide polymorphisms (SNPs) from an Amplicon Express seven-plate super pooledAmaranthus hypochondriacusBAC library. Ninety-six SNP loci, spanning the length ofA. hypochondriacuslinkage groups 1, 2, and 15, were simultaneously tested for clone identification from four BAC super pools, corresponding to 28 384-well plates, using a single Fluidigm integrated fluidic chip (IFC). Forty-six percent of the SNPs were associated with a single unambiguous identified BAC clone. PCR amplification and next-generation sequencing of individual BAC clones confirmed the IFC clone identification. Utilization of the Fluidigm Dynamic array platform allowed for the simultaneous PCR screening of 10,752 BAC pools for 96 SNP tag sites in less than three hours at a cost of~$0.05per reaction.


Genome ◽  
2017 ◽  
Vol 60 (4) ◽  
pp. 310-324 ◽  
Author(s):  
Brad S. Coates ◽  
Craig A. Abel ◽  
Omaththage P. Perera

The lepidopteran pest insect Helicoverpa zea feeds on cultivated corn and cotton across the Americas where control remains challenging owing to the evolution of resistance to chemical and transgenic insecticidal toxins, yet genomic resources remain scarce for this species. A bacterial artificial chromosome (BAC) library having a mean genomic insert size of 145 ± 20 kbp was created from a laboratory strain of H. zea, which provides ∼12.9-fold coverage of a 362.8 ± 8.8 Mbp (0.37 ± 0.09 pg) flow cytometry estimated haploid genome size. Assembly of Illumina HiSeq 2000 reads generated from 14 pools that encompassed all BAC clones resulted in 165 485 genomic contigs (N50 = 3262 bp; 324.6 Mbp total). Long terminal repeat (LTR) protein coding regions annotated from 181 contigs included 30 Ty1/copia, 78 Ty3/gypsy, and 73 BEL/Pao elements, of which 60 (33.1%) encoded all five functional polyprotein (pol) domains. Approximately 14% of LTR elements are distributed non-randomly across pools of BAC clones.


Genome ◽  
2000 ◽  
Vol 43 (1) ◽  
pp. 199-204 ◽  
Author(s):  
Junqi Song ◽  
Fenggao Dong ◽  
Jiming Jiang

Lack of reliable techniques for chromosome identification is the major obstacle for cytogenetics research in plant species with large numbers of small chromosomes. To promote molecular cytogenetics research of potato (Solanum tuberosum, 2n = 4x = 48) we developed a bacterial artificial chromosome (BAC) library of a diploid potato species S. bulbocastanum. The library consists of 23 808 clones with an average insert size of 155 kb, and represents approximately 3.7 equivalents to the potato genome. The majority of the clones in the BAC library generated distinct signals on specific potato chromosomes using fluorescence in situ hybridization (FISH). The hybridization signals provide excellent cytological markers to tag individual potato chromosomes. We also demonstrated that the BAC clones can be mapped to specific positions on meiotic pachytene chromosomes. The excellent resolution of pachytene FISH can be used to construct a physical map of potato by mapping molecular marker-targeted BAC clones on pachytene chromosomes. Key words: potato, BAC library, chromosome identification, physical mapping, molecular cytogenetics.


2011 ◽  
Vol 2011 ◽  
pp. 1-5 ◽  
Author(s):  
Yan Hu ◽  
Yamin Lu ◽  
Dan Ma ◽  
Wangzhen Guo ◽  
Tianzhen Zhang

A bacterial artificial chromosome (BAC) library for the A-genome of cotton has been constructed from the leaves ofG. arboreumL cv. Jianglinzhongmian. It is used as elite A-genome germplasm resources in the present cotton breeding program and has been used to build a genetic reference map of cotton. The BAC library consists of 123,648 clones stored in 322 384-well plates. Statistical analysis of a set of 103 randomly selected BAC clones indicated that each clone has an average insert length of 100.2 kb per plasmid, with a range of 30 to 190 kb. Theoretically, this represents 7.2 haploid genome equivalents based on an A-genome size of 1697 Mb. The BAC library has been arranged in column pools and superpools allowing screening with various PCR-based markers. In the future, the A-genome cotton BAC library will serve as both a giant gene resource and a valuable tool for map-based gene isolation, physical mapping and comparative genome analysis.


Genome ◽  
2004 ◽  
Vol 47 (6) ◽  
pp. 1182-1191 ◽  
Author(s):  
Jan Šafář ◽  
Juan Carlos Noa-Carrazana ◽  
Jan Vrána ◽  
Jan Bartoš ◽  
Olena Alkhimova ◽  
...  

The first bacterial artificial chromosome (BAC) library of the banana species Musa balbisiana 'Pisang Klutuk Wulung' (PKW BAC library) was constructed and characterized. One improved and one novel protocol for nuclei isolation were employed to overcome problems caused by high levels of polyphenols and polysaccharides present in leaf tissues. The use of flow cytometry to purify cell nuclei eliminated contamination with secondary metabolites and plastid DNA. Furthermore, the usefulness of the inducible pCC1BAC vector to obtain a higher amount of BAC DNA was demonstrated. The PKW BAC library represents nine haploid genome equivalents of M. balbisiana and its mean insert size is 135 kb. It consists of two sublibraries, of which the first one (SN sublibrary with 24 960 clones) was prepared according to an improved standard nuclei isolation protocol, whereas the second (FN sublibrary with 11 904 clones) was obtained from flow-sorted nuclei. Screening with 12 RFLP probes, which were genetically anchored to 8 genetic linkage groups of the banana species Musa acuminata, revealed an average of 11 BAC clones per probe, thus confirming the genome coverage estimated based on the insert size, as well as a high level of conservation between the two species of Musa. Localization of selected BAC clones to mitotic chromosomes using FISH indicated that the BAC library represented a useful resource for cytogenetic mapping. As the first step in map-based cloning of a genetic factor that is involved in the activation of integrated pararetroviral sequences of Banana streak virus (BSV), the BSV expressed locus (BEL) was physically delimited. The PKW BAC library represents a publicly available tool, and is currently used to reveal the integration and activation mechanisms of BSV sequences and to study banana genome structure and evolution.Key words: bacterial artificial chromosome library, banana, BAC-FISH, flow cytometry, Musa balbisiana, Banana streak virus, BSV.


2021 ◽  
Author(s):  
Myung-Shin Kim ◽  
Taeyoung Lee ◽  
Jeonghun Baek ◽  
Ji Hong Kim ◽  
Changhoon Kim ◽  
...  

AbstractMassive resequencing efforts have been undertaken to catalog allelic variants in major crop species including soybean, but the scope of the information for genetic variation often depends on short sequence reads mapped to the extant reference genome. Additional de novo assembled genome sequences provide a unique opportunity to explore a dispensable genome fraction in the pan-genome of a species. Here, we report the de novo assembly and annotation of Hwangkeum, a popular soybean cultivar in Korea. The assembly was constructed using PromethION nanopore sequencing data and two genetic maps, and was then error-corrected using Illumina short-reads and PacBio SMRT reads. The 933.12 Mb assembly was annotated 79,870 transcripts for 58,550 genes using RNA-Seq data and the public soybean annotation set. Comparison of the Hwangkeum assembly with the Williams 82 soybean reference genome sequence revealed 1.8 million single-nucleotide polymorphisms, 0.5 million indels, and 25 thousand putative structural variants. However, there was no natural megabase-scale chromosomal rearrangement. Incidentally, by adding two novel groups, we found that soybean contains four clearly separated groups of centromeric satellite repeats. Analyses of satellite repeats and gene content suggested that the Hwangkeum assembly is a high-quality assembly. This was further supported by comparison of the marker arrangement of anthocyanin biosynthesis genes and of gene arrangement at the Rsv3 locus. Therefore, the results indicate that the de novo assembly of Hwangkeum is a valuable additional reference genome resource for characterizing traits for the improvement of this important crop species.


Sign in / Sign up

Export Citation Format

Share Document