reference assembly
Recently Published Documents


TOTAL DOCUMENTS

29
(FIVE YEARS 11)

H-INDEX

8
(FIVE YEARS 3)

2021 ◽  
Author(s):  
Andrea Talenti ◽  
Jessica Powell ◽  
Johanneke D Hemmink ◽  
Elizabeth AJ Cook ◽  
David Wragg ◽  
...  

Despite only 8% of cattle being found in Europe, European breeds dominate current genetic resources. This adversely impacts cattle research in other important global cattle breeds. To mitigate this issue, we have generated the first assemblies of African breeds, which have been integrated with genomic data for 294 diverse cattle into the first graph genome that incorporates global cattle diversity. We illustrate how this more representative reference assembly contains an extra 116.1Mb (4.2%) of sequence absent from the current Hereford sequence and consequently inaccessible to current studies. We further demonstrate how using this graph genome increases read mapping rates, reduces allelic biases and improves the agreement of structural variant calling with independent optical mapping data. Consequently, we present an improved, more representative, reference assembly that will improve global cattle research.


F1000Research ◽  
2021 ◽  
Vol 10 ◽  
pp. 281
Author(s):  
Kevin McKernan ◽  
Liam T. Kane ◽  
Seth Crawford ◽  
Chen-Shan Chin ◽  
Aaron Trippe ◽  
...  

We describe the use of high-fidelity single molecule sequencing to assemble the genome of the psychoactive Psilocybe cubensis mushroom. The genome is 46.6Mb, 46% GC, and in 32 contigs with an N50 of 3.3Mb. The BUSCO completeness scores are 97.6% with 1.2% duplicates. The Psilocybin synthesis cluster exists in a single 3.2Mb contig. The dataset is available from NCBI BioProject with accessions PRJNA687911 and PRJNA700437.


2021 ◽  
Author(s):  
Chunlin Xiao ◽  
Zhong Chen ◽  
Wanqiu Chen ◽  
Cory Padilla ◽  
Li-Tai Fang ◽  
...  

The use of personalized genome assembly as a reference for detecting the full spectrum of somatic events from cancers has long been advocated but never been systematically investigated. Here we address the critical need of assessing the accuracy of somatic mutation detection using personalized genome assembly versus the standard human reference assembly (i.e. GRCh38). We first obtained massive whole genome sequencing data using multiple sequencing technologies, and then performed de novo assembly of the first tumor-normal paired genomes, both nuclear and mitochondrial, derived from the same donor with triple negative breast cancer. Compared to standard human reference assembly, the haplotype phased chromosomal-scale personalized genome was best demonstrated with individual specific haplotypes for some complex regions and medical relevant genes. We then used this well-assembled personalized genome as a reference for read mapping and somatic variant discovery. We showed that the personalized genome assembly results in better alignments of sequencing reads and more accurate somatic mutation calls. Direct comparison of mitochondrial genomes led to discovery of unreported nonsynonymous somatic mutations. Our findings provided a unique resource and proved the necessity of personalized genome assembly as a reference in improving somatic mutation detection at personal genome level not only for breast cancer reference samples, but also potentially for other cancers.


2020 ◽  
Author(s):  
Nishadi H. De Silva ◽  
Jyothish Bhai ◽  
Marc Chakiachvili ◽  
Bruno Contreras-Moreira ◽  
Carla Cummins ◽  
...  

ABSTRACTThe Ensembl COVID-19 browser (covid-19.ensembl.org) was launched in May 2020 in response to the ongoing pandemic. It is Ensembl’s contribution to the global efforts to develop treatments, diagnostics and vaccines for COVID-19, and it supports research into the genomic epidemiology and evolution of the SARS-CoV-2 virus. This freely available resource incorporates a new Ensembl gene set, multiple sets of variants, and alignments of annotation from several resources against the reference assembly for SARS-CoV-2. It represents the first virus to be encompassed within the Ensembl platform. Additional data are being continually integrated via our new rapid release protocols alongside tools such as the Ensembl Variant Effect Predictor. Here we describe the data and infrastructure behind the resource and discuss future work.


Genetics ◽  
2020 ◽  
Vol 216 (2) ◽  
pp. 599-608 ◽  
Author(s):  
Michael Alonge ◽  
Alaina Shumate ◽  
Daniela Puiu ◽  
Aleksey V. Zimin ◽  
Steven L. Salzberg

Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of nongap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered >5700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus.


2020 ◽  
Author(s):  
Michael Alonge ◽  
Alaina Shumate ◽  
Daniela Puiu ◽  
Aleksey Zimin ◽  
Steven L. Salzberg

ABSTRACTBread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all 3 wheat subgenomes at chromosome scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 gigabases of genomic sequence. We earlier published an independent wheat assembly (Triticum 3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC 1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum 4.0, contains 15.07 gigabases of non-gap sequence anchored to chromosomes, which is 1.2 gigabases more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2000 genes that were previously unplaced. We also discovered more than 5700 new genes, all of them duplications in the Chinese Spring genome that are missing from the IWGSC assembly and annotation. The Triticum 4.0 assembly and annotations are freely available at www.ncbi.nlm.nih.gov/bioproject/PRJNA392179.


2020 ◽  
Vol 10 (6) ◽  
pp. 1823-1827 ◽  
Author(s):  
Miriam Schreiber ◽  
Martin Mascher ◽  
Jonathan Wright ◽  
Sudharasan Padmarasu ◽  
Axel Himmelbach ◽  
...  

Barley (Hordeum vulgare) is one of the most important crops worldwide and is also considered a research model for the large-genome small grain temperate cereals. Despite genomic resources improving all the time, they are limited for the cv. Golden Promise, the most efficient genotype for genetic transformation. We have developed a barley cv. Golden Promise reference assembly integrating Illumina paired-end reads, long mate-pair reads, Dovetail Chicago in vitro proximity ligation libraries and chromosome conformation capture sequencing (Hi-C) libraries into a contiguous reference assembly. The assembled genome of 7 chromosomes and 4.13Gb in size, has a super-scaffold N50 after Chicago libraries of 4.14Mb and contains only 2.2% gaps. Using BUSCO (benchmarking universal single copy orthologous genes) as evaluation the genome assembly contains 95.2% of complete and single copy genes from the plant database. A high-quality Golden Promise reference assembly will be useful and utilized by the whole barley research community but will prove particularly useful for CRISPR-Cas9 experiments.


2020 ◽  
Author(s):  
Miriam Schreiber ◽  
Martin Mascher ◽  
Jonathan Wright ◽  
Sudharasan Padmarasu ◽  
Axel Himmelbach ◽  
...  

AbstractBackgroundBarley (Hordeum vulgare) is one of the most important crops worldwide and is also considered a research model for the large-genome small grain temperate cereals. Despite genomic resources improving all the time, they are limited for the cv. Golden Promise, the most efficient genotype for genetic transformation.FindingsWe have developed a barley cv. Golden Promise reference assembly integrating Illumina paired-end reads, long mate-pair reads, Dovetail Chicago in vitro proximity ligation libraries and chromosome conformation capture sequencing (Hi-C) libraries into a contiguous reference assembly. The assembled genome of 7 chromosomes and 4.13Gb in size, has a super-scaffold N50 after Chicago libraries of 4.14Mb and contains only 2.2% gaps. Using BUSCO (benchmarking universal single copy orthologous genes) as evaluation the genome assembly contains 95.2% of complete and single copy genes from the plant database.ConclusionsA high-quality Golden Promise reference assembly will be useful and utilised by the whole barley research community but will prove particularly useful for CRISPR-Cas9 experiments.


2019 ◽  
Vol 10 ◽  
Author(s):  
Liandong Yang ◽  
Ying Wang ◽  
Tai Wang ◽  
Shengchang Duan ◽  
Yang Dong ◽  
...  
Keyword(s):  

2019 ◽  
Vol 11 (5) ◽  
pp. 1483-1486 ◽  
Author(s):  
Gregg W C Thomas ◽  
Matthew W Hahn

Sign in / Sign up

Export Citation Format

Share Document