scholarly journals Annotating and prioritising genomic variants using the Ensembl Variant Effect Predictor ‐ a tutorial

2021 ◽  
Author(s):  
Sarah E. Hunt ◽  
Benjamin Moore ◽  
Ridwan M. Amode ◽  
Irina M. Armean ◽  
Diana Lemos ◽  
...  
Author(s):  
Sarah Hunt ◽  
Benjamin Moore ◽  
M. Amode ◽  
Irina Armean ◽  
Diana Lemos ◽  
...  

The Ensembl Variant Effect Predictor (VEP) is a freely available, open source tool for the annotation and filtering of genomic variants. It predicts variant molecular consequence using the Ensembl/GENCODE or RefSeq gene sets. It also reports phenotype associations from databases such as ClinVar, allele frequencies from studies including gnomAD, and predictions of deleteriousness from tools such as SIFT and CADD. Ensembl VEP includes filtering options to customise variant prioritisation. It is well supported and updated roughly quarterly to incorporate the latest gene, variant and phenotype association information. Ensembl VEP analysis can be performed using a highly configurable, extensible command-line tool, a Representational State Transfer (REST) application programming interface (API) and a user-friendly web interface. These access methods are designed to suit different levels of bioinformatics experience and meet different needs in terms of data size, visualisation and flexibility. In this tutorial, we will describe performing variant annotation using the Ensembl VEP web tool, which enables sophisticated analysis through a simple interface.


2016 ◽  
Author(s):  
William McLaren ◽  
Laurent Gil ◽  
Sarah E Hunt ◽  
Harpreet Singh Riat ◽  
Graham R. S. Ritchie ◽  
...  

The Ensembl Variant Effect Predictor (VEP) is a powerful toolset for the analysis, annotation and prioritization of genomic variants, including in non-coding regions. The VEP accurately predicts the effects of sequence variants on transcripts, protein products, regulatory regions and binding motifs by leveraging the high quality, broad scope, and integrated nature of the Ensembl databases. In addition, it enables comparison with a large collection of existing publicly available variation data within Ensembl to provide insights into population and ancestral genetics, phenotypes and disease. The VEP is open source and free to use. It is available via a simple web interface (http://www.ensembl.org/vep), a powerful downloadable package, and both Ensembl’s Perl and REST application program interface (API) services.


Author(s):  
Yashvant Khimsuriya ◽  
Salil Vaniyawala ◽  
Babajan Banaganapalli ◽  
Muhammadh Khan ◽  
Ramu Elango ◽  
...  

2018 ◽  
Vol 35 (13) ◽  
pp. 2315-2317 ◽  
Author(s):  
Jannah Shamsani ◽  
Stephen H Kazakoff ◽  
Irina M Armean ◽  
Will McLaren ◽  
Michael T Parsons ◽  
...  

Abstract Summary Assessing the pathogenicity of genetic variants can be a complex and challenging task. Spliceogenic variants, which alter mRNA splicing, may yield mature transcripts that encode non-functional protein products, an important predictor of Mendelian disease risk. However, most variant annotation tools do not adequately assess spliceogenicity outside the native splice site and thus the disease-causing potential of variants in other intronic and exonic regions is often overlooked. Here, we present a plugin for the Ensembl Variant Effect Predictor that packages MaxEntScan and extends its functionality to provide splice site predictions using a maximum entropy model. The plugin incorporates a sliding window algorithm to predict splice site loss or gain for any variant that overlaps a transcript feature. We also demonstrate the utility of the plugin by comparing our predictions to two mRNA splicing datasets containing several cancer-susceptibility genes. Availability and implementation Source code is freely available under the Apache License, Version 2.0: https://github.com/Ensembl/VEP_plugins. Supplementary information Supplementary data are available at Bioinformatics online.


2014 ◽  
Vol 16 (2) ◽  
pp. 255-264 ◽  
Author(s):  
Michael Yourshaw ◽  
S. Paige Taylor ◽  
Aliz R. Rao ◽  
Martín G. Martín ◽  
Stanley F. Nelson

2016 ◽  
Vol 17 (1) ◽  
Author(s):  
William McLaren ◽  
Laurent Gil ◽  
Sarah E. Hunt ◽  
Harpreet Singh Riat ◽  
Graham R. S. Ritchie ◽  
...  

2019 ◽  
Vol 40 (9) ◽  
pp. 1486-1494 ◽  
Author(s):  
Maximilian Miller ◽  
Yanran Wang ◽  
Yana Bromberg

2020 ◽  
Author(s):  
Nishadi H. De Silva ◽  
Jyothish Bhai ◽  
Marc Chakiachvili ◽  
Bruno Contreras-Moreira ◽  
Carla Cummins ◽  
...  

ABSTRACTThe Ensembl COVID-19 browser (covid-19.ensembl.org) was launched in May 2020 in response to the ongoing pandemic. It is Ensembl’s contribution to the global efforts to develop treatments, diagnostics and vaccines for COVID-19, and it supports research into the genomic epidemiology and evolution of the SARS-CoV-2 virus. This freely available resource incorporates a new Ensembl gene set, multiple sets of variants, and alignments of annotation from several resources against the reference assembly for SARS-CoV-2. It represents the first virus to be encompassed within the Ensembl platform. Additional data are being continually integrated via our new rapid release protocols alongside tools such as the Ensembl Variant Effect Predictor. Here we describe the data and infrastructure behind the resource and discuss future work.


2016 ◽  
Author(s):  
Jennifer Yen ◽  
Sarah Garcia ◽  
Aldrin Montana ◽  
Jason Harris ◽  
Steven Chervitz ◽  
...  

ABSTRACTBackgroundClinical genomic testing is dependent on the robust identification and reporting of variant-level information in relation to disease. With the shift to high-throughput sequencing, a major challenge for clinical diagnostics is the cross-identification of variants called on their genomic position to resources that rely on transcript- or protein-based descriptions.MethodsWe evaluated the accuracy of three tools (SnpEff, Variant Effect Predictor and Variation Reporter) that generate transcript and protein-based variant nomenclature from genomic coordinates according to guidelines by the Human Genome Variation Society (HGVS). Our evaluation was based on comparisons to a manually curated list of 127 test variants of various types drawn from data sources, each with HGVS-compliant transcript and protein descriptors. We further evaluated the concordance between annotations generated by Snpeff and Variant Effect Predictor with those in major germline and cancer databases: ClinVar and COSMIC, respectively.ResultsWe find that there is substantial discordance between the annotation tools and databases in the description of insertion and/or deletions. Accuracy based on our ground truth set was between 80-90% for coding and 50-70% for protein variants, numbers that are not adequate for clinical reporting. Exact concordance for SNV syntax was over 99.5% between ClinVar and Variant Effect Predictor (VEP) and SnpEff, but less than 90% for non-SNV variants. For COSMIC, exact concordance for coding and protein SNVs were between 65 and 88%, and less than 15% for insertions. Across the tools and datasets, there was a wide range of equivalent expressions describing protein variants.ConclusionOur results reveal significant inconsistency in variant representation across tools and databases. These results highlight the urgent need for the adoption and adherence to uniform standards in variant annotation, with consistent reporting on the genomic reference, to enable accurate and efficient data-driven clinical care.


Sign in / Sign up

Export Citation Format

Share Document