scholarly journals dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs

2020 ◽  
Vol 12 (1) ◽  
Author(s):  
Xiaoming Liu ◽  
Chang Li ◽  
Chengcheng Mou ◽  
Yibo Dong ◽  
Yicheng Tu

AbstractWhole exome sequencing has been increasingly used in human disease studies. Prioritization based on appropriate functional annotations has been used as an indispensable step to select candidate variants. Here we present the latest updates to dbNSFP (version 4.1), a database designed to facilitate this step by providing deleteriousness prediction and functional annotation for all potential nonsynonymous and splice-site SNVs (a total of 84,013,093) in the human genome. The current version compiled 36 deleteriousness prediction scores, including 12 transcript-specific scores, and other variant and gene-level functional annotations. The database is available at http://database.liulab.science/dbNSFP with a downloadable version and a web-service.

2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Ege Ülgen ◽  
Özge Can ◽  
Kaya Bilguvar ◽  
Cemaliye Akyerli Boylu ◽  
Şirin Kılıçturgay Yüksel ◽  
...  

Abstract Background In the clinical setting, workflows for analyzing individual genomics data should be both comprehensive and convenient for clinical interpretation. In an effort for comprehensiveness and practicality, we attempted to create a clinical individual whole exome sequencing (WES) analysis workflow, allowing identification of genomic alterations and presentation of neurooncologically-relevant findings. Methods The analysis workflow detects germline and somatic variants and presents: (1) germline variants, (2) somatic short variants, (3) tumor mutational burden (TMB), (4) microsatellite instability (MSI), (5) somatic copy number alterations (SCNA), (6) SCNA burden, (7) loss of heterozygosity, (8) genes with double-hit, (9) mutational signatures, and (10) pathway enrichment analyses. Using the workflow, 58 WES analyses from matched blood and tumor samples of 52 patients were analyzed: 47 primary and 11 recurrent diffuse gliomas. Results The median mean read depths were 199.88 for tumor and 110.955 for normal samples. For germline variants, a median of 22 (14–33) variants per patient was reported. There was a median of 6 (0–590) reported somatic short variants per tumor. A median of 19 (0–94) broad SCNAs and a median of 6 (0–12) gene-level SCNAs were reported per tumor. The gene with the most frequent somatic short variants was TP53 (41.38%). The most frequent chromosome-/arm-level SCNA events were chr7 amplification, chr22q loss, and chr10 loss. TMB in primary gliomas were significantly lower than in recurrent tumors (p = 0.002). MSI incidence was low (6.9%). Conclusions We demonstrate that WES can be practically and efficiently utilized for clinical analysis of individual brain tumors. The results display that NOTATES produces clinically relevant results in a concise but exhaustive manner.


2021 ◽  
Author(s):  
Peng Tu ◽  
Hairui Sun ◽  
Xiaohang Zhang ◽  
Qian Ran ◽  
suzhen Ran ◽  
...  

Abstract Background: Left ventricular non-compaction cardiomyopathy (LVNC) is a rare congenital heart defect (CHD), genetics defects have been found in patients with LVNC and their family members; and MYH7 is the most common genetic associated with LVNC. Methods: A trio (fetus and the parents) whole-exome sequencing (WES) was performed when the fetus was found with Ebstein's anomaly (EA), heart dilatation, perimembranous ventricular septal defects (VSD), mild seroperitoneum and single umbilical artery (SUA).Results: Whole-exome sequencing identified a maternal inherited heterozygous splice site mutation in MYH7 (NM_000257.3:c.732+1G>A). Subsequent Sanger sequencing confirmed that the mutation was heterozygous in the fetus, the old sister, the grandmother, and the mother. QPCR experiment using RNA from blood lymphocytes but were unable to amplify any product.Conclusion: This familial case underlines that the striking cardiac phenotypic of MYH7 mutation (the c.732+1G>A spice site variant) may be highly variable. The mechanistic studies which could uncover candidate genes modulating cardiac phenotype associated with LVNC/EA should be proceed.


PLoS ONE ◽  
2015 ◽  
Vol 10 (6) ◽  
pp. e0130729 ◽  
Author(s):  
Sara L. Ma ◽  
Virginia Vega-Warner ◽  
Christopher Gillies ◽  
Matthew G. Sampson ◽  
Vijay Kher ◽  
...  

JBMR Plus ◽  
2018 ◽  
Vol 2 (4) ◽  
pp. 235-239 ◽  
Author(s):  
Zixue Jin ◽  
Lindsay C Burrage ◽  
Ming-Ming Jiang ◽  
Yi-Chien Lee ◽  
Terry Bertin ◽  
...  

2019 ◽  
Author(s):  
Mei Sim Lung ◽  
Catherine A. Mitchell ◽  
Maria A. Doyle ◽  
Andrew C. Lynch ◽  
Kylie L. Gorringe ◽  
...  

Abstract Background Familial cases of appendiceal mucinous tumours (AMTs) are extremely rare and the underlying genetic aetiology uncertain. We identified potential predisposing germline genetic variants in a father and daughter with AMTs presenting with pseudomyxoma peritonei (PMP) and correlated these with regions of loss of heterozygosity (LOH) in the tumours. Materials and Methods Through germline whole exome sequencing, we identified novel heterozygous loss-of-function (LoF) (i.e. nonsense, frameshift and essential splice site mutations) and missense variants shared between father and daughter, and validated all LoF variants, and missense variants with a Combined Annotation Dependent Depletion (CADD) scaled score of ≥10. Genome-wide copy number analysis was performed on tumour tissue from both individuals to identify regions of LOH. Results Seventeen novel variants in 17 genes were shared by the father and daughter: a nonsense mutation in REEP5 , an essential splice site mutation in THOP1 , and 15 missense variants. None of these germline variants were located in tumour regions of LOH shared by the father and daughter. Four genes ( EXOG , RANBP2, RANBP6 and TNFRSF1B ) harboured missense variants that fell in a region of LOH in the tumour from the father only, but none showed somatic loss of the wild type allele in the tumour. The REEP5 gene was sequenced in 23 individuals with presumed sporadic PMP; no LoF or rare missense germline variants were identified. Conclusion Germline exome sequencing of a father and daughter with AMTs identified novel candidate predisposing genes. Further studies are required to clarify the role of these genes in familial AMTs.


Blood ◽  
2011 ◽  
Vol 118 (21) ◽  
pp. 2462-2462
Author(s):  
Roberta Spinelli ◽  
Rocco Piazza ◽  
Hima Raman ◽  
Alessandra Pirola ◽  
Simona Valletta ◽  
...  

Abstract Abstract 2462 Point mutations in intronic regions near mRNA splice junctions can affect mRNA splicing, altering the resulting RNA sequence. The molecular characterization of in-frame or out-of-frame splicing variants in cancer samples can potentially assist in the molecular characterization of tumors. The aim of this study was to identify mutations located in the 5' or 3' exon-intron borders that affect RNA splicing using whole-exome sequencing analysis, a technique that targets coding sequences but also include the nearby intronic regions. In order to identify novel (in-frame and out-of-frame) splicing variants in myeloproliferative disorders we developed a bioinformatics procedure ‘Splice-Site Prediction Procedure to analyze Next Generation Sequencing data’ (SSPP-NGS). The SSPP-NGS bioinformatics method is an integration of two functional annotation tools for high-throughput sequencing data, ANNOVAR and MutationTaster and two canonical splice-site analysis tools, NetGene2 and Neural Network Promoter Prediction Tool (NNPPT). In addition, to assess the phenotypic effects of intronic mutations on mRNA splicing we combined DNA mutational screening analysis with RNA-Seq mediated gene expression profiling. Whole genome expression analysis was performed by using TopHat and Cufflinks: the first one is a splice junction mapper for RNA-Seq experiments able to mapp the reads against the junction to confirm them; the second one estimates gene expression, isoform-level expression, transcript abundance, differential gene expression and splicing. We used ANNOVAR and MutationTaster based on statistical Naive Bayes classifier to predict the non-coding mutations that affected physiological splicing. We then confirmed the results by queering NetGene2 and NNPPT using default parameters. Only the predictions found in all three programs were accepted as putative splicing variants and sequenced by Sanger method. We applied the entire procedure to whole exome sequencing data from 1 Ph+ leukemic patient sample (>80% myeloid cells) matched to autologous normal lymphocytes: on average, 70 million of paired-end reads and 5.2 gigabases (Gb) of sequences were generated per sample. A total of 177 candidate somatic point mutations (with minimum read depth of 20, minimum percent of substitution equal to 25% and minimum average Phred quality score of 30, corresponding to an accuracy of 99.9%, confirmed by at least 6 individual sequences) were found: 82/177 annotated in coding regions and 95/177 in non-coding regions. In particular 5/95 were located within 10-bp from a splicing junction. SSPS-NGS prediction analysis suggested the presence of 1/5 potential splicing site (predicting a loss of physiologic donor splicing site), while 4/5 were annotated as polymorphisms. The hypothetical splicing variant was located near the 5' donor splice site at position +1 in the intron between exons 5 and 6 of the GNAQ gene (IVS5+1C->T); it was present with a frequency of mutation of 35%, corresponding to its heterozygous presence in 88% of cells. The presence of this heterozygous mutation was confirmed by Sanger method. SSPS-NGS allowed us to focus on transcriptional analysis of this gene. RNA-seq analysis showed that 73% of GNAQ mRNA effectively skipped the upstream exon 5, resulting in a 4 to 6 frameshift fusion, which likely destroys the GTPase activity of GNAQ. No evidence of GNAQ exon 5 deleted RNA was found in additional 7 patients analyzed who lacked the intronic mutation. We extended the SSPN-NGS analysis to 7 myeloproliferative patients analyzed by exome sequencing. Three novel heterozygous splicing variants were identified, affecting the HOOK1, SMAD9 and DNAH9 genes. All mutations were confirmed by Sanger method. SSPS-NSG analysis predicted 1 loss of donor site in-frame (DNAH9) and 2 loss of acceptor splice site out of frame (HOOK1 and SMAD9), in one case with an activation of a new cryptic splicing site (HOOK1). RNA-seq analysis is in progress. In conclusion, the work presented here showed the applicability of SSPPs-NGS to whole-exome sequencing data as a tool to complement exome analysis, in order to identify novel splicing variants. Disclosures: No relevant conflicts of interest to declare.


2016 ◽  
Vol 56 (3) ◽  
pp. 135-137 ◽  
Author(s):  
Fatma Bastaki ◽  
Madiha Mohamed ◽  
Pratibha Nair ◽  
Fatima Saif ◽  
Nafisa Tawfiq ◽  
...  

2015 ◽  
Vol 5 (1) ◽  
Author(s):  
Hao Li ◽  
Hebing Chen ◽  
Feng Liu ◽  
Chao Ren ◽  
Shengqi Wang ◽  
...  

2021 ◽  
Author(s):  
Duncan S Palmer ◽  
Daniel P Howrigan ◽  
Sinéad B Chapman ◽  
Rolf Adolfsson ◽  
Nick Bass ◽  
...  

AbstractHere we report results from the Bipolar Exome (BipEx) collaboration analysis of whole exome sequencing of 13,933 individuals diagnosed with bipolar disorder (BD), matched with 14,422 controls. We find an excess of ultra-rare protein-truncating variants (PTVs) in BD patients among genes under strong evolutionary constraint, a signal evident in both major BD subtypes, bipolar 1 disorder (BD1) and bipolar 2 disorder (BD2). We also find an excess of ultra-rare PTVs within genes implicated from a recent schizophrenia exome meta-analysis (SCHEMA; 24,248 SCZ cases and 97,322 controls) and among binding targets of CHD8. Genes implicated from GWAS of BD, however, are not significantly enriched for ultra-rare PTVs. Combining BD gene-level results with SCHEMA, AKAP11 emerges as a definitive risk gene (ultra-rare PTVs seen in 33 cases and 13 controls, OR = 7.06, P = 2.83 × 10−9). At the protein level, AKAP-11 is known to interact with GSK3B, the hypothesized mechanism of action for lithium, one of the few treatments for BD. Overall, our results lend further support to the polygenic basis of BD and demonstrate a role for rare coding variation as a significant risk factor in BD onset.


Sign in / Sign up

Export Citation Format

Share Document