StabilitySort: assessment of protein stability changes on a genome-wide scale to prioritise potentially pathogenic genetic variation

2021 ◽  
Author(s):  
Aaron Chuah ◽  
Sean Li ◽  
Andrea Do ◽  
Matt A Field ◽  
T. Daniel Andrews

Abstract Summary: Missense mutations that change protein stability are strongly associated with human inherited genetic disease. With the recent availability of predicted structures for all human proteins generated using the AlphaFold2 prediction model, genome-wide assessment of the stability effects of genetic variation can, for the first time, be easily performed. This facilitates the interrogation of personal genetic variation for potentially pathogenic effects through the application of stability metrics. Here, we present a novel algorithm to prioritise variants predicted to strongly destabilise essential proteins, available as both a standalone software package and a web-based tool. We demonstrate the utility of this tool by showing that at values of the Stability Sort Z-score above 1.6, pathogenic, protein-destabilising variants from ClinVar are detected at a 58% enrichment, over and above the destabilising (but presumably non-pathogenic) variation already present in the HapMap NA12878 genome. Availability and Implementation: StabilitySort is available as a web service (http://130.56.244.113/StabilitySort/) and can be deployed as a standalone system (https://gitlab.com/baaron/StabilitySort).

Nature Plants ◽  
2021 ◽  
Vol 7 (10) ◽  
pp. 1330-1334
Author(s):  
Sophie Watts ◽  
Michel McElroy ◽  
Zoë Migicovsky ◽  
Hugo Maassen ◽  
Robin van Velzen ◽  
...  

Abstract Analysis of over 100 Cannabis samples quantified for terpene and cannabinoid content and genotyped for over 100,000 single nucleotide polymorphisms indicated that Sativa- and Indica-labelled samples were genetically indistinct on a genome-wide scale. Instead, we found that Cannabis labelling was associated with variation in a small number of terpenes whose concentrations are controlled by genetic variation at tandem arrays of terpene synthase genes.


Blood ◽  
2009 ◽  
Vol 114 (23) ◽  
pp. 4843-4846 ◽  
Author(s):  
Dalemari Crowther-Swanepoel ◽  
Mobshra Qureshi ◽  
Martin J. S. Dyer ◽  
Estella Matutes ◽  
Claire Dearden ◽  
...  

Abstract A genome-wide linkage scan has provided evidence for a chronic lymphocytic leukemia (CLL) susceptibility locus at 2q21 to which the chemokine receptor CXCR4 gene maps. Recent data provide some evidence for common variation in CXCR4 according to the polymorphic variant rs2228014 defining CLL risk. To examine the role of genetic variation in CXCR4 on CLL risk, we screened 188 familial CLL cases and 213 controls for germline mutations in the coding regions of CXCR4 and genotyped rs2228014 in 1058 CLL cases and 1807 controls. No association between rs2228014 and risk of CLL was seen (P = .83). One truncating (W195X) and 2 missense mutations with possible functional consequences (V139I and G335S) were identified among 186 familial cases and 0 in 213 controls sequenced. Our analysis provides no evidence that common variation in CXCR4 defined by rs2228014 influences the risk of CLL, but suggests that functional coding mutations in CXCR4 may contribute to familial CLL.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Kelly B. Klingler ◽  
Joshua P. Jahner ◽  
Thomas L. Parchman ◽  
Chris Ray ◽  
Mary M. Peacock

Abstract Background: Distributional responses by alpine taxa to repeated, glacial-interglacial cycles throughout the last two million years have significantly influenced the spatial genetic structure of populations. These effects have been exacerbated for the American pika (Ochotona princeps), a small alpine lagomorph constrained by thermal sensitivity and a limited dispersal capacity. As a species of conservation concern, long-term lack of gene flow has important consequences for landscape genetic structure and levels of diversity within populations. Here, we use reduced representation sequencing (ddRADseq) to provide a genome-wide perspective on patterns of genetic variation across pika populations representing distinct subspecies. To investigate how landscape and environmental features shape genetic variation, we collected genetic samples from distinct geographic regions as well as across finer spatial scales in two geographically proximate mountain ranges of eastern Nevada. Results: Our genome-wide analyses corroborate range-wide, mitochondrial subspecific designations and reveal pronounced fine-scale population structure between the Ruby Mountains and East Humboldt Range of eastern Nevada. Populations in Nevada were characterized by low genetic diversity (π = 0.0006–0.0009; θW = 0.0005–0.0007) relative to populations in California (π = 0.0014–0.0019; θW = 0.0011–0.0017) and the Rocky Mountains (π = 0.0025–0.0027; θW = 0.0021–0.0024), indicating substantial genetic drift in these isolated populations. Tajima’s D was positive for all sites (D = 0.240–0.811), consistent with recent contraction in population sizes range-wide. Conclusions: Substantial influences of geography, elevation and climate variables on genetic differentiation were also detected and may interact with the regional effects of anthropogenic climate change to force the loss of unique genetic lineages through continued population extirpations in the Great Basin and Sierra Nevada.


Genetics ◽  
2001 ◽  
Vol 157 (1) ◽  
pp. 283-294 ◽  
Author(s):  
Kristie Ashton ◽  
Ana Patricia Wagoner ◽  
Roland Carrillo ◽  
Greg Gibson

Abstract Drosophila melanogaster appears to be well suited as a model organism for quantitative pharmacogenetic analysis. A genome-wide deficiency screen for haploinsufficient effects on prepupal heart rate identified nine regions of the genome that significantly reduce (five deficiencies) or increase (four deficiencies) heart rate across a range of genetic backgrounds. Candidate genes include several neurotransmitter receptor loci, particularly monoamine receptors, consistent with results of prior pharmacological manipulations of heart rate, as well as genes associated with paralytic phenotypes. Significant genetic variation is also shown to exist for a suite of four autonomic behaviors that are exhibited spontaneously upon decapitation, namely, grooming, grasping, righting, and quivering. Overall activity levels are increased by application of particular concentrations of the drugs octopamine and nicotine, but due to high environmental variance both within and among replicate vials, the significance of genetic variation among wild-type lines for response to the drugs is difficult to establish. An interval mapping design was also used to map two or three QTL for each behavioral trait in a set of recombinant inbred lines derived from the laboratory stocks Oregon-R and 2b.


Genes ◽  
2020 ◽  
Vol 11 (10) ◽  
pp. 1154
Author(s):  
Min Jeong Hong ◽  
Jin-Baek Kim ◽  
Yong Weon Seo ◽  
Dae Yeon Kim

Genes of the F-box family play specific roles in protein degradation by post-translational modification in several biological processes, including flowering, the regulation of circadian rhythms, photomorphogenesis, seed development, leaf senescence, and hormone signaling. F-box genes have not been previously investigated on a genome-wide scale; however, the establishment of the wheat (Triticum aestivum L.) reference genome sequence enabled a genome-based examination of the F-box genes to be conducted in the present study. In total, 1796 F-box genes were detected in the wheat genome and classified into various subgroups based on their functional C-terminal domain. The F-box genes were distributed among 21 chromosomes and most showed high sequence homology with F-box genes located on the homoeologous chromosomes because of allohexaploidy in the wheat genome. Additionally, a synteny analysis of wheat F-box genes was conducted in rice and Brachypodium distachyon. Transcriptome analysis during various wheat developmental stages and expression analysis by quantitative real-time PCR revealed that some F-box genes were specifically expressed in the vegetative and/or seed developmental stages. A genome-based examination and classification of F-box genes provide an opportunity to elucidate the biological functions of F-box genes in wheat.


2014 ◽  
Vol 42 (15) ◽  
pp. 9838-9853 ◽  
Author(s):  
Saeed Kaboli ◽  
Takuya Yamakawa ◽  
Keisuke Sunada ◽  
Tao Takagaki ◽  
Yu Sasano ◽  
...  

Abstract Despite systematic approaches to mapping networks of genetic interactions in Saccharomyces cerevisiae, exploration of genetic interactions on a genome-wide scale has been limited. The S. cerevisiae haploid genome has 110 regions that are longer than 10 kb but harbor only non-essential genes. Here, we attempted to delete these regions by PCR-mediated chromosomal deletion technology (PCD), which enables chromosomal segments to be deleted by a one-step transformation. Thirty-three of the 110 regions could be deleted, but the remaining 77 regions could not. To determine whether the 77 undeletable regions are essential, we successfully converted 67 of them to mini-chromosomes marked with URA3 using PCR-mediated chromosome splitting technology and conducted a mitotic loss assay of the mini-chromosomes. Fifty-six of the 67 regions were found to be essential for cell growth, and 49 of these carried co-lethal gene pair(s) that had not previously been detected by synthetic genetic array analysis. This result implies that regions harboring only non-essential genes contain unidentified synthetic lethal combinations at an unexpectedly high frequency, revealing a novel landscape of genetic interactions in the S. cerevisiae genome. Furthermore, this study indicates that segmental deletion might be exploited not only for revealing genome function but also for breeding stress-tolerant strains.


2018 ◽  
Vol 60 (1) ◽  
pp. 17-28 ◽  
Author(s):  
Yasmeen Niazi ◽  
Hauke Thomsen ◽  
Bozena Smolkova ◽  
Ludmila Vodickova ◽  
Sona Vodenkova ◽  
...  

2016 ◽  
Author(s):  
Bethany Signal ◽  
Brian S Gloss ◽  
Marcel E Dinger ◽  
Timothy R Mercer

Abstract Background: The branchpoint element is required for the first lariat-forming reaction in splicing. However, due to the difficulty of experimentally mapping branchpoints at a genome-wide scale, current catalogues are incomplete. Results: We have developed a machine-learning algorithm trained with empirical human branchpoint annotations to identify branchpoint elements from primary genome sequence alone. Using this approach, we can accurately locate branchpoint elements in 85% of introns in current gene annotations. Consistent with branchpoints as basal genetic elements, we find our annotation is unbiased towards gene type and expression levels. A major fraction of introns was found to encode multiple branchpoints, raising the prospect that mutational redundancy is encoded in key genes. We also confirmed all deleterious branchpoint mutations annotated in clinical variant databases, and further identified thousands of clinical and common genetic variants with similar predicted effects. Conclusions: We propose that the broad annotation of branchpoints constitutes a valuable resource for further investigations into the genetic encoding of splicing patterns, and for interpreting the impact of common and disease-causing human genetic variation on gene splicing.


2021 ◽  
Vol 11 ◽  
Author(s):  
Matthew J. Rybin ◽  
Melina Ramic ◽  
Natalie R. Ricciardi ◽  
Philipp Kapranov ◽  
Claes Wahlestedt ◽  
...  

Genome instability is associated with myriad human diseases and is a well-known feature of both cancer and neurodegenerative disease. Until recently, the ability to assess DNA damage—the principal driver of genome instability—was limited to relatively imprecise methods or restricted to studying predefined genomic regions. Recently, new techniques for detecting DNA double strand breaks (DSBs) and single strand breaks (SSBs) with next-generation sequencing on a genome-wide scale with single nucleotide resolution have emerged. With these new tools, efforts are underway to define the “breakome” in normal aging and disease. Here, we compare the relative strengths and weaknesses of these technologies and their potential application to studying neurodegenerative diseases.

