covariation analysis
Recently Published Documents


TOTAL DOCUMENTS

29
(FIVE YEARS 17)

H-INDEX

5
(FIVE YEARS 3)

2021 ◽  
Vol 12 (6) ◽  
pp. 329-335
Author(s):  
Ya. A. Turovsky ◽  
◽  
S. V. Borzunov ◽  
A. A. Vahtin ◽  
◽  
...  

The paper discusses the application of the method of co-variance analysis of noisy signals using the example of an electroencephalogram to increase the frequency resolution in relation to the phenomena of electrical activity of the brain. The method is focused primarily on signals presented in digital form. The possibility of increasing the frequency resolution in comparison with the method of Fourier transform and wavelet transform is shown.


2021 ◽  
Author(s):  
Zhe Zhang ◽  
Peng Xiong ◽  
Junfeng Wang ◽  
Jian Zhan ◽  
Yaoqi Zhou

Long Interspersed Nuclear Element (LINE) retrotransposons play an important role in genomic innovation as well as genomic instability in many eukaryotes including human. Random insertions and extinction through mutational inactivation make them perfectly time-stamped "DNA fossils". Here, we investigated the origin of a self-cleaving ribozyme in 5' UTR of LINE-1. We showed that this ribozyme only requires 35 nucleotides for self-cleavage with a simple but previously unknown secondary-structure motif that was determined by deep mutational scanning and covariation analysis. Structure-based homology search revealed the existence of this mini-ribozyme in anthropoids but not in prosimians. In human, the most homologs of this mini-ribozyme were found in lineage L1PA6-10 but essential none in more recent L1PA1-2 or more ancient L1PA13-15. We resurrected mini-ribozymes according to consensus sequences and confirmed that mini-ribozymes were active in L1PA10 and L1PA8 but not in L1PA7 and more recent lineages. The result paints a consistent picture for the emergence of the active ribozyme around 40 million years ago, just before the divergence of the new world monkeys (Platyrrhini) and old-world monkeys (Catarrhini). The ribozyme, however, subsequently went extinct after L1PA7 emerged around 30 million years ago with a deleterious mutation. This work uncovers the rise and fall of the mini-LINE-1 ribozyme recorded in the "DNA fossils" of our own genome. More importantly, this ancient, naturally trans-cleaving ribozyme (after removing the non-functional stem loop) may find its modern usage in bioengineering and RNA-targeting therapeutics.


2021 ◽  
Author(s):  
Miranda S Wellington ◽  
Qian Cong ◽  
Amy L Schaefer ◽  
Emily Kenna MacLeod ◽  
Angelina Zimenko ◽  
...  

Many bacteria communicate with kin and coordinate group behaviors through a form of cell-cell signaling called acyl-homoserine lactone (AHL) quorum sensing (QS). In these systems, a signal synthase produces an AHL to which its paired receptor selectively responds. Selectivity is fundamental to cell signaling. Despite its importance, it has been challenging to determine how this selectivity is achieved and how AHL QS systems evolve and diversify. We hypothesized that we could use covariation within the protein sequences of AHL synthases and receptors to identify selectivity residues. We began by identifying about 6,000 unique synthase-receptor pairs. We then used the protein sequences of these pairs to identify covariation patterns and mapped the patterns onto the LasI/R system from Pseudomonas aeruginosa PAO1. The covarying residues in both proteins cluster around the ligand binding sites. We demonstrate that these residues are involved in system selectivity toward the cognate signal and go on to engineer the Las system to both produce and respond to an alternate AHL signal. We have thus demonstrated a new application for covariation methods and have deepened our understanding of how communication systems evolve and diversify.


eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Dana M Talsness ◽  
Katie G Owings ◽  
Emily Coelho ◽  
Gaelle Mercenne ◽  
John M Pleinis ◽  
...  

N-Glycanase 1 (NGLY1) is a cytoplasmic deglycosylating enzyme. Loss-of-function mutations in the NGLY1 gene cause NGLY1 deficiency, which is characterized by developmental delay, seizures, and a lack of sweat and tears. To model the phenotypic variability observed among patients, we crossed a Drosophila model of NGLY1 deficiency onto a panel of genetically diverse strains. The resulting progeny showed a phenotypic spectrum from 0 to 100% lethality. Association analysis on the lethality phenotype, as well as an evolutionary rate covariation analysis, generated lists of modifying genes, providing insight into NGLY1 function and disease. The top association hit was Ncc69 (human NKCC1/2), a conserved ion transporter. Analyses in NGLY1-/- mouse cells demonstrated that NKCC1 has an altered average molecular weight and reduced function. The misregulation of this ion transporter may explain the observed defects in secretory epithelium function in NGLY1 deficiency patients.


2020 ◽  
Vol 21 (1) ◽  
Author(s):  
John T. Lawson ◽  
Jason P. Smith ◽  
Stefan Bekiranov ◽  
Francine E. Garrett-Bakelman ◽  
Nathan C. Sheffield

Abstract A key challenge in epigenetics is to determine the biological significance of epigenetic variation among individuals. We present Coordinate Covariation Analysis (COCOA), a computational framework that uses covariation of epigenetic signals across individuals and a database of region sets to annotate epigenetic heterogeneity. COCOA is the first such tool for DNA methylation data and can also analyze any epigenetic signal with genomic coordinates. We demonstrate COCOA’s utility by analyzing DNA methylation, ATAC-seq, and multi-omic data in supervised and unsupervised analyses, showing that COCOA provides new understanding of inter-sample epigenetic variation. COCOA is available on Bioconductor (http://bioconductor.org/packages/COCOA).


2020 ◽  
Author(s):  
John T. Lawson ◽  
Jason P. Smith ◽  
Stefan Bekiranov ◽  
Francine E. Garrett-Bakelman ◽  
Nathan C. Sheffield

AbstractA key challenge in epigenetics is to determine the biological significance of epigenetic variation among individuals. Here, we present Coordinate Covariation Analysis (COCOA), a computational framework that uses covariation of epigenetic signals across individuals and a database of region sets to annotate epigenetic heterogeneity. COCOA is the first such tool for DNA methylation data and can also analyze any epigenetic signal with genomic coordinates. We demonstrate COCOA’s utility by analyzing DNA methylation, ATAC-seq, and multi-omic data in supervised and unsupervised analyses, showing that COCOA provides new understanding of inter-sample epigenetic variation. COCOA is available as a Bioconductor R package (http://bioconductor.org/packages/COCOA).


2020 ◽  
Author(s):  
Dana M. Talsness ◽  
Katie G. Owings ◽  
Emily Coelho ◽  
Gaelle Mercenne ◽  
John M. Pleinis ◽  
...  

AbstractN-Glycanase 1 (NGLY1) is a cytoplasmic deglycosylating enzyme. Loss-of-function mutations in the NGLY1 gene cause NGLY1 deficiency, which is characterized by developmental delay, seizures, and a lack of sweat and tears. To model the phenotypic variability observed among patients, we crossed a Drosophila model of NGLY1 deficiency onto a panel of genetically diverse strains. The resulting progeny showed a phenotypic spectrum from 0-100% lethality. Association analysis on the lethality phenotype as well as an evolutionary rate covariation analysis generated lists of modifying genes, providing insight into NGLY1 function and disease. The top association hit was Ncc69 (human NKCC1/2), a conserved ion transporter. Analyses in NGLY1 -/- mouse cells demonstrated that NKCC1 is misglycosylated and has reduced function, making it only the second confirmed NGLY1 enzymatic substrate. The misregulation of this ion transporter may explain the observed defects in secretory epithelium function in NGLY1 deficiency patients.


2020 ◽  
Author(s):  
Elena Rivas ◽  
Sean R. Eddy

AbstractTavares’ conclusions depend on an assumption that the statistic they use (RAFS) is an appropriate measure of RNA base pair covariation, but RAFS was not designed to measure covariation alone. RAFS detects positive signals in common patterns of primary sequence conservation in absence of any covariation. To illustrate the severity of the problem, we show that Tavares’ analysis reports “significantly covarying base pairs” in 100% identical sequence alignments with no variation or covariation. We use Tavares’ sequence alignment of HOTAIR domain 1 as an example to show that the base pairs they identify as significantly covarying actually arise from primary sequence conservation patterns. Their analysis still reports similar numbers of “significant covarying” base pairs in a negative control in which we permute residues in independent alignment columns to destroy covariation. There remains no significant covariation support for evolutionarily conserved RNA structure in the HOTAIR lncRNA or other lncRNA structures and alignments we have analyzed.


Author(s):  
Elena Rivas

AbstractKnowing the structure of conserved structural RNAs is important to elucidate their function and mechanism of action. However, predicting a conserved RNA structure remains unreliable, even when using a combination of thermodynamic stability and evolutionary covariation information. Here we present a method to predict a conserved RNA structure that combines the following three features. First, it uses significant covariation due to RNA structure and removes spurious covariation due to phylogeny. Second, it uses negative evolutionary information: basepairs that have variation but no significant covariation are prevented from occurring. Lastly, it uses a battery of probabilistic folding algorithms that incorporate all positive covariation into one structure. The method, named CaCoFold (Cascade variation/covariation Constrained Folding algorithm), predicts a nested structure guided by a maximal subset of positive basepairs, and recursively incorporates all remaining positive basepairs into alternative helices. The alternative helices can be compatible with the nested structure such as pseudoknots, or overlapping such as competing structures, base triplets, or other 3D non-antiparallel interactions. We present evidence that CaCoFold predictions are consistent with structures modeled from crystallography.Author SummaryThe availability of deeper comparative sequence alignments and recent advances in statistical analysis of RNA sequence covariation have made it possible to identify a reliable set of conserved base pairs, as well as a reliable set of non-basepairs (positions that vary without covarying). Predicting an overall consensus secondary structure consistent with a set of individual inferred pairs and non-pairs remains a problem. Current RNA structure prediction algorithms that predict nested secondary structures cannot use the full set of inferred covarying pairs, because covariation analysis also identifies important non-nested pairing interactions such as pseudoknots, base triples, and alternative structures. Moreover, although algorithms for incorporating negative constraints exist, negative information from covariation analysis (inferred non-pairs) has not been systematically exploited.Here I introduce an efficient approximate RNA structure prediction algorithm that incorporates all inferred pairs and excludes all non-pairs. Using this, and an improved visualization tool, I show that the method correctly identifies many non-nested structures in agreement with known crystal structures, and improves many curated consensus secondary structure annotations in RNA sequence alignment databases.


Sign in / Sign up

Export Citation Format

Share Document