repeat sequence
Recently Published Documents


TOTAL DOCUMENTS

680
(FIVE YEARS 97)

H-INDEX

57
(FIVE YEARS 4)

2022 ◽  
pp. 1-4
Author(s):  
Wanling Yang ◽  
Yuanwei Fan ◽  
Yong Chen ◽  
Gumu Ding ◽  
Hu Liu ◽  
...  

Abstract Dongxiang wild rice (Oryza rufipogon Griff.) (DXWR) is the northernmost distributed wild rice found in the world. Similar to other populations of O. rufipogon, DXWR contains a large number of agronomically valuable genes, which makes it a natural gene pool for rice breeding. Molecular markers, especially simple repeat sequence (SSR) markers, play important roles in plant breeding. Although a large number of SSR markers have been developed, most of them are derived from the genome coding sequences, rarely from non-coding sequences. Meanwhile, long non-coding RNAs (lncRNAs), which are derived from the transcription of non-coding sequences, play vital roles in plant growth, development and stress responses. In our previous study, we obtained 1655 lncRNA transcripts from DXWR using strand-specific RNA sequencing. In this study, 1878 SSR loci were detected from the lncRNA sequences of DXWR, and 1258 lncRNA-derived-SSR markers were developed on the genome-wide scale. To verify the validity and applicability of these markers, 72 pairs of primers were randomly selected to test 44 rice accessions. The results showed that 42 (58.33%) pairs of primers have abundant polymorphism among these rice materials; the polymorphism information content values ranged from 0.04 to 0.87 with an average of 0.50; the genetic diversity index of SSR loci varied from 0.04 to 0.88 with an average of 0.56; and the number of alleles per marker ranged from 2 to 11 with an average of 4.36. Thus, we concluded that these lncRNA-derived-SSR markers are a very useful source for future basic and applied research.


Biology ◽  
2022 ◽  
Vol 11 (1) ◽  
pp. 63
Author(s):  
Xiu-Xiu Guo ◽  
Xiao-Jian Qu ◽  
Xue-Jie Zhang ◽  
Shou-Jin Fan

Aristidoideae is a subfamily in the PACMAD clade of family Poaceae, including three genera, Aristida, Stipagrostis, and Sartidia. In this study, the plastomes of Aristida adscensionis and Stipagrostis pennata were newly sequenced, and a total of 16 Aristidoideae plastomes were compared. All plastomes were conservative in genome size, gene number, structure, and IR boundary. Repeat sequence analysis showed that forward and palindrome repeats were the most common repeat types. The number of SSRs ranged from 30 (Sartidia isaloensis) to 54 (Aristida purpurea). Codon usage analysis showed that plastome genes preferred to use codons ending with A/T. A total of 12 highly variable regions were screened, including four protein coding sequences (matK, ndhF, infA, and rpl32) and eight non-coding sequences (rpl16-1-rpl16-2, ccsA-ndhD, trnY-GUA-trnD-GUC, ndhF-rpl32, petN-trnC-GCA, trnT-GGU-trnE-UUC, trnG-GCC-trnfM-CAU, and rpl32-trnL-UAG). Furthermore, the phylogenetic position of this subfamily and their intergeneric relationships need to be illuminated. All Maximum Likelihood and Bayesian Inference trees strongly support the monophyly of Aristidoideae and each of three genera, and the clade of Aristidoideae and Panicoideae was a sister to other subfamilies in the PACMAD clade. Within Aristidoideae, Aristida is a sister to the clade composed of Stipagrostis and Sartidia. The divergence between C4 Stipagrostis and C3 Sartidia was estimated at 11.04 Ma, which may be associated with the drought event in the Miocene period. Finally, the differences in carbon fixation patterns, geographical distributions, and ploidy may be related to the difference of species numbers among these three genera. This study provides insights into the phylogeny and evolution of the subfamily Aristidoideae.


2021 ◽  
Author(s):  
Thomas Cech ◽  
Arthur Zaug

Abstract Telomeres, the natural ends of linear chromosomes, are comprised of repeat-sequence DNA and associated proteins1. Replication of telomeres allows continued proliferation of human stem cells and immortality of cancer cells2. Replication begins with telomerase3 extending the single-stranded DNA (ssDNA) of the telomeric G-strand [(TTAGGG)n]; the synthesis of the complementary C-strand [(CCCTAA)n] is much less well characterized. The CST (CTC1-STN1-TEN1) protein complex, a DNA Polymerase α-primase accessory factor4,5, is known to be required for telomere replication in vivo6,7,8,9, and the molecular analysis presented here reveals key features of its mechanism. We find that CST uses its ssDNA-binding activity to specify the origins for telomeric C-strand synthesis by bound Polα-primase. CST-organized DNA polymerization can copy a telomeric DNA template that folds into G-quadruplex structures, but the suboptimality of this template likely contributes to telomere replication problems observed in vivo. Combining telomerase, a short telomeric ssDNA primer, and CST-Polα-primase gives complete telomeric DNA replication, resulting in the same sort of ssDNA 3’-overhang found naturally on human telomeres. We conclude that the CST complex not only terminates telomerase extension10,11 and recruits Polα-primase to telomeric ssDNA4,12,13, but it also orchestrates C-strand synthesis. Because replication of the telomere has features distinct from replication of the rest of the genome, targeting telomere-replication components including CST holds promise for cancer therapeutics.


Author(s):  
Yuzo Fujino ◽  
Yoshitaka Nagai

Expanded short tandem repeats in the genome cause various monogenic diseases, particularly neurological disorders. Since the discovery of a CGG repeat expansion in the FMR1 gene in 1991, more than 40 repeat expansion diseases have been identified to date. In the coding repeat expansion diseases, in which the expanded repeat sequence is located in the coding regions of genes, the toxicity of repeat polypeptides, particularly misfolding and aggregation of proteins containing an expanded polyglutamine tract, have been the focus of investigation. On the other hand, in the non-coding repeat expansion diseases, in which the expanded repeat sequence is located in introns or untranslated regions, the toxicity of repeat RNAs has been the focus of investigation. Recently, these repeat RNAs were demonstrated to be translated into repeat polypeptides by the novel mechanism of repeat-associated non-AUG translation, which has extended the research direction of the pathological mechanisms of this disease entity to include polypeptide toxicity. Thus, a common pathogenesis has been suggested for both coding and non-coding repeat expansion diseases. In this review, we briefly outline the major pathogenic mechanisms of repeat expansion diseases, including a loss-of-function mechanism caused by repeat expansion, repeat RNA toxicity caused by RNA foci formation and protein sequestration, and toxicity by repeat polypeptides. We also discuss perturbation of the physiological liquid-liquid phase separation state caused by these repeat RNAs and repeat polypeptides, as well as potential therapeutic approaches against repeat expansion diseases.


2021 ◽  
Vol 33 (4) ◽  
pp. 293-300
Author(s):  
Larissa Arning ◽  
Huu Phuc Nguyen

Abstract The causative mutation for Huntington disease (HD), an expanded trinucleotide repeat sequence in the first exon of the huntingtin gene (HTT) is naturally polymorphic and inevitably associated with disease symptoms above 39 CAG repeats. Although symptomatic medical therapies for HD can improve the motor and non-motor symptoms for affected patients, these drugs do not stop the ongoing neurodegeneration and progression of the disease, which results in severe motor and cognitive disability and death. To date, there is still an urgent need for the development of effective disease‐modifying therapies to slow or even stop the progression of HD. The increasing ability to intervene directly at the roots of the disease, namely HTT transcription and translation of its mRNA, makes it necessary to understand the pathogenesis of HD as precisely as possible. In addition to the long-postulated toxicity of the polyglutamine-expanded mutant HTT protein, there is increasing evidence that the CAG repeat-containing RNA might also be directly involved in toxicity. Recent studies have identified cis- (DNA repair genes) and trans- (loss/duplication of CAA interruption) acting variants as major modifiers of age at onset (AO) and disease progression. More and more extensive data indicate that somatic instability functions as a driver for AO as well as disease progression and severity, not only in HD but also in other polyglutamine diseases. Thus, somatic expansions of repetitive DNA sequences may be essential to promote respective repeat lengths to reach a threshold leading to the overt neurodegenerative symptoms of trinucleotide diseases. These findings support somatic expansion as a potential therapeutic target in HD and related repeat expansion disorders.


2021 ◽  
Vol 5 (2) ◽  
pp. e202101232
Author(s):  
Alice J-L Zheng ◽  
Aikaterini Thermou ◽  
Pedro Guixens Gallardo ◽  
Laurence Malbert-Colas ◽  
Chrysoula Daskalogianni ◽  
...  

The role of G-quadruplex (G4) RNA structures is multifaceted and controversial. Here, we have used as a model the EBV-encoded EBNA1 and the Kaposi’s sarcoma-associated herpesvirus (KSHV)-encoded LANA1 mRNAs. We have compared the G4s in these two messages in terms of nucleolin binding, nuclear mRNA retention, and mRNA translation inhibition and their effects on immune evasion. The G4s in the EBNA1 message are clustered in one repeat sequence and the G4 ligand PhenDH2 prevents all G4-associated activities. The RNA G4s in the LANA1 message take part in similar multiple mRNA functions but are spread throughout the message. The different G4 activities depend on flanking coding and non-coding sequences and, interestingly, can be separated individually. Together, the results illustrate the multifunctional, dynamic and context-dependent nature of G4 RNAs and highlight the possibility to develop ligands targeting specific RNA G4 functions. The data also suggest a common multifunctional repertoire of viral G4 RNA activities for immune evasion.


Cell Research ◽  
2021 ◽  
Author(s):  
Futang Wan ◽  
Yongbo Ding ◽  
Yuebin Zhang ◽  
Zhenfang Wu ◽  
Shaobai Li ◽  
...  

AbstractTelomerase, a multi-subunit ribonucleoprotein complex, is a unique reverse transcriptase that catalyzes the processive addition of a repeat sequence to extend the telomere end using a short fragment of its own RNA component as the template. Despite recent structural characterizations of human and Tetrahymena telomerase, it is still a mystery how telomerase repeatedly uses its RNA template to synthesize telomeric DNA. Here, we report the cryo-EM structure of human telomerase holoenzyme bound with telomeric DNA at resolutions of 3.5 Å and 3.9 Å for the catalytic core and biogenesis module, respectively. The structure reveals that a leucine residue Leu980 in telomerase reverse transcriptase (TERT) catalytic subunit functions as a zipper head to limit the length of the short primer–template duplex in the active center. Moreover, our structural and computational analyses suggest that TERT and telomerase RNA (hTR) are organized to harbor a preformed active site that can accommodate short primer–template duplex substrates for catalysis. Furthermore, our findings unveil a double-fingers architecture in TERT that ensures nucleotide addition processivity of human telomerase. We propose that the zipper head Leu980 is a structural determinant for the sequence-based pausing signal of DNA synthesis that coincides with the RNA element-based physical template boundary. Functional analyses unveil that the non-glycine zipper head plays an essential role in both telomerase repeat addition processivity and telomere length homeostasis. In addition, we also demonstrate that this zipper head mechanism is conserved in all eukaryotic telomerases. Together, our study provides an integrated model for telomerase-mediated telomere synthesis.


2021 ◽  
Author(s):  
Egor Dolzhenko ◽  
Ben Weisburd ◽  
Kristina Ibanez ◽  
Indhu Shree Rajan Babu ◽  
Mark F Bennett ◽  
...  

Background: Expansions of short tandem repeats are the cause of many neurogenetic disorders including familial amyotrophic lateral sclerosis, Huntington disease, and many others. Multiple methods have been recently developed that can identify repeat expansions in whole genome or exome sequencing data. Despite the widely-recognized need for visual assessment of variant calls in clinical settings, current computational tools lack the ability to produce such visualizations for repeat expansions. Expanded repeats are difficult to visualize because they correspond to large insertions relative to the reference genome and involve many misaligning and ambiguously aligning reads. Results: We implemented REViewer, a computational method for visualization of sequencing data in genomic regions containing long repeat expansions. To generate a read pileup, REViewer reconstructs local haplotype sequences and distributes reads to these haplotypes in a way that is most consistent with the fragment lengths and evenness of read coverage. To create appropriate training materials for onboarding new users, we performed a concordance study involving 12 scientists involved in STR research. We used the results of this study to create a user guide that describes the basic principles of using REViewer as well as a guide to the typical features of read pileups that correspond to low confidence repeat genotype calls. Additionally, we demonstrated that REViewer can be used to annotate clinically-relevant repeat interruptions by comparing visual assessment results of 44 FMR1 repeat alleles with the results of triplet repeat primed PCR. For 38 of these alleles, the results of visual assessment were consistent with triplet repeat primed PCR. Conclusions: Read pileup plots generated by REViewer offer an intuitive way to visualize sequencing data in regions containing long repeat expansions. Laboratories can use REViewer to assess the quality of repeat genotype calls as well as to visually detect interruptions or other imperfections in the repeat sequence and the surrounding flanking regions.


Microbiology ◽  
2021 ◽  
Vol 167 (10) ◽  
Author(s):  
Madhu Manti Patra ◽  
Poulami Ghosh ◽  
Shreya Sengupta ◽  
Sujoy K. Das Gupta

MSMEG_2295 is a TetR family protein encoded by the first gene of a Mycobacterium smegmatis (Msm) operon that expresses the gene for DinB2 (MSMEG_2294), a translesion DNA repair enzyme. We have carried out investigations to understand its function by performing DNA binding studies and gene knockout experiments. We found that the protein binds to a conserved inverted repeat sequence located upstream of the dinB2 operon and several other genes. Using a knockout of MSMEG_2295, we show that MSMEG_2295 controls the expression of at least five genes, the products of which could potentially influence carbohydrate and fatty acid metabolism as well as antibiotic and oxidative stress resistance. We have demonstrated that MSMEG_2295 is a repressor by performing complementation analysis. Knocking out of MSMEG_2295 had a significant impact on pyruvate metabolism. Pyruvate dehydrogenase activity was virtually undetectable in ΔMSMEG_2295, although in the complemented strain, it was high. We also show that knocking out of MSMEG_2295 causes resistance to H2O2, reversed in the complemented strain. We have further found that the mycobacterial growth inhibitor plumbagin, a compound of plant origin, acts as an inducer of MSMEG_2295 regulated genes. We, therefore, establish that MSMEG_2295 functions by exerting its role as a repressor of multiple Msm genes and that by doing so, it plays a vital role in controlling pyruvate metabolism and response to oxidative stress.


Sign in / Sign up

Export Citation Format

Share Document