repeat elements
Recently Published Documents


TOTAL DOCUMENTS

185
(FIVE YEARS 56)

H-INDEX

36
(FIVE YEARS 6)

2021 ◽  
Vol 66 (2) ◽  
pp. 155-165
Author(s):  
Tian-Rui Wang ◽  
Zheng-Wei Wang ◽  
Yi-Gang Song ◽  
Gregor Kozlowski

Quercus ningangensis is an economically and ecologically important tree species belonging to the family Fagaceae. In this study, the complete chloroplast (cp) genome of Q. ningangensis was sequenced and assembled, and 18 published cp genomes of Quercus were retrieved for genomic analyses (including sequence divergence, repeat elements, and structure) and phylogenetic inference. With this study, we found that complete cp genomes in Quercus are conserved, and we discovered a codon composition bias, which may be related to genomic content and genetic characteristics. In addition, we detected considerable structural variations in the expansion and contraction of inverted repeat regions. Six regions with relatively high variable (matK-rps16, psbC, ycf3 intron, rbcL, petA-psbJ, and ycf1) were detected by conducting a sliding window analysis, which has a high potential for developing effective genetic markers. Phylogenetic analysis based on Bayesian inference and maximum likelihood methods resulted in a robust phylogenetic tree of Quercus with high resolution for nearly all identified nodes. The phylogenetic relationships showed that the phylogenetic position of Q. ningangensis was located between Q. sichourensis and Q. acuta. The results of this study contribute to future research into the phylogenetic evolution of Quercus section Cyclobalanopsis (Fagaceae).


Polymers ◽  
2021 ◽  
Vol 14 (1) ◽  
pp. 26
Author(s):  
Mingzhe Chi ◽  
Rihab Gargouri ◽  
Tim Schrader ◽  
Kamel Damak ◽  
Ramzi Maâlej ◽  
...  

Descriptors derived from atomic structure and quantum chemical calculations for small molecules representing polymer repeat elements were evaluated for machine learning models to predict the Hildebrand solubility parameters of the corresponding polymers. Since reliable cohesive energy density data and solubility parameters for polymers are difficult to obtain, the experimental heat of vaporization ΔHvap of a set of small molecules was used as a proxy property to evaluate the descriptors. Using the atomistic descriptors, the multilinear regression model showed good accuracy in predicting ΔHvap of the small-molecule set, with a mean absolute error of 2.63 kJ/mol for training and 3.61 kJ/mol for cross-validation. Kernel ridge regression showed similar performance for the small-molecule training set but slightly worse accuracy for the prediction of ΔHvap of molecules representing repeating polymer elements. The Hildebrand solubility parameters of the polymers derived from the atomistic descriptors of the repeating polymer elements showed good correlation with values from the CROW polymer database.


Author(s):  
Quentin Delorme ◽  
Rémy Costa ◽  
Yasmine Mansour ◽  
Anna-Sophie Fiston-Lavier ◽  
Annie Chateau

In this paper, we investigate througth a premilinary study the influence of repeat elements during the assembly process. We analyze the link between the presence and the nature of one type of repeat element, called transposable element (TE) and misassembly events in genome assemblies. We propose to improve assemblies by taking into account the presence of repeat elements, including TEs, during the scaffolding step. We analyze the results and relate the misassemblies to TEs before and after correction.


2021 ◽  
Author(s):  
Georgios C Stefos ◽  
Georgios Theodorou ◽  
Ioannis Politis

G-Quadruplex structures are non-B DNA structures that occur in regions carrying short runs of guanines. They are implicated in several biological processes including transcription, translation, replication and telomere maintenance as well as in several pathological conditions like cancer and thus they have gained the attention of the scientific community. The rise of the -omics era significantly affected the G-quadruplex research and the genome-wide characterization of G-Quadruplexes has been rendered a necessary first step towards applying genomics approaches for their study. While in human and several model organisms there is a considerable number of works studying genome-wide the DNA motifs with potential to form G-quadruplexes (G4-motifs), there is a total absence of any similar studies regarding livestock animals. The objectives of the present study were to provide a detailed characterization of the bovine genic G4-motifs distribution and properties and to suggest a possible mechanism for the delivery of G4 motifs in the genes. Our data indicate that the distribution of G4s within bovine genes and the annotation of said genes to Gene Ontology terms are similar to what is already shown for other organisms. By investigating their structural characteristics and polymorphism, it is obvious that the overall stability of the putative quadruplex structures is in line with the current notion in the G4 field. Similarly to human, the bovine G4s are overrepresented in specific LINE repeat elements, the L1_BTs in the case of cattle. We suggest these elements as vehicles for delivery of G4 motifs in the introns of the bovine genes. Lastly, it seems that a basis exists for connecting traits of agricultural importance to the genetic variation of G4 motifs, thus, cattle could become an interesting new model organism for G4-related genetic studies.


2021 ◽  
Author(s):  
Arun H. Patil ◽  
Marc K. Halushka ◽  
Bastian K. Fromm

The telomere to telomere (T2T) genome project discovered and mapped ~240 million additional base pairs of primarily telomeric and centromeric reads. Much of this sequence was comprised of satellite sequences and large segmental duplications. We evaluated the extent to which human bona fide microRNAs (miRNAs) may be found in additional paralogous genomic loci or if previously undescribed microRNAs are present in these newly sequenced regions of the human genome. New genomic regions of the T2T project spanning ~240 million bp of sequence were obtained and evaluated by blastn for the human miRNAs contained in MirGeneDB2.0 (N=556) and miRBase (N = 1917) along with all species of MirGeneDB2.0 miRNAs (N=10,899). Additionally, bowtie was used to compare unmapped reads from >4,000 primary cell samples to the new T2T sequence. Based on sequence and structure, no bona fide miRNAs were identified. Ninety-seven miRNAs of questionable authenticity (frequently known repeat elements) were identified from the miRBase dataset across the newly described regions of the human genome. These 97 represent only 51 miRNA families due to paralogy of highly similar miRNAs such as 24 members of the hsa-mir-548 family. Altogether, this data strongly supports our having identified widely expressed bona fide miRNAs in the human genome and move us further toward the completion of human miRNA discovery.


PLoS Genetics ◽  
2021 ◽  
Vol 17 (11) ◽  
pp. e1009908
Author(s):  
Tie-Bo Zeng ◽  
Nicholas Pierce ◽  
Ji Liao ◽  
Purnima Singh ◽  
Kin Lau ◽  
...  

EHMT2 is the main euchromatic H3K9 methyltransferase. Embryos with zygotic, or maternal mutation in the Ehmt2 gene exhibit variable developmental delay. To understand how EHMT2 prevents variable developmental delay we performed RNA sequencing of mutant and somite stage-matched normal embryos at 8.5–9.5 days of gestation. Using four-way comparisons between delayed and normal embryos we clarified what it takes to be normal and what it takes to develop. We identified differentially expressed genes, for example Hox genes that simply reflected the difference in developmental progression of wild type and the delayed mutant uterus-mate embryos. By comparing wild type and zygotic mutant embryos along the same developmental window we detected a role of EHMT2 in suppressing variation in the transcriptional switches. We identified transcription changes where precise switching during development occurred only in the normal but not in the mutant embryo. At the 6-somite stage, gastrulation-specific genes were not precisely switched off in the Ehmt2−/− zygotic mutant embryos, while genes involved in organ growth, connective tissue development, striated muscle development, muscle differentiation, and cartilage development were not precisely switched on. The Ehmt2mat−/+ maternal mutant embryos displayed high transcriptional variation consistent with their variable survival. Variable derepression of transcripts occurred dominantly in the maternally inherited allele. Transcription was normal in the parental haploinsufficient wild type embryos despite their delay, consistent with their good prospects. Global profiling of transposable elements revealed EHMT2 targeted DNA methylation and suppression at LTR repeats, mostly ERVKs. In Ehmt2−/− embryos, transcription over very long distances initiated from such misregulated ‘driver’ ERVK repeats, encompassing a multitude of misexpressed ‘passenger’ repeats. In summary, EHMT2 reduced transcriptional variation of developmental switch genes and developmentally switching repeat elements at the six-somite stage embryos. These findings establish EHMT2 as a suppressor of transcriptional and developmental variation at the transition between gastrulation and organ specification.


Cells ◽  
2021 ◽  
Vol 10 (10) ◽  
pp. 2710
Author(s):  
Kui Duan ◽  
Chen-Yang Si ◽  
Shu-Mei Zhao ◽  
Zong-Yong Ai ◽  
Bao-Hua Niu ◽  
...  

Precise gene regulation is critical during embryo development. Long terminal repeat elements (LTRs) of endogenous retroviruses (ERVs) are dynamically expressed in blastocysts of mammalian embryos. However, the expression pattern of LTRs in monkey blastocyst is still unknown. By single-cell RNA-sequencing (seq) data of cynomolgus monkeys, we found that LTRs of several ERV families, including MacERV6, MacERV3, MacERV2, MacERVK1, and MacERVK2, were highly expressed in pre-implantation embryo cells including epiblast (EPI), trophectoderm (TrB), and primitive endoderm (PrE), but were depleted in post-implantation. We knocked down MacERV6-LTR1a in cynomolgus monkeys with a short hairpin RNA (shRNA) strategy to examine the potential function of MacERV6-LTR1a in the early development of monkey embryos. The silence of MacERV6-LTR1a mainly postpones the differentiation of TrB, EPI, and PrE cells in embryos at day 7 compared to control. Moreover, we confirmed MacERV6-LTR1a could recruit Estrogen Related Receptor Beta (ESRRB), which plays an important role in the maintenance of self-renewal and pluripotency of embryonic and trophoblast stem cells through different signaling pathways including FGF and Wnt signaling pathways. In summary, these results suggest that MacERV6-LTR1a is involved in gene regulation of the pre-implantation embryo of the cynomolgus monkeys.


EMBO Reports ◽  
2021 ◽  
Author(s):  
Deepika Puri ◽  
Birgit Koschorz ◽  
Bettina Engist ◽  
Megumi Onishi‐Seebacher ◽  
Devon Ryan ◽  
...  
Keyword(s):  

Author(s):  
Dong-Bin Chen ◽  
Ru-Song Zhang ◽  
Xiang-Dong Jin ◽  
Jian Yang ◽  
Peng Li ◽  
...  

Abstract To explore the characteristics of the mitochondrial genome (mitogenome) of the squeaking silkmoths Rhodinia, a genus of wild silkmoths in the family Saturniidae of Lepidoptera, and reveal phylogenetic relationships, the mitogenome of Rhodinia fugax Butler was determined. This wild silkmoth spins a green cocoon that has potential significance in sericulture, and exhibits a unique feature that its larvae can squeak loudly when touched. The mitogenome of R. fugax is a circular molecule of 15,334 bp long and comprises 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and an A + T-rich region, consistent with previous observations of Saturniidae species. The 370-bp A + T-rich region of R. fugax contains no tandem repeat elements and harbors several features common to the Bombycidea insects, but microsatellite AT repeat sequence preceded by the ATTTA motif is not present. Mitogenome-based phylogenetic analysis shows that R. fugax belongs to Attacini, instead of Saturniini. This study presents the first mitogenome for Rhodinia genus.


Sign in / Sign up

Export Citation Format

Share Document