processed pseudogenes
Recently Published Documents


TOTAL DOCUMENTS

109
(FIVE YEARS 11)

H-INDEX

31
(FIVE YEARS 1)

2021 ◽  
Author(s):  
Julie Demars ◽  
Yann Labrune ◽  
Nathalie Iannuccelli ◽  
Alice Deshayes ◽  
Sophie Leroux ◽  
...  

Deciphering the molecular architecture of coat coloration for a better understanding of the biological mechanisms underlying pigmentation still remains a challenge. We took advantage of a rabbit French experimental population in which both a pattern and a gradient of coloration from white to brown segregated within the himalayan phenotype. The whole experimental design was genotyped using the high density Affymetrix® AxiomOrcun™ SNP Array and phenotyped into 6 different groups ordered from the lighter to the darker. Genome-wide association analyses pinpointed an oligogenic determinism, under recessive and additive inheritance, involving genes already known in melanogenesis (ASIP, KIT, MC1R, TYR), and likely processed pseudogenes linked to ribosomal function, RPS20 and RPS14. We also identified (i) gene-gene interactions through ASIP:MC1R affecting light cream/beige phenotypes while KIT:RPS responsible of dark chocolate/brown colors and (ii) a genome-wide epistatic network involving several others coloration genes such as POT1 or HPS5. Finally, we determined the recessive inheritance of the English spotting phenotype likely involving a copy number variation affecting at least the end of the coding sequence of the KIT gene. Our analyses of coloration as a continuous trait allowed us to go beyond much of the established knowledge through the detection of additional genes and gene-gene interactions that may contribute to the molecular architecture of the coloration phenotype. Moreover, the characterization of a network including genes that contribute to melanogenesis and pigmentation, two processes affected in various human disorders, shows the potential interest of our rabbit model for transversal studies.


BioEssays ◽  
2021 ◽  
pp. 2100186
Author(s):  
Robin‐Lee Troskie ◽  
Geoffrey J. Faulkner ◽  
Seth W. Cheetham

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Liqiang Tan ◽  
Weisheng Cheng ◽  
Fang Liu ◽  
Dan Ohtan Wang ◽  
Linwei Wu ◽  
...  

Abstract Background Canonical nonsense-mediated decay (NMD) is an important splicing-dependent process for mRNA surveillance in mammals. However, processed pseudogenes are not able to trigger NMD due to their lack of introns. It is largely unknown whether they have evolved other surveillance mechanisms. Results Here, we find that the RNAs of pseudogenes, especially processed pseudogenes, have dramatically higher m6A levels than their cognate protein-coding genes, associated with de novo m6A peaks and motifs in human cells. Furthermore, pseudogenes have rapidly accumulated m6A motifs during evolution. The m6A sites of pseudogenes are evolutionarily younger than neutral sites and their m6A levels are increasing, supporting the idea that m6A on the RNAs of pseudogenes is under positive selection. We then find that the m6A RNA modification of processed, rather than unprocessed, pseudogenes promotes cytosolic RNA degradation and attenuates interference with the RNAs of their cognate protein-coding genes. We experimentally validate the m6A RNA modification of two processed pseudogenes, DSTNP2 and NAP1L4P1, which promotes the RNA degradation of both pseudogenes and their cognate protein-coding genes DSTN and NAP1L4. In addition, the m6A of DSTNP2 regulation of DSTN is partially dependent on the miRNA miR-362-5p. Conclusions Our discovery reveals a novel evolutionary role of m6A RNA modification in cleaning up the unnecessary processed pseudogene transcripts to attenuate their interference with the regulatory network of protein-coding genes.


Author(s):  
Xiaowen Feng ◽  
Heng Li

Abstract LINE-1-mediated retrotransposition of protein-coding mRNAs is an active process in modern humans for both germline and somatic genomes. Prior works that surveyed human data mostly relied on detecting discordant mappings of paired-end short reads, or exon junctions contained in short reads. Moreover, there have been few genome-wide comparisons between gene retrocopies in great apes and humans. In this study, we introduced a more sensitive and accurate method to identify processed pseudogenes. Our method utilizes long-read assemblies, and more importantly, is able to provide full-length retrocopy sequences as well as flanking regions which are missed by short-read based methods. From 22 human individuals, we pinpointed 40 processed pseudogenes that are not present in the human reference genome GRCh38 and identified 17 pseudogenes that are in GRCh38 but absent from some input individuals. This represents a significantly higher discovery rate than previous reports (39 pseudogenes not in the reference genome out of 939 individuals). We also provided an overview of lineage-specific retrocopies in chimpanzee, gorilla, and orangutan genomes.


Author(s):  
Thiago L A Miller ◽  
Fernanda Orpinelli Rego ◽  
José Leonel L Buzzo ◽  
Pedro A F Galante

Abstract Motivation Retrocopies or processed pseudogenes are gene copies resulting from mRNA retrotransposition. These gene duplicates can be fixed, somatically inserted or polymorphic in the genome. However, knowledge regarding unfixed retrocopies (retroCNVs) is still limited, and the development of computational tools for effectively identifying and genotyping them is an urgent need. Results Here, we present sideRETRO, a pipeline dedicated not only to detecting retroCNVs in whole-genome or whole-exome sequencing data but also to revealing their insertion sites, zygosity and genomic context and classifying them as somatic or polymorphic events. We show that sideRETRO can identify novel retroCNVs and genotype them, in addition to finding polymorphic retroCNVs in whole-genome and whole-exome data. Therefore, sideRETRO fills a gap in the literature and presents an efficient and straightforward algorithm to accelerate the study of bona fide retroCNVs. Availability and implementation sideRETRO is available at https://github.com/galantelab/sideRETRO Supplementary information Supplementary data are available at Bioinformatics online.


2020 ◽  
Vol 295 (36) ◽  
pp. 12786-12795 ◽  
Author(s):  
Gwendolyn Kaeser ◽  
Jerold Chun

A new form of somatic gene recombination (SGR) has been identified in the human brain that affects the Alzheimer's disease gene, amyloid precursor protein (APP). SGR occurs when a gene sequence is cut and recombined within a single cell's genomic DNA, generally independent of DNA replication and the cell cycle. The newly identified brain SGR produces genomic complementary DNAs (gencDNAs) lacking introns, which integrate into locations distinct from germline loci. This brief review will present an overview of likely related recombination mechanisms and genomic cDNA-like sequences that implicate evolutionary origins for brain SGR. Similarities and differences exist between brain SGR and VDJ recombination in the immune system, the first identified SGR form that now has a well-defined enzymatic machinery. Both require gene transcription, but brain SGR uses an RNA intermediate and reverse transcriptase (RT) activity, which are characteristics shared with endogenous retrotransposons. The identified gencDNAs have similarities to other cDNA-like sequences existing throughout phylogeny, including intron-less genes and inactive germline processed pseudogenes, with likely overlapping biosynthetic processes. gencDNAs arise somatically in an individual to produce multiple copies; can be functional; appear most frequently within postmitotic cells; have diverse sequences; change with age; and can change with disease state. Normally occurring brain SGR may represent a mechanism for gene optimization and long-term cellular memory, whereas its dysregulation could underlie multiple brain disorders and, potentially, other diseases like cancer. The involvement of RT activity implicates already Food and Drug Administration–approved RT inhibitors as possible near-term interventions for managing SGR-associated diseases and suggest next-generation therapeutics targeting SGR elements.


2020 ◽  
Author(s):  
Wanxiangfu Tang ◽  
Ping Liang

AbstractMobile elements (MEs) can be divided into two major classes based on their transposition mechanisms as retrotransposons and DNA transposons. DNA transposons move in the genomes directly in the form of DNA in a cut-and-paste style, while retrotransposons utilize an RNA-intermediate to transpose in a “copy-and-paste” fashion. In addition to the target site duplications (TSDs), a hallmark of transposition shared by both classes, the DNA transposons also carry terminal inverted repeats (TIRs). DNA transposons constitute ~3% of primate genomes and they are thought to be inactive in the recent primate genomes since ~37My ago despite their success during early primate evolution. Retrotransposons can be further divided into Long Terminal Repeat retrotransposons (LTRs), which are characterized by the presence of LTRs at the two ends, and non-LTRs, which lack LTRs. In the primate genomes, LTRs constitute ~9% of genomes and have a low level of ongoing activity, while non-LTR retrotransposons represent the major types of MEs, contributing to ~37% of the genomes with some members being very young and currently active in retrotransposition. The four known types of non-LTR retrotransposons include LINEs, SINEs, SVAs, and processed pseudogenes, all characterized by the presence of a polyA tail and TSDs, which mostly range from 8 to 15 bp in length. All non-LTR retrotransposons are known to utilize the L1-based target-primed reverse transcription (TPRT) machineries for retrotransposition. In this study, we report a new type of non-LTR retrotransposon, which we named as retro-DNAs, to represent DNA transposons by sequence but non-LTR retrotransposons by the transposition mechanism in the recent primate genomes. By using a bioinformatics comparative genomics approach, we identified a total of 1,750 retro-DNAs, which represent 748 unique insertion events in the human genome and nine non-human primate genomes from the ape and monkey groups. These retro-DNAs, mostly as fragments of full-length DNA transposons, carry no TIRs but longer TSDs with ~23.5% also carrying a polyA tail and with their insertion site motifs and TSD length pattern characteristic of non-LTR retrotransposons. These features suggest that these retro-DNAs are DNA transposon sequences likely mobilized by the TPRT mechanism. Further, at least 40% of these retro-DNAs locate to genic regions, presenting significant potentials for impacting gene function. More interestingly, some retro-DNAs, as well as their parent sites, show certain levels of current transcriptional expression, suggesting that they have the potential to create more retro-DNAs in the current primate genomes. The identification of retro-DNAs, despite small in number, reveals a new mechanism in propagating the DNA transposons sequences in the primate genomes with the absence of canonical DNA transposon activity. It also suggests that the L1 TPRT machinery may have the ability to retrotranspose a wider variety of DNA sequences than what we currently know.


2020 ◽  
Author(s):  
Thiago L A Miller ◽  
Fernanda Orpinelli ◽  
José Leonel L Buzzo ◽  
Pedro A F Galante

ABSTRACTRetrocopies or processed pseudogenes are gene copies resulting from mRNA retrotransposition. These gene duplicates can be fixed, somatically inserted or dimorphic in the genome. However, knowledge regarding unfixed retrocopies (retroCNVs) is still limited, and the development of computational tools for effectively identifying and genotyping them is an urgent need. Here, we present sideRETRO, a pipeline dedicated not only to detecting retroCNVs in whole-genome or whole-exome sequencing data but also to revealing their insertion sites, zygosity, and genomic context and classifying them as somatic or dimorphic events. We show that sideRETRO can identify novel retroCNVs and genotype them (93.2% accuracy), in addition to identifying dimorphic retroCNVs in whole-genome and whole-exome data. Therefore, sideRETRO fills a gap in the literature and presents an efficient and straightforward algorithm to accelerate the study of retroCNVs.AvailabilitysideRETRO is available at https://github.com/galantelab/sideRETRO


2019 ◽  
Author(s):  
Anyou Wang ◽  
Hai Rong

Gene regulation interprets most variations of biological phenotype and remains a crucial topic in biology. Conventionally, manipulating gene sequences like knockout helps to infer gene regulation, but these inferences suffer several pitfalls like transcript compensation1, leading to biased results. An unbiased regulation has rarely been appreciated. Here, we develop a software, FINET2, to infer unbiased regulatory networks from massive data, including all human RNAseq data publicly available from Sequence Read Archive (SRA, 274469 samples) and The Cancer Genome Atlas (TCGA, 11574 samples), and unearth the general regulatory rules in normal genome and cancer as deposited3. Generally, the genome is positively regulated. Regulators primarily self-regulate their targets in the same annotated category, like processed-pseudogenes regulating processed-pseudogenes. At normal, ribosomal proteins drive the regulatory network, and proteins tightly control the genome and primarily regulate the remote proteins across chromosomes, but rarely regulate local targets (<1M bp), yet cancer noncoding RNAs, especially pseudogenes, strongly activate the cancer genome and induce local targets, including noncoding RNAs and proteins. As a result, the whole regulatory regime switches from a normal remote protein-controlled domain to a cancerous local noncoding RNA-activated niche. This parallels with our recent discovery from clinical data revealing noncoding RNAs as the deadliest drivers for cancer4, instead of proteins as conventionally thought. This refreshes the fundamental basis of cancer research and therapy. Our overall finding provides a systems version of the natural regulatory regime in human genome, which helps to correct the biased notions standing in current literature.


Sign in / Sign up

Export Citation Format

Share Document