long genes
Recently Published Documents


TOTAL DOCUMENTS

41
(FIVE YEARS 21)

H-INDEX

11
(FIVE YEARS 4)

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Xing Guo ◽  
Dongming Fang ◽  
Sunil Kumar Sahu ◽  
Shuai Yang ◽  
Xuanmin Guang ◽  
...  

AbstractChloranthales remain the last major mesangiosperm lineage without a nuclear genome assembly. We therefore assemble a high-quality chromosome-level genome of Chloranthus spicatus to resolve enigmatic evolutionary relationships, as well as explore patterns of genome evolution among the major lineages of mesangiosperms (eudicots, monocots, magnoliids, Chloranthales, and Ceratophyllales). We find that synteny is highly conserved between genomic regions of Amborella, Vitis, and Chloranthus. We identify an ancient single whole-genome duplication (WGD) (κ) prior to the divergence of extant Chloranthales. Phylogenetic inference shows Chloranthales as sister to magnoliids. Furthermore, our analyses indicate that ancient hybridization may account for the incongruent phylogenetic placement of Chloranthales + magnoliids relative to monocots and eudicots in nuclear and chloroplast trees. Long genes and long introns are found to be prevalent in both Chloranthales and magnoliids compared to other angiosperms. Overall, our findings provide an improved context for understanding mesangiosperm relationships and evolution and contribute a valuable genomic resource for future investigations.


2021 ◽  
Author(s):  
Catalina Pereira ◽  
Ana Rita Rebelo ◽  
Dashiell Massey ◽  
John C. Schimenti ◽  
Robert S Weiss ◽  
...  

Genome instability (GIN) is a main contributing factor to congenital and somatic diseases, but its sporadic occurrence in individual cell cycles makes it difficult to study mechanistically. One profound manifestation of GIN is the formation of micronuclei (MN), the engulfment of chromosomes or chromosome fragments in their own nuclear structures separate from the main nucleus. Here, we developed MN-seq, an approach for sequencing the DNA contained within micronuclei. We applied MN-seq to mice with mutations in Mcm4 and Rad9a, which disrupt DNA replication, repair, and damage responses. Data analysis and simulations show that centromere presence, fragment length, and a heterogenous landscape of chromosomal fragility all contribute to the patterns of DNA present within MN. In particular, we show that long genes, but also gene-poor regions, are associated with chromosome breaks that lead to the enrichment of particular genomic sequences in MN, in a genetic background-specific manner. Finally, we introduce single-cell micronucleus sequencing (scMN-seq), an approach to sequence the DNA present in MN of individual cells. Together, sequencing micronuclei provides a systematic approach for studying GIN and reveals novel molecular associations with chromosome breakage and segregation.


Author(s):  
Sourena Soheili-Nezhad

Alzheimer’s disease (AD) is a genetically complex senile neurodegeneration with unknown etiology. The first gene discovered to be mutated in early-onset AD, the amyloid precursor protein (APP), has been widely assumed as a causal factor in the disease cascade due to its generation of Aβ species. APP has an evolutionarily conserved biological role and activates a signaling program with notable similarities to integrin—a cell adhesion receptor with a wide array of functions. Intriguingly, several AD genome-wide association study (GWAS) candidate genes, including the SHARPIN locus recently reported by us and others, influence signaling of the integrin pathway. Integrins are focal adhesion regulators and serve in nervous system development, synaptic plasticity, and Tau phosphorylation. These observations suggest that the function of APP probably goes beyond Aβ generation in AD. Aging—the strongest risk factor for AD—is associated with various clock-like events in cells. For instance, neurons are continuously impacted by stochastic ‘hits’ to their genomes in aging, in the forms of DNA damage, insertion-deletions, copy-number variations (CNVs) and other types of somatic mutations. DNA damage and somatic mutations can result in neoplastic changes and cancer in mitotically active cells. However, their consequences in post-mitotic cells such as aging neurons are less defined. The current hypothesis holds that the stochastic loss of DNA sequence data at random loci in aging affects longer genes by chance more frequently. As a result, the biological processes coordinated by long genes may be more vulnerable to such random aging effects. Curiously, as shown by us and others, long genes are strongly enriched for synapse- and cell adhesion-related ontologies, more than any other biological process or cellular compartment. In addition, among various cell types, neurons possess the highest levels of long gene expression and are therefore more vulnerable to such harmful effects. The long gene vulnerability hypothesis provides a simple link between aging and the genetic landscape of AD and warrants new strategies for disease modification.


Author(s):  
Feng Xiong ◽  
Ruoyu Wang ◽  
Joo-Hyung Lee ◽  
Shenglan Li ◽  
Shin-Fu Chen ◽  
...  

AbstractThe molecular basis underlying the interaction between retrotransposable elements (RTEs) and the human genome remains poorly understood. Here, we profiled N6-methyladenosine (m6A) deposition on nascent RNAs in human cells by developing a new method MINT-Seq, which revealed that many classes of RTE RNAs, particularly intronic LINE-1s (L1s), are strongly methylated. These m6A-marked intronic L1s (MILs) are evolutionarily young, sense-oriented to hosting genes, and are bound by a dozen RNA binding proteins (RBPs) that are putative novel readers of m6A-modified RNAs, including a nuclear matrix protein SAFB. Notably, m6A positively controls the expression of both autonomous L1s and co-transcribed L1 relics, promoting L1 retrotransposition. We showed that MILs preferentially reside in long genes with critical roles in DNA damage repair and sometimes in L1 suppression per se, where they act as transcriptional “roadblocks” to impede the hosting gene expression, revealing a novel host-weakening strategy by the L1s. In counteraction, the host uses the SAFB reader complex to bind m6A-L1s to reduce their levels, and to safeguard hosting gene transcription. Remarkably, our analysis identified thousands of MILs in multiple human fetal tissues, enlisting them as a novel category of cell-type-specific regulatory elements that often compromise transcription of long genes and confer their vulnerability in neurodevelopmental disorders. We propose that this m6A-orchestrated L1–host interaction plays widespread roles in gene regulation, genome integrity, human development and diseases.


Author(s):  
Alec Barrett ◽  
Rebecca McWhirter ◽  
Seth R Taylor ◽  
Alexis Weinreb ◽  
David M Miller ◽  
...  

Abstract A recent and powerful technique is to obtain transcriptomes from rare cell populations, such as single neurons in C. elegans, by enriching dissociated cells using fluorescent sorting. However, these cell samples often have low yields of RNA that present challenges in library preparation. This can lead to PCR duplicates, noisy gene expression for lowly expressed genes, and other issues that limit endpoint analysis. Further, some common resources, such as sequence specific kits for removing ribosomal RNA, are not optimized for non-mammalian samples. To advance library construction for such challenging samples, we compared two approaches for building RNAseq libraries from less than 10 nanograms of C. elegans RNA: SMARTSeq V4 (Takara), a widely used kit for selecting poly-adenylated transcripts; and SoLo Ovation (Tecan Genomics), a newly developed ribodepletion-based approach. For ribodepletion, we used a custom kit of 200 probes designed to match C. elegans rRNA gene sequences. We found that SoLo Ovation, in combination with our custom C. elegans probe set for rRNA depletion, detects an expanded set of noncoding RNAs, shows reduced noise in lowly expressed genes, and more accurately counts expression of long genes. The approach described here should be broadly useful for similar efforts to analyze transcriptomics when RNA is limiting.


2021 ◽  
Author(s):  
Alec Barrett ◽  
Rebecca McWhirter ◽  
Seth R Taylor ◽  
Alexis Weinreb ◽  
David M Miller ◽  
...  

ABSTRACTA recent and powerful technique is to obtain transcriptomes from rare cell populations, such as single neurons in C. elegans, by enriching dissociated cells using fluorescent sorting. However, these cell samples often have low yields of RNA that present challenges in library preparation. This can lead to PCR duplicates, noisy gene expression for lowly expressed genes, and other issues that limit endpoint analysis. Further, some common resources, such as sequence specific kits for removing ribosomal RNA, are not optimized for non-mammalian samples. To optimize library construction for such challenging samples, we compared two approaches for building RNAseq libraries from less than 10 nanograms of C. elegans RNA: SMARTSeq V4 (Takara), a widely used kit for selecting poly-adenylated transcripts; and SoLo Ovation (Tecan Genomics), a newly developed ribodepletion-based approach. For ribodepletion, we used a custom kit of 200 probes designed to match C. elegans rRNA gene sequences. We found that SoLo Ovation, in combination with our custom C. elegans probe set for rRNA depletion, detects an expanded set of noncoding RNAs, shows reduced noise in lowly expressed genes, and more accurately counts expression of long genes. The approach described here should be broadly useful for similar efforts to analyze transcriptomics when RNA is limiting.


Author(s):  
Ying Cao

Tumorigenesis is a complex biological phenomenon that includes extensive genetic and phenotypic heterogeneities and complicated regulatory mechanisms. In the recent few years, our studies demonstrate that tumor-initiating cells are similar to neural stem/progenitor cells in regulatory networks, tumorigenicity and pluripotent differentiation potential. In the review, I will make further discussion on these observations and propose a rule of cell biology by integrating these findings with evidence from developmental biology, tumor biology and evolution, which suggests that neural stemness underlies two coupled cell properties, tumorigenicity and pluripotent differentiation potential. Tumorigenicity and phenotypic heterogeneity in tumor is a result of acquirement of neural stemness in cells. The neural stemness property of tumor-initiating cells can hopefully integrate different concepts/hypotheses underlying tumorigenesis. Neural stem cells/neural progenitors and tumor-initiating cells share regulatory networks; both exhibit neural stemness, tumorigenicity and differentiation potential; both are dependent on expression or activation of ancestral genes (the atavistic effect); both rely primarily on aerobic glycolytic metabolism; both can differentiate into various cells or tissues that are derived from three germ layers, resembling severely disorganized or more severely degenerated process of embryonic development; both are enriched in long genes with more splice variants that provide more plastic scaffolds for cell differentiation, etc. The property of neural stemness might be a key point to understand tumorigenesis and pluripotent differentiation potential, and possibly explain certain pathological observations in tumors that have been inexplicable. Therefore, behind the complexity of tumorigenesis might be a general rule of cell biology, i.e., neural stemness represents the ground state of cell tumorigenicity and pluripotent differentiation potential.


2020 ◽  
Author(s):  
Sohyun Moon ◽  
Jacob Vazquez ◽  
Jerry Yingtao Zhao

AbstractBackgroundRecursive splicing (RS) is a mechanism to excise long introns from messenger RNA precursors. We focused on nuclear RNA, which is enriched for RS splicing intermediates and nascent transcripts, to investigate RS in the mouse brain.ResultsWe identified novel RS sites and discovered that RS is constitutive between excitatory and inhibitory neurons and between sexes in the mouse cerebral cortex. We found that the primary sequence context, including the U1 snRNA binding site, the polypyrimidine tract, and a strong 3’ splice site, distinguishes the RS AGGT site from hundreds of non-RS AGGT sites in the same intron. Moreover, we uncovered a new type of exon-like RS events termed exonicRS.ConclusionsWe demonstrate that nuclear total RNA sequencing is an efficient approach to identify RS events. We find the importance of the primary sequence context in the definition of RS AGGT sites. The exonicRS may represent an intermediate stage of RS sites evolving into annotated exons. Overall, our findings provide novel insights into the mechanisms of RS in long genes.


2020 ◽  
Author(s):  
Yura Kim ◽  
Mariam Naghavi ◽  
Ying-Tao Zhao

ABSTRACTThe human genome contains more than 4000 genes that are longer than 100 kb. These long genes require more time and resources to make a transcript than shorter genes do. Long genes have also been linked to various human diseases. Specific mechanisms are utilized by long genes to facilitate their transcription and co-transcriptional processes. This results in unique features in their multi-omics profiles. Although these unique profiles are important to understand long genes, a database that provides an integrated view and easy access to the multi-omics profiles of long genes does not exist. We leveraged the publicly accessible multi-omics data and systematically analyzed the genomic conservation, histone modifications, chromatin organization, tissue-specific transcriptome, and single cell transcriptome of 992 protein-coding genes that are longer than 200 kb in the mouse genome. We also examined the evolution history of their gene lengths in 15 species that belong to six Classes and 11 Orders. To share the multi-omics profiles of long genes, we developed a user-friendly and easy-to-use database, LongGeneDB (https://longgenedb.com), for users to search, browse, and download these profiles. LongGeneDB will be a useful data hub for the biomedical research community to understand long genes.


Sign in / Sign up

Export Citation Format

Share Document