Abstract
Long noncoding RNAs (lncRNAs) have emerged as critical regulators of biological processes. However, the aberrant expression of an isoform from the same lncRNA gene could lead to RNA with altered functions due to changes in their conformations, leading to diseases. Here, we describe a detailed characterization of the gene which encodes long intergenic non-protein coding RNA 01016 (LINC01016, a.k.a., LncRNA1195) with a focus on its structure, exon usage, and expression in human and macaque tissues. In this study, we show that it is among the highly expressed lncRNAs in the testis, exclusively conserved among non-human primates, suggesting its recent evolution and is expressed and processed into 12 distinct RNAs in testis, cervix, and uterus tissues. Further, we integrate de novo annotation of expressed LINC01016 transcripts and isoform-dependent gene expression analyses to show that human LINC01016 is a multi-exon gene, processed through differential exon usage with isoform-specific roles. Furthermore, in cervical, testicular, and uterine cancers, LINC01016 isoforms are differentially expressed, and their expression is predictive of survival in these cancers. The study has revealed an essential aspect of lncRNA biology, which is rarely associated with coding RNAs that lncRNA genes are precisely processed to generate isoforms with distinct biological roles in specific tissues.