Annotating full-length long noncoding RNAs

2018 ◽  
Vol 15 (1) ◽  
pp. 16-16

2020 ◽  
Author(s):  
Xiaomin Zheng ◽  
Yanjun Chen ◽  
Yifan Zhou ◽  
Danyang Li ◽  
Keke Shi ◽  
...  

AbstractLong noncoding RNAs (lncRNAs) are crucial factors during plant development and environmental responses. High-throughput and accurate identification of lncRNAs is still lacking in plants. To build an accurate atlas of lncRNA in cotton, we combined Isoform-sequencing (Iso-seq), strand-specific RNA-seq (ssRNA-seq), cap analysis gene expression (CAGE-seq) with PolyA-seq and compiled a pipeline named plant full-length lncRNA (PULL) to integrate multi-omics data. A total of 9240 lncRNAs from 21 tissue samples of the diploid cotton Gossypium arboreum were identified. We revealed that alternative usage of transcription start site (TSS) and transcription end site (TES) of lncRNAs occurs pervasively during plant growth and responses to stress. We identified the lncRNAs which co-expressed or be linked to the protein coding genes (PCGs) or GWAS studied SNPs associated with ovule and fiber development. We also mapped the genome-wide binding sites of two lncRNAs with chromatin isolation by RNA purification sequencing (ChIRP-seq) and validated the trans transcriptional regulation of lnc-Ga13g0352 via virus induced gene suppression (VIGS) assay. These findings provide valuable research resources for plant community and broaden our understandings of biogenesis and regulation function of plant lncRNAs.One sentence summaryThe full-length annotation and transcriptional regulation of long noncoding RNAs in cotton.



Author(s):  
Sílvia Carbonell Sala ◽  
Barbara Uszczyńska-Ratajczak ◽  
Julien Lagarde ◽  
Rory Johnson ◽  
Roderic Guigó


2017 ◽  
Author(s):  
Julien Lagarde ◽  
Barbara Uszczynska-Ratajczak ◽  
Silvia Carbonell ◽  
SÍlvia Pérez-Lluch ◽  
Amaya Abad ◽  
...  

AbstractAccurate annotations of genes and their transcripts is a foundation of genomics, but no annotation technique presently combines throughput and accuracy. As a result, reference gene collections remain incomplete: many gene models are fragmentary, while thousands more remain uncatalogued–particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), combining targeted RNA capture with third-generation long-read sequencing. We present an experimental re-annotation of the GENCODE intergenic lncRNA population in matched human and mouse tissues, resulting in novel transcript models for 3574 / 561 gene loci, respectively. CLS approximately doubles the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enable us to definitively characterize the genomic features of lncRNAs, including promoter- and gene-structure, and protein-coding potential. Thus CLS removes a longstanding bottleneck of transcriptome annotation, generating manual-quality full-length transcript models at high-throughput scales.Abbreviationsbpbase pairFLfull lengthntnucleotideROIread of insert, i.e. PacBio readSJsplice junctionSMRTsingle-molecule real-timeTMtranscript model



BMC Genomics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Ying Wan ◽  
Xiaoyang Liu ◽  
Dongwang Zheng ◽  
Yuying Wang ◽  
Huan Chen ◽  
...  


2017 ◽  
Vol 49 (12) ◽  
pp. 1731-1740 ◽  
Author(s):  
Julien Lagarde ◽  
Barbara Uszczynska-Ratajczak ◽  
Silvia Carbonell ◽  
Sílvia Pérez-Lluch ◽  
Amaya Abad ◽  
...  


2019 ◽  
Vol 29 (2) ◽  
pp. 113-121 ◽  
Author(s):  
Xue-ying Zhang ◽  
Lian-wen Zheng ◽  
Chun-jin Li ◽  
Ying Xu ◽  
Xu Zhou ◽  
...  


Sign in / Sign up

Export Citation Format

Share Document