scholarly journals Ab initio identification of transcription start sites in the Rhesus macaque genome by histone modification and RNA-Seq

2010 ◽  
Vol 39 (4) ◽  
pp. 1408-1418 ◽  
Author(s):  
Yi Liu ◽  
Dali Han ◽  
Yixing Han ◽  
Zheng Yan ◽  
Bin Xie ◽  
...  
Science ◽  
2007 ◽  
Vol 316 (5822) ◽  
pp. 222-234 ◽  
Author(s):  
◽  
R. A. Gibbs ◽  
J. Rogers ◽  
M. G. Katze ◽  
R. Bumgarner ◽  
...  

2016 ◽  
Author(s):  
Francisco Avila Cobos ◽  
Jasper Anckaert ◽  
Pieter-Jan Volders ◽  
Dries Rombaut ◽  
Jo Vandesompele ◽  
...  

AbstractSummaryReconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. The Zipper plot is an application that enables users to interrogate putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These features are obtained from publicly available datasets including CAGE-sequencing (CAGE-seq), ChIP-sequencing (ChIP-seq) for histone marks and DNasesequencing (DNase-seq). The Zipper plot application requires three input fields (chromosome, genomic coordinate (hg19) of the TSS and strand) and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot.Availability and ImplementationThe Zipper plot is implemented using the statistical programming language R and is freely available at http://[email protected]; [email protected]; [email protected] informationSupplementary Methods available online.


2008 ◽  
Vol 17 (8) ◽  
pp. 1127-1136 ◽  
Author(s):  
Arthur S. Lee ◽  
María Gutiérrez-Arcelus ◽  
George H. Perry ◽  
Eric J. Vallender ◽  
Welkin E. Johnson ◽  
...  

Science ◽  
2007 ◽  
Vol 316 (5822) ◽  
pp. 238-240 ◽  
Author(s):  
K. Han ◽  
M. K. Konkel ◽  
J. Xing ◽  
H. Wang ◽  
J. Lee ◽  
...  

2021 ◽  
Author(s):  
Juexiao Zhou ◽  
bin zhang ◽  
Haoyang Li ◽  
Longxi Zhou ◽  
Zhongxiao Li ◽  
...  

Abstract The accurate annotation of transcription start sites (TSSs) and their usage is critical for the mechanistic understanding of gene regulation under different biological contexts. To fulfil this, on one hand, specific high-throughput experimental technologies have been developed to capture TSSs in a genome-wide manner. On the other hand, various computational tools have also been developed for in silico prediction of TSSs solely based on genomic sequences. Most of these computational tools cast the problem as a binary classification task on a balanced dataset and thus result in drastic false positive predictions when applied on the genome-scale. To address these issues, we present DeeReCT-TSS, a deep-learning-based method that is capable of TSSs identification across the whole genome based on both DNA sequences and conventional RNA-seq data. We show that by effectively incorporating these two sources of information, DeeReCT-TSS significantly outperforms other solely sequence-based methods on the precise annotation of TSSs used in different cell types. Furthermore, we develop a meta-learning-based extension for simultaneous transcription start site (TSS) annotation on 10 cell types, which enables the identification of cell-type-specific TSS. Finally, we demonstrate the high precision of DeeReCT-TSS on two independent datasets from the ENCODE project by correlating our predicted TSSs with experimentally defined TSS chromatin states. Our application, pre-trained models and data are available at https://github.com/JoshuaChou2018/DeeReCT-TSS_release.


2017 ◽  
Author(s):  
Charles Cole ◽  
Ashley Byrne ◽  
Anna E. Beaudin ◽  
E. Camilla Forsberg ◽  
Christopher Vollmers

AbstractRNA-seq is a powerful technique to investigate and quantify entire transcriptomes. Recent advances in the field have made it possible to explore the transcriptomes of single cells. However, most widely used RNA-seq protocols fail to provide crucial information regarding transcription start sites. Here we present a protocol, Tn5Prime, that takes advantage of the Tn5 transposase based Smartseq2 protocol to create RNA-seq libraries that capture the 5’ end of transcripts. The Tn5Prime method dramatically streamlines the 5’ capture process and is both cost effective and reliable. By applying Tn5Prime to bulk RNA and single cell samples we were able to define transcription start sites as well as quantify transcriptomes at high accuracy and reproducibility. Additionally, similar to 3’ end based high-throughput methods like Drop-Seq and 10X Genomics Chromium, the 5’ capture Tn5Prime method allows the introduction of cellular identifiers during reverse transcription, simplifying the analysis of large numbers of single cells. In contrast to 3’ end based methods, Tn5Prime also enables the assembly of the variable 5’ ends of antibody sequences present in single B-cell data. Therefore, Tn5Prime presents a robust tool for both basic and applied research into the adaptive immune system and beyond.


Science ◽  
2020 ◽  
Vol 370 (6523) ◽  
pp. eabc6617
Author(s):  
Wesley C. Warren ◽  
R. Alan Harris ◽  
Marina Haukness ◽  
Ian T. Fiddes ◽  
Shwetha C. Murali ◽  
...  

The rhesus macaque (Macaca mulatta) is the most widely studied nonhuman primate (NHP) in biomedical research. We present an updated reference genome assembly (Mmul_10, contig N50 = 46 Mbp) that increases the sequence contiguity 120-fold and annotate it using 6.5 million full-length transcripts, thus improving our understanding of gene content, isoform diversity, and repeat organization. With the improved assembly of segmental duplications, we discovered new lineage-specific genes and expanded gene families that are potentially informative in studies of evolution and disease susceptibility. Whole-genome sequencing (WGS) data from 853 rhesus macaques identified 85.7 million single-nucleotide variants (SNVs) and 10.5 million indel variants, including potentially damaging variants in genes associated with human autism and developmental delay, providing a framework for developing noninvasive NHP models of human disease.


PLoS ONE ◽  
2016 ◽  
Vol 11 (12) ◽  
pp. e0167376 ◽  
Author(s):  
Julien S. Gradnigo ◽  
Abhishek Majumdar ◽  
Robert B. Norgren ◽  
Etsuko N. Moriyama

2011 ◽  
Author(s):  
Zhiyong Huang ◽  
Zhiyong Huang ◽  
Guangmei Yan ◽  
Jun Wang ◽  
Xiaoning Wang ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document