EPD in 2020: enhanced data visualization and extension to ncRNA promoters

Author(s):  
Patrick Meylan ◽  
René Dreos ◽  
Giovanna Ambrosini ◽  
Romain Groux ◽  
Philipp Bucher

Abstract The Eukaryotic Promoter Database (EPD), available online at https://epd.epfl.ch, provides accurate transcription start site (TSS) information for promoters of 15 model organisms plus corresponding functional genomics data that can be viewed in a genome browser, queried or analyzed via web interfaces, or exported in standard formats (FASTA, BED, CSV) for subsequent analysis with other tools. Recent work has focused on the improvement of the EPD promoter viewers, which use the UCSC Genome Browser as visualization platform. Thousands of high-resolution tracks for CAGE, ChIP-seq and similar data have been generated and organized into public track hubs. Customized, reproducible promoter views, combining EPD-supplied tracks with native UCSC Genome Browser tracks, can be accessed from the organism summary pages or from individual promoter entries. Moreover, thanks to recent improvements and stabilization of ncRNA gene catalogs, we were able to release promoter collections for certain classes of ncRNAs from human and mouse. Furthermore, we developed automatic computational protocols to assign orphan TSS peaks to downstream genes based on paired-end (RAMPAGE) TSS mapping data, which enabled us to add nearly 9000 new entries to the human promoter collection. Since our last article in this journal, EPD was extended to five more model organisms: rhesus monkey, rat, dog, chicken and Plasmodium falciparum.

2020 ◽  
Vol 49 (D1) ◽  
pp. D1046-D1057 ◽  
Author(s):  
Jairo Navarro Gonzalez ◽  
Ann S Zweig ◽  
Matthew L Speir ◽  
Daniel Schmelter ◽  
Kate R Rosenbloom ◽  
...  

Abstract For more than two decades, the UCSC Genome Browser database (https://genome.ucsc.edu) has provided high-quality genomics data visualization and genome annotations to the research community. As the field of genomics grows and more data become available, new modes of display are required to accommodate new technologies. New features released this past year include a Hi-C heatmap display, a phased family trio display for VCF files, and various track visualization improvements. Striving to keep data up-to-date, new updates to gene annotations include GENCODE Genes, NCBI RefSeq Genes, and Ensembl Genes. New data tracks added for human and mouse genomes include the ENCODE registry of candidate cis-regulatory elements, promoters from the Eukaryotic Promoter Database, and NCBI RefSeq Select and Matched Annotation from NCBI and EMBL-EBI (MANE). Within weeks of learning about the outbreak of coronavirus, UCSC released a genome browser, with detailed annotation tracks, for the SARS-CoV-2 RNA reference assembly.


BMC Genomics ◽  
2015 ◽  
Vol 16 (1) ◽  
Author(s):  
Jiyuan An ◽  
John Lai ◽  
David L A Wood ◽  
Atul Sajjanhar ◽  
Chenwei Wang ◽  
...  

Genetics ◽  
2002 ◽  
Vol 162 (4) ◽  
pp. 1863-1873 ◽  
Author(s):  
J Slate ◽  
P M Visscher ◽  
S MacGregor ◽  
D Stevens ◽  
M L Tate ◽  
...  

Abstract Recent empirical evidence indicates that although fitness and fitness components tend to have low heritability in natural populations, they may nonetheless have relatively large components of additive genetic variance. The molecular basis of additive genetic variation has been investigated in model organisms but never in the wild. In this article we describe an attempt to map quantitative trait loci (QTL) for birth weight (a trait positively associated with overall fitness) in an unmanipulated, wild population of red deer (Cervus elaphus). Two approaches were used: interval mapping by linear regression within half-sib families and a variance components analysis of a six-generation pedigree of >350 animals. Evidence for segregating QTL was found on three linkage groups, one of which was significant at the genome-wide suggestive linkage threshold. To our knowledge this is the first time that a QTL for any trait has been mapped in a wild mammal population. It is hoped that this study will stimulate further investigations of the genetic architecture of fitness traits in the wild.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Huihui Li ◽  
Mingzhe Xie ◽  
Yan Wang ◽  
Ludong Yang ◽  
Zhi Xie ◽  
...  

Abstract riboCIRC is a translatome data-oriented circRNA database specifically designed for hosting, exploring, analyzing, and visualizing translatable circRNAs from multiple species. The database provides a comprehensive repository of computationally predicted ribosome-associated circRNAs; a manually curated collection of experimentally verified translated circRNAs; an evaluation of cross-species conservation of translatable circRNAs; a systematic de novo annotation of putative circRNA-encoded peptides, including sequence, structure, and function; and a genome browser to visualize the context-specific occupant footprints of circRNAs. It represents a valuable resource for the circRNA research community and is publicly available at http://www.ribocirc.com.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Soo Bin Kwon ◽  
Jason Ernst

Abstract Identifying genomic regions with functional genomic properties that are conserved between human and mouse is an important challenge in the context of mouse model studies. To address this, we develop a method to learn a score of evidence of conservation at the functional genomics level by integrating information from a compendium of epigenomic, transcription factor binding, and transcriptomic data from human and mouse. The method, Learning Evidence of Conservation from Integrated Functional genomic annotations (LECIF), trains neural networks to generate this score for the human and mouse genomes. The resulting LECIF score highlights human and mouse regions with shared functional genomic properties and captures correspondence of biologically similar human and mouse annotations. Analysis with independent datasets shows the score also highlights loci associated with similar phenotypes in both species. LECIF will be a resource for mouse model studies by identifying loci whose functional genomic properties are likely conserved.


2002 ◽  
Vol 06 (24) ◽  
pp. 958-965
Author(s):  
Jun Yu ◽  
Jian Wang ◽  
Huanming Yang

The time has come for a coordinated international effort to sequence agricultural and livestock genomes. While the human genome and the genomes of many model organisms (related to human health and basic biological interests) have been sequenced or placed in sequencing pipelines, agronomically important crop and livestock genomes have not been given high enough priority. Although we face many challenges in policy-making, grant funding, regional task emphasis, research community consensus and technology innovation, many initiatives are being announced and formulated around a cost-effective, large-scale procedure known as whole genome shotgun (WGS) sequencing, which produces draft sequences covering 95 to 99 percent of a genome. Genes identified from such draft sequences, coupled with other resources such as molecular markers, large-insert clones and cDNA sequences, provide ample information and tools to further our knowledge of agricultural and environmental biology in a genome era that is just entering its accelerated period. If the campaign succeeds, molecular biologists, geneticists and field biologists from all countries, rich or poor, would be brought to the same starting point and could expect another astronomical increase in basic genomic information, ready to be converted into knowledge that will ultimately change our lives and environment for the better. We call upon national and international governmental agencies and organizations, as well as research foundations, to support this unprecedented movement.


2007 ◽  
pp. 17-34 ◽  
Author(s):  
Donna Karolchik ◽  
Gill Bejerano ◽  
Angie S. Hinrichs ◽  
Robert M. Kuhn ◽  
Webb Miller ◽  
...  

2017 ◽  
Vol 46 (D1) ◽  
pp. D762-D769 ◽  
Author(s):  
Jonathan Casper ◽  
Ann S Zweig ◽  
Chris Villarreal ◽  
Cath Tyner ◽  
Matthew L Speir ◽  
...  

Abstract The UCSC Genome Browser (https://genome.ucsc.edu) provides a web interface for exploring annotated genome assemblies. The assemblies and annotation tracks are updated on an ongoing basis—12 assemblies and more than 28 tracks were added in the past year. Two recent additions are a display of CRISPR/Cas9 guide sequences and an interactive navigator for gene interactions. Other upgrades from the past year include a command-line version of the Variant Annotation Integrator, support for Human Genome Variation Society variant nomenclature input and output, and a revised highlighting tool that now supports multiple simultaneous regions and colors.

