ucsc genome browser
Recently Published Documents


TOTAL DOCUMENTS

69
(FIVE YEARS 13)

H-INDEX

27
(FIVE YEARS 1)

2021 ◽  
Author(s):  
Phillip Wyss ◽  
Carol Song ◽  
Minou Bina

In mammals, Imprinting Control Regions (ICRs) regulate a subset of genes in a parent-of-origin-specific manner. In both human and mouse, previous studies identified a set of CpG-rich motifs that occurred as clusters in ICRs and germline Differentially Methylated Regions (gDMRs). These motifs consist of the ZFP57 binding site (ZFBS) overlapping a subset of MLL binding units known as MLL morphemes. Furthermore, by creating plots for displaying the density of these overlaps, it became possible to locate known and candidate ICRs in mouse and human genomic DNA. Since genomic imprinting impacts many developmental and key physiological processes, we performed genome-wide analyses to create plots displaying the density of the CpG-rich motifs (ZFBS-morph overlaps) along Bos Taurus chromosomal DNA. We tailored our datasets so that they could be displayed on the UCSC genome browser (the build bosTau8). On the genome browser, we could view the ZFP57 binding sites, the ZFBS-morph overlaps, and peaks in the density-plots in the context of cattle RefSeq Genes, Non-Cow RefSeq Genes, CpG islands, and Single nucleotide polymorphisms (SNPs). Our datasets revealed the correspondence of peaks in plots to known and deduced ICRs in Bos Taurus genomic DNA. We illustrate that by uploading our datasets onto the UCSC genome browser, we could discover candidate ICRs in cattle DNA. In enlarged views, we could pinpoint the genes in the vicinity of candidate ICRs and thus discover potential imprinted genes.


2021 ◽  
Vol 50 (D1) ◽  
pp. D1-D10
Author(s):  
Daniel J Rigden ◽  
Xosé M Fernández

Abstract The 2022 Nucleic Acids Research Database Issue contains 185 papers, including 87 papers reporting on new databases and 85 updates from resources previously published in the Issue. Thirteen additional manuscripts provide updates on databases most recently published elsewhere. Seven new databases focus specifically on COVID-19 and SARS-CoV-2, including SCoV2-MD, the first of the Issue's Breakthrough Articles. Major nucleic acid databases reporting updates include MODOMICS, JASPAR and miRTarBase. The AlphaFold Protein Structure Database, described in the second Breakthrough Article, is the stand-out in the protein section, where the Human Proteoform Atlas and GproteinDb are other notable new arrivals. Updates from DisProt, FuzDB and ELM comprehensively cover disordered proteins. Under the metabolism and signalling section Reactome, ConsensusPathDB, HMDB and CAZy are major returning resources. In microbial and viral genomes taxonomy and systematics are well covered by LPSN, TYGS and GTDB. Genomics resources include Ensembl, Ensembl Genomes and UCSC Genome Browser. Major returning pharmacology resource names include the IUPHAR/BPS guide and the Therapeutic Target Database. New plant databases include PlantGSAD for gene lists and qPTMplants for post-translational modifications. The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). Our latest update to the NAR online Molecular Biology Database Collection brings the total number of entries to 1645. Following last year's major cleanup, we have updated 317 entries, listing 89 new resources and trimming 80 discontinued URLs. The current release is available at http://www.oxfordjournals.org/nar/database/c/.


Author(s):  
Anna Benet-Pagès ◽  
Kate Rosenbloom ◽  
Luis Nassar ◽  
Christopher Lee ◽  
Brian Raney ◽  
...  

The UCSC Genome Browser has been an important tool for genomics and clinical genetics since the sequence of the human genome was first released in 2000. As it has grown in scope to display more types of data it has also grown more complicated. The data, which are dispersed at many locations worldwide, are collected into one view on the Browser, where the graphical interface presents the data in one location. This supports the expertise of the researcher to interpret variants in the genome. Because the analysis of Single Nucleotide Variants (SNVs) and Copy Number Variants (CNVs) require interpretation of data at very different genomic scales, different data resources are required. We present here several Recommended Track Sets designed to facilitate the interpretation of variants in the clinic, offering quick access to datasets relevant to the appropriate scale.


2020 ◽  
Vol 49 (D1) ◽  
pp. D1-D9
Author(s):  
Daniel J Rigden ◽  
Xosé M Fernández

Abstract The 2021 Nucleic Acids Research database Issue contains 189 papers spanning a wide range of biological fields and investigation. It includes 89 papers reporting on new databases and 90 covering recent changes to resources previously published in the Issue. A further ten are updates on databases most recently published elsewhere. Seven new databases focus on COVID-19 and SARS-CoV-2 and many others offer resources for studying the virus. Major returning nucleic acid databases include NONCODE, Rfam and RNAcentral. Protein family and domain databases include COG, Pfam, SMART and Panther. Protein structures are covered by RCSB PDB and dispersed proteins by PED and MobiDB. In metabolism and signalling, STRING, KEGG and WikiPathways are featured, along with returning KLIFS and new DKK and KinaseMD, all focused on kinases. IMG/M and IMG/VR update in the microbial and viral genome resources section, while human and model organism genomics resources include Flybase, Ensembl and UCSC Genome Browser. Cancer studies are covered by updates from canSAR and PINA, as well as newcomers CNCdatabase and Oncovar for cancer drivers. Plant comparative genomics is catered for by updates from Gramene and GreenPhylDB. The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). The NAR online Molecular Biology Database Collection has been substantially updated, revisiting nearly 1000 entries, adding 90 new resources and eliminating 86 obsolete databases, bringing the current total to 1641 databases. It is available at https://www.oxfordjournals.org/nar/database/c/.


2020 ◽  
Vol 49 (D1) ◽  
pp. D916-D923
Author(s):  
Adam Frankish ◽  
Mark Diekhans ◽  
Irwin Jungreis ◽  
Julien Lagarde ◽  
Jane E Loveland ◽  
...  

Abstract The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.


2020 ◽  
Vol 49 (D1) ◽  
pp. D1046-D1057 ◽  
Author(s):  
Jairo Navarro Gonzalez ◽  
Ann S Zweig ◽  
Matthew L Speir ◽  
Daniel Schmelter ◽  
Kate R Rosenbloom ◽  
...  

Abstract For more than two decades, the UCSC Genome Browser database (https://genome.ucsc.edu) has provided high-quality genomics data visualization and genome annotations to the research community. As the field of genomics grows and more data become available, new modes of display are required to accommodate new technologies. New features released this past year include a Hi-C heatmap display, a phased family trio display for VCF files, and various track visualization improvements. Striving to keep data up-to-date, new updates to gene annotations include GENCODE Genes, NCBI RefSeq Genes, and Ensembl Genes. New data tracks added for human and mouse genomes include the ENCODE registry of candidate cis-regulatory elements, promoters from the Eukaryotic Promoter Database, and NCBI RefSeq Select and Matched Annotation from NCBI and EMBL-EBI (MANE). Within weeks of learning about the outbreak of coronavirus, UCSC released a genome browser, with detailed annotation tracks, for the SARS-CoV-2 RNA reference assembly.


2020 ◽  
Author(s):  
Morgan Howard ◽  
Bruce Kane ◽  
Mary Lepry ◽  
Paul Stey ◽  
Ashok Ragavendran ◽  
...  

AbstractBackground and objectiveAdvances in tumor genome sequencing created an urgent need for bioinformatics tools to support the interpretation of the clinical significance of the variants detected. VarStack is a web tool which is a base to retrieve somatic variant data in cancer from existing databases.MethodsVarStack incorporates data from several publicly available databases and presents them with an easy-to-navigate user-interface. It currently supports data from the Catalogue of Somatic Mutations in Cancer (COSMIC), gnomAD, cBioPortal, ClinVar, OncoKB and UCSC Genome browser. It retrieves the data from these databases and returns back to the user in a fraction of the time it would take to manually navigate each site independently.ResultsUsers submit a variant with gene symbol, peptide change, and coding sequence change. They may select a variety of tumor specific studies in cBioportal to search through in addition to their original query. The results from the databases are presented in tabs. Users can export the results as a CSV file. VarStack also has the batch search feature in which user submits a list of variants and download a CSV file with the data from the databases. With the batch search and data download options users can easily incorporate VarStack into their workflow or tools. VarStack saves time by providing variant data to the user from multiple databases in an easy-to-export and interpretable format.AvailabilityVarStack is freely available under https://varstack.brown.edu.


Database ◽  
2020 ◽  
Vol 2020 ◽  
Author(s):  
Morgan Howard ◽  
Bruce Kane ◽  
Mary Lepry ◽  
Paul Stey ◽  
Ashok Ragavendran ◽  
...  

Abstract Advances in tumor genome sequencing created an urgent need for bioinformatics tools to support the interpretation of the clinical significance of the variants detected. VarStack is a web tool which is a base to retrieve somatic variant data relating to cancer from existing databases. VarStack incorporates data from several publicly available databases and presents them with an easy-to-navigate user interface. It currently supports data from the Catalogue of Somatic Mutations in Cancer, gnomAD, cBioPortal, ClinVar, OncoKB, CiViC and UCSC Genome Browser. It retrieves the data from these databases and returns them back to the user in a fraction of the time it would take to manually navigate each site independently. Users submit a variant with a gene symbol, peptide change and coding sequence change. They may select a variety of tumor-specific studies in cBioPortal to search through in addition to their original query. The results from the databases are presented in tabs. Users can export the results as an Excel file. VarStack also has the batch search feature in which the user can submit a list of variants and download an Excel file with the data from the databases. With the batch search and data download options, users can easily incorporate VarStack into their workflow or tools. VarStack saves time by providing somatic variant information to the user from multiple databases in an easy-to-export and interpretable format. VarStack is freely available under https://varstack.brown.edu.


2019 ◽  
Author(s):  
Stefan Kurtenbach ◽  
J. William Harbour

AbstractWhile there are sophisticated resources available for displaying NGS data, including the Integrative Genomics Viewer (IGV) and the UCSC genome browser, exporting regions and assembling figures for publication remains challenging. In particular, customizing track appearance and overlaying track replicates is a manual and time-consuming process. Here, we present SparK, a tool which auto-generates publication-ready, high-resolution, true vector graphic figures from any NGS-based tracks, including RNA-seq, ChIP-seq, and ATAC-seq. Novel functions of SparK include averaging of replicates, plotting standard deviation tracks, and highlighting significantly changed areas. SparK is written in Python 3, making it executable on any major OS platform. Using command line prompts to generate figures allows later changes to be made very easy. For instance, if the genomic region of the plot needs to be changed, or tracks need to be added or removed, the figure can easily be re-generated within seconds without the manual process of re-exporting and re-assembling everything. After plotting with SparK, changes to the output SVG vector graphic files are simple to make, including text, lines, and colors. SparK is publicly available on GitHub: https://github.com/harbourlab/SparK.


Author(s):  
Patrick Meylan ◽  
René Dreos ◽  
Giovanna Ambrosini ◽  
Romain Groux ◽  
Philipp Bucher

Abstract The Eukaryotic Promoter Database (EPD), available online at https://epd.epfl.ch, provides accurate transcription start site (TSS) information for promoters of 15 model organisms plus corresponding functional genomics data that can be viewed in a genome browser, queried or analyzed via web interfaces, or exported in standard formats (FASTA, BED, CSV) for subsequent analysis with other tools. Recent work has focused on the improvement of the EPD promoter viewers, which use the UCSC Genome Browser as visualization platform. Thousands of high-resolution tracks for CAGE, ChIP-seq and similar data have been generated and organized into public track hubs. Customized, reproducible promoter views, combining EPD-supplied tracks with native UCSC Genome Browser tracks, can be accessed from the organism summary pages or from individual promoter entries. Moreover, thanks to recent improvements and stabilization of ncRNA gene catalogs, we were able to release promoter collections for certain classes of ncRNAs from human and mouse. Furthermore, we developed automatic computational protocols to assign orphan TSS peaks to downstream genes based on paired-end (RAMPAGE) TSS mapping data, which enabled us to add nearly 9000 new entries to the human promoter collection. Since our last article in this journal, EPD was extended to five more model organisms: rhesus monkey, rat, dog, chicken and Plasmodium falciparum.


Sign in / Sign up

Export Citation Format

Share Document