Integrating Biomolecular and Clinical Data for Cancer Research: Concepts and Challenges

2012 ◽  
pp. 159-172
Author(s):  
Pornpimol Charoentong ◽  
Hubert Hackl ◽  
Bernhard Mlecnik ◽  
Gabriela Bindea ◽  
Jerome Galon ◽  
...  
2021 ◽  
pp. 561-569
Author(s):  
Steven A. Eschrich ◽  
Jamie K. Teer ◽  
Phillip Reisman ◽  
Erin Siegel ◽  
Chandan Challa ◽  
...  

PURPOSE The use of genomics within cancer research and clinical oncology practice has become commonplace. Efforts such as The Cancer Genome Atlas have characterized the cancer genome and suggested a wealth of targets for implementing precision medicine strategies for patients with cancer. The data produced from research studies and clinical care have many potential secondary uses beyond their originally intended purpose. Effective storage, query, retrieval, and visualization of these data are essential to create an infrastructure to enable new discoveries in cancer research. METHODS Moffitt Cancer Center implemented a molecular data warehouse to complement the extensive enterprise clinical data warehouse (Health and Research Informatics). Seven different sequencing experiment types were included in the warehouse, with data from institutional research studies and clinical sequencing. RESULTS The implementation of the molecular warehouse involved the close collaboration of many teams with different expertise and a use case–focused approach. Cornerstones of project success included project planning, open communication, institutional buy-in, piloting the implementation, implementing custom solutions to address specific problems, data quality improvement, and data governance, unique aspects of which are featured here. We describe our experience in selecting, configuring, and loading molecular data into the molecular data warehouse. Specifically, we developed solutions for heterogeneous genomic sequencing cohorts (many different platforms) and integration with our existing clinical data warehouse. CONCLUSION The implementation was ultimately successful despite challenges encountered, many of which can be generalized to other research cancer centers.


2015 ◽  
Vol 75 (24) ◽  
pp. 5194-5201 ◽  
Author(s):  
Rebecca S. Jacobson ◽  
Michael J. Becich ◽  
Roni J. Bollag ◽  
Girish Chavan ◽  
Julia Corrigan ◽  
...  

2021 ◽  
Author(s):  
Rebecca Asiimwe ◽  
Stephanie Lam ◽  
Samuel Leung ◽  
Shanzhao Wang ◽  
Rachel Wan ◽  
...  

Abstract Background To drive translational medicine, modern day biobanks need to integrate with other sources of data (clinical, genomics) to support novel data-intensive research. Currently, vast amounts of research and clinical data remain in silos, held and managed by individual researchers, operating under different standards and governance structures; a framework that impedes sharing and use of data. In this article, we describe the journey of British Columbia’s Gynecological Cancer Research Program (OVCARE) in moving a traditional tumour biobank, outcome unit, and a collection of data silos, into an integrated data commons to support data standardization, data, and resources sharing under collaborative governance, as a means of providing the gynecologic cancer research community in British Columbia access to tissue samples and associated clinical and molecular data from thousands of patients. Results Through several engagements with stakeholders from various research institutions within our research community, we identified priorities and assessed infrastructure needs required to optimize and support data collections, storage and sharing, under three main research domains: 1) biospecimen collections, 2) molecular and genomics data, and 3) clinical data. We further built a governance model and a resource portal to implement protocols and standard operating procedures for seamless collections, management and governance of interoperable data, making genomic, and clinical data available to the broader research community. Conclusions Proper infrastructures for data collection, sharing and governance is a translational research imperative. We have consolidated our data holdings into a data commons, along with standardized operating procedures to meet research and ethics requirements of the gynecologic cancer community in British Columbia. The developed infrastructure brings together, diverse data, computing framework, as well as tools and applications for managing, analyzing, and sharing data. Our data commons bridges data access gaps and barriers to precision medicine and approaches for diagnostics, treatment and prevention of gynecological cancers, by providing access to large datasets required for data-intensive science.


2021 ◽  
Vol 16 ◽  
pp. 117727192110095
Author(s):  
Valerie Speirs

Tissue obtained from biobanks is frequently employed in biomarker studies. Biomarkers define objective, measurable characteristics of biological and biomedical procedures and have been used as indicators of clinical outcome. This article outlines some of the steps scientists should consider when embarking on biomarker research in cancer research using samples from biobanks and the importance and challenges of linking clinical data to biological samples.


Author(s):  
Hyo Soung Cha ◽  
Jip Min Jung ◽  
Seob Yoon Shin ◽  
Young Mi Jang ◽  
Phillip Park ◽  
...  

Data warehousing is the most important technology to address recent advances in precision medicine. However, a generic clinical data warehouse does not address unstructured and insufficient data. In precision medicine, it is essential to develop a platform that can collect and utilize data. Data were collected from electronic medical records, genomic sequences, tumor biopsy specimens, and national cancer control initiative databases in the National Cancer Center (NCC), Korea. Data were de-identified and stored in a safe and independent space. Unstructured clinical data were standardized and incorporated into cancer registries and linked to cancer genome sequences and tumor biopsy specimens. Finally, national cancer control initiative data from the public domain were independently organized and linked to cancer registries. We constructed a system for integrating and providing various cancer data called the Korea Cancer Big Data Platform (K-CBP). Although the K-CBP could be used for cancer research, the legal and regulatory aspects of data distribution and usage need to be addressed first. Nonetheless, the system will continue collecting data from cancer-related resources that will hopefully facilitate precision-based research.


2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Rebecca Asiimwe ◽  
Stephanie Lam ◽  
Samuel Leung ◽  
Shanzhao Wang ◽  
Rachel Wan ◽  
...  

Abstract Background To drive translational medicine, modern day biobanks need to integrate with other sources of data (clinical, genomics) to support novel data-intensive research. Currently, vast amounts of research and clinical data remain in silos, held and managed by individual researchers, operating under different standards and governance structures; a framework that impedes sharing and effective use of data. In this article, we describe the journey of British Columbia’s Gynecological Cancer Research Program (OVCARE) in moving a traditional tumour biobank, outcomes unit, and a collection of data silos, into an integrated data commons to support data standardization and resource sharing under collaborative governance, as a means of providing the gynecologic cancer research community in British Columbia access to tissue samples and associated clinical and molecular data from thousands of patients. Results Through several engagements with stakeholders from various research institutions within our research community, we identified priorities and assessed infrastructure needs required to optimize and support data collections, storage and sharing, under three main research domains: (1) biospecimen collections, (2) molecular and genomics data, and (3) clinical data. We further built a governance model and a resource portal to implement protocols and standard operating procedures for seamless collections, management and governance of interoperable data, making genomic, and clinical data available to the broader research community. Conclusions Proper infrastructures for data collection, sharing and governance is a translational research imperative. We have consolidated our data holdings into a data commons, along with standardized operating procedures to meet research and ethics requirements of the gynecologic cancer community in British Columbia. The developed infrastructure brings together, diverse data, computing frameworks, as well as tools and applications for managing, analyzing, and sharing data. Our data commons bridges data access gaps and barriers to precision medicine and approaches for diagnostics, treatment and prevention of gynecological cancers, by providing access to large datasets required for data-intensive science.


Author(s):  
Jianyuan Zhou ◽  
Xuecang Li ◽  
Jiaxin Chen ◽  
Taisong Li ◽  
Weijie Zhan ◽  
...  

AbstractChromatin accessibility is a crucial epigenetic concept that plays a biological role in oncology. As humans become more involved in cancer research, a comprehensive database is required to identify and annotate tumor chromatin accessible regions (CARs). Here, CATA was developed to provide cancer-related CAR annotation. Currently, CATA possesses 2,991,163 CARs, relevant clinical data, and transcription factor binding predictions for cancer CARs from 410 tumor samples of 24 cancer types. Furthermore, CARs were annotated by SNPs, risk SNPs, eQTLs, linkage disequilibrium SNPs, transcription factors, CNV, SNV, enhancer, and 450K methylation sites in our database. By combining all these resources, we believe that CATA will provide better service for researchers on oncology. Our database is accessible at http://bio.licpathway.net/cata/


Sign in / Sign up

Export Citation Format

Share Document