COSPAR Panel on Space Weather Resolutions on Metadata Standards and Data Access

2021 ◽  
Vol 212 ◽  
pp. 19
Author(s):  
Kimberlyn McGrail ◽  
Brent Diverty ◽  
Lisa Lix

IntroductionNotwithstanding Canada’s exceptional longitudinal health data and research centres with extensive experience transforming data into knowledge, many Canadian studies based on linked administrative data have focused on a single province or territory. Health Data Research Network Canada (HDRN Canada), a new not-for-profit corporation, will bring together major national, provincial and territorial health data stewards from across Canada. HDRN Canada’s first initiative is the $81 million SPOR Canadian Data Platform funded under the Canadian Institutes of Health Research Strategy for Patient-Oriented Research (SPOR). Objectives and ApproachHDRN Canada is a distributed network through which individual data-holding centres work together to (i) create a single portal and support system for researchers requesting multi-jurisdictional data, (ii) harmonize and validate case definitions and key analytic variables across jurisdictions, (iii) expand the sources and types of data linkages, (iv) develop technological infrastructure to improve data access and collection, (v) create supports for advanced analytics and (vi) establish strong partnerships with patients, the public and with Indigenous communities. We will share our experiences and gather international feedback on our network and its goals from symposium participants. ResultsIn January 2020, HDRN Canada launched its Data Access Support Hub (DASH) which includes an inventory listing over 380 datasets, information about more than 120 algorithms and a repository of requirements and processes for accessing data. HDRN Canada is receiving requests for multi-province research studies that would be challenging to conduct without HDRN Canada. Conclusion / ImplicationsThus far, HDRN Canada services and tools have been developed primarily for Canadian researchers but HDRN Canada can also serve as a prompt for an international discussion about what has/has not worked in terms of multi-jurisdictional research data infrastructure. It can also present an opportunity for the development of metadata, standards and common approaches that support more multi-country research.


Author(s):  
Manuel Rodríguez-Pascual ◽  
Christos Kanellopoulos ◽  
Antonio Juan Rubio-Montero ◽  
Diego Darriba ◽  
Ognjen Prnjat ◽  
...  

Nowadays, computing calculations are becoming more and more demanding due to the huge pool of resources available. This demand must be satisfied in terms of computational efficiency and resilience, which is compromised in distributed and heterogeneous platforms. Not only this, data obtained are often either reused by other researchers or recalculated. In this work, a set of tools to overcome the problem of creating and executing fault tolerant distributed applications on dynamic environments is presented. Such a set also ensures the reproducibility of the performed experiments providing a portable, unattended and resilient framework that encapsulates the infrastructure-dependent operations away from the application developers and users, allowing performing experiments based on Open Access data repositories. In this way, users can seamlessly search and lately access datasets that can be automatically retrieved as input data into a code already integrated in the proposed workflow. Such a search is based on metadata standards and relies on Persistent Identifiers (PID) to assign specific repositories. The applications profit from Distributed Toolbox, a framework devoted to the creation and execution of distributed applications and includes tools for unattended cluster and grid execution, where a total fault tolerance is provided. By decoupling the definition of the remote tasks from its execution and control, the development, execution and maintenance of distributed applications is significantly simplified with respect to previous solutions, increasing their robustness and allowing running them on different computational platforms with little effort. The integration with Open Access databases and employment of PIDs for long-lasting references ensures that the data related to the experiments will persist, closing a complete research circle of data access/processing/storage/dissemination of results.


2010 ◽  
Vol 7 (3) ◽  
Author(s):  
Rasmus H. Fogh ◽  
Wayne Boucher ◽  
John M.C. Ionides ◽  
Wim F. Vranken ◽  
Tim J. Stevens ◽  
...  

SummaryIn recent years the amount of biological data has exploded to the point where much useful information can only be extracted by complex computational analyses. Such analyses are greatly facilitated by metadata standards, both in terms of the ability to compare data originating from different sources, and in terms of exchanging data in standard forms, e.g. when running processes on a distributed computing infrastructure. However, standards thrive on stability whereas science tends to constantly move, with new methods being developed and old ones modified. Therefore maintaining both metadata standards, and all the code that is required to make them useful, is a non-trivial problem. Memops is a framework that uses an abstract definition of the metadata (described in UML) to generate internal data structures and subroutine libraries for data access (application programming interfaces - APIs - currently in Python, C and Java) and data storage (in XML files or databases). For the individual project these libraries obviate the need for writing code for input parsing, validity checking or output. Memops also ensures that the code is always internally consistent, massively reducing the need for code reorganisation. Across a scientific domain a Memops-supported data model makes it easier to support complex standards that can capture all the data produced in a scientific area, share them among all programs in a complex software pipeline, and carry them forward to deposition in an archive. The principles behind the Memops generation code will be presented, along with example applications in Nuclear Magnetic Resonance (NMR) spectroscopy and structural biology.


Space Weather ◽  
2004 ◽  
Vol 2 (4) ◽  
pp. n/a-n/a ◽  
Author(s):  
Anna Belehaki ◽  
Jean Lilensten ◽  
Toby Clark
Keyword(s):  

Space Weather ◽  
2003 ◽  
Vol 1 (1) ◽  
pp. n/a-n/a ◽  
Author(s):  
Robert Robinson
Keyword(s):  

Space Weather ◽  
2005 ◽  
Vol 3 (1) ◽  
pp. n/a-n/a
Author(s):  
Sarah Simpson
Keyword(s):  

Space Weather ◽  
2004 ◽  
Vol 2 (12) ◽  
pp. n/a-n/a
Author(s):  
Jonathan Lifland
Keyword(s):  

Space Weather ◽  
2004 ◽  
Vol 2 (7) ◽  
pp. n/a-n/a
Author(s):  
Sarah Simpson
Keyword(s):  

Space Weather ◽  
2005 ◽  
Vol 3 (8) ◽  
pp. n/a-n/a
Author(s):  
Stephen Cole

Sign in / Sign up

Export Citation Format

Share Document