Data protection by design: Building the foundations of trustworthy data sharing

Data & Policy ◽

10.1017/dap.2020.1 ◽

2020 ◽

Vol 2 ◽

Author(s):

Sophie Stalla-Bourdillon ◽

Gefion Thuermer ◽

Johanna Walker ◽

Laura Carmichael ◽

Elena Simperl

Keyword(s):

Organizational Structure ◽

Data Sharing ◽

Data Protection ◽

Open Data ◽

Design Approach ◽

Core Data ◽

Commercial Agreements

Abstract Data trusts have been conceived as a mechanism to enable the sharing of data across entities where other formats, such as open data or commercial agreements, are not appropriate, and make data sharing both easier and more scalable. By our definition, a data trust is a legal, technical, and organizational structure for enabling the sharing of data for a variety of purposes. The concept of the “data trust” requires further disambiguation from other facilitating structures such as data collaboratives. Irrespective of the terminology used, attempting to create trust in order to facilitate data sharing, and create benefit to individuals, groups of individuals, or society at large, requires at a minimum a process-based mechanism, that is, a workflow that should have a trustworthiness-by-design approach at its core. Data protection by design should be a key component of such an approach.

Download Full-text

Data Dentistry: How Data Are Changing Clinical Care and Research

Journal of Dental Research ◽

10.1177/00220345211020265 ◽

2021 ◽

pp. 002203452110202

Author(s):

F. Schwendicke ◽

J. Krois

Keyword(s):

Health Care ◽

Data Sharing ◽

Clinical Care ◽

Open Data ◽

User Interaction ◽

Data Availability ◽

Related Data ◽

Data User ◽

Regulatory Data ◽

Consumer Data

Data are a key resource for modern societies and expected to improve quality, accessibility, affordability, safety, and equity of health care. Dental care and research are currently transforming into what we term data dentistry, with 3 main applications: 1) medical data analysis uses deep learning, allowing one to master unprecedented amounts of data (language, speech, imagery) and put them to productive use. 2) Data-enriched clinical care integrates data from individual (e.g., demographic, social, clinical and omics data, consumer data), setting (e.g., geospatial, environmental, provider-related data), and systems level (payer or regulatory data to characterize input, throughput, output, and outcomes of health care) to provide a comprehensive and continuous real-time assessment of biologic perturbations, individual behaviors, and context. Such care may contribute to a deeper understanding of health and disease and a more precise, personalized, predictive, and preventive care. 3) Data for research include open research data and data sharing, allowing one to appraise, benchmark, pool, replicate, and reuse data. Concerns and confidence into data-driven applications, stakeholders’ and system’s capabilities, and lack of data standardization and harmonization currently limit the development and implementation of data dentistry. Aspects of bias and data-user interaction require attention. Action items for the dental community circle around increasing data availability, refinement, and usage; demonstrating safety, value, and usefulness of applications; educating the dental workforce and consumers; providing performant and standardized infrastructure and processes; and incentivizing and adopting open data and data sharing.

Download Full-text

Fengyun Meteorological Satellite Products for Earth System Science Applications

Advances in Atmospheric Sciences ◽

10.1007/s00376-021-0425-3 ◽

2021 ◽

Author(s):

Di Xian ◽

Peng Zhang ◽

Ling Gao ◽

Ruijing Sun ◽

Haizhen Zhang ◽

...

Keyword(s):

Data Sharing ◽

Satellite Data ◽

Prediction Models ◽

Weather Forecasting ◽

Numerical Models ◽

Weather Prediction ◽

Vegetation Indices ◽

Open Data ◽

Earth System ◽

Inversion Algorithm

AbstractFollowing the progress of satellite data assimilation in the 1990s, the combination of meteorological satellites and numerical models has changed the way scientists understand the earth. With the evolution of numerical weather prediction models and earth system models, meteorological satellites will play a more important role in earth sciences in the future. As part of the space-based infrastructure, the Fengyun (FY) meteorological satellites have contributed to earth science sustainability studies through an open data policy and stable data quality since the first launch of the FY-1A satellite in 1988. The capability of earth system monitoring was greatly enhanced after the second-generation polar orbiting FY-3 satellites and geostationary orbiting FY-4 satellites were developed. Meanwhile, the quality of the products generated from the FY-3 and FY-4 satellites is comparable to the well-known MODIS products. FY satellite data has been utilized broadly in weather forecasting, climate and climate change investigations, environmental disaster monitoring, etc. This article reviews the instruments mounted on the FY satellites. Sensor-dependent level 1 products (radiance data) and inversion algorithm-dependent level 2 products (geophysical parameters) are introduced. As an example, some typical geophysical parameters, such as wildfires, lightning, vegetation indices, aerosol products, soil moisture, and precipitation estimation have been demonstrated and validated by in-situ observations and other well-known satellite products. To help users access the FY products, a set of data sharing systems has been developed and operated. The newly developed data sharing system based on cloud technology has been illustrated to improve the efficiency of data delivery.

Download Full-text

Research on Core Data Protection Based on Cloud Computing

Advances in Intelligent Systems and Computing - Cyber Security Intelligence and Analytics ◽

10.1007/978-3-030-15235-2_154 ◽

2019 ◽

pp. 1167-1174

Author(s):

Guoru Deng ◽

Shuai Hu ◽

Hui Yuan ◽

Liang Dong ◽

Yan Zhuang ◽

...

Keyword(s):

Cloud Computing ◽

Data Protection ◽

Core Data

Download Full-text

Archaeological documentation and data sharing: digital surveying and open data approach applied to archaeological fieldworks

Virtual Archaeology Review ◽

10.4995/var.2019.10377 ◽

2019 ◽

Vol 10 (20) ◽

pp. 17 ◽

Cited By ~ 3

Author(s):

Mattia Previtali ◽

Riccardo Valente

Keyword(s):

Information System ◽

Geographic Information System ◽

Data Sharing ◽

Open Data ◽

Geographic Information ◽

Published Data ◽

Archaeological Data ◽

Disciplinary Expertise ◽

Large Level ◽

The Impact

The open data paradigm is changing the research approach in many fields such as remote sensing and the social sciences. This is supported by governmental decisions and policies that are boosting the open data wave, and in this context archaeology is also affected by this new trend. In many countries, archaeological data are still protected or only limited access is allowed. However, the strong political and economic support for the publication of government data as open data will change the accessibility and disciplinary expertise in the archaeological field too. In order to maximize the impact of data, their technical openness is of primary importance. Indeed, since a spreadsheet is more usable than a PDF of a table, the availability of digital archaeological data, which is structured using standardised approaches, is of primary importance for the real usability of published data. In this context, the main aim of this paper is to present a workflow for archaeological data sharing as open data with a large level of technical usability and interoperability. Primary data is mainly acquired through the use of digital techniques (e.g. digital cameras and terrestrial laser scanning). The processing of this raw data is performed with commercial software for scan registration and image processing, allowing for a simple and semi-automated workflow. Outputs obtained from this step are then processed in modelling and drawing environments to generate digital models, both 2D and 3D. These crude geometrical data are then enriched with further information to generate a Geographic Information System (GIS) which is finally published as open data using Open Geospatial Consortium (OGC) standards to maximise interoperability.Highlights:<ul><li>Open data will change the accessibility and disciplinary expertise in the archaeological field.</li><li>The main aim of this paper is to present a workflow for archaeological data sharing as open data with a large level of interoperability.</li><li>Digital acquisition techniques are used to document archaeological excavations and a Geographic Information System (GIS) is generated that is published as open data.</li></ul>

Download Full-text

NIfTI-MRS: A standard format for magnetic resonance spectroscopic data

10.1101/2021.11.09.467912 ◽

2021 ◽

Author(s):

William T Clarke ◽

Mark Mikkelsen ◽

Georg Oeltzschner ◽

Tiffany Bell ◽

Amirmohammad Shamaei ◽

...

Keyword(s):

Data Sharing ◽

Open Data ◽

Imaging Data ◽

Standard Format ◽

Conversion Point ◽

Imaging Tool ◽

Data Formats ◽

Multiple Data ◽

Single Voxel ◽

Online Documentation

Purpose: The use of multiple data formats in the MRS community currently hinders data sharing and integration. NIfTI-MRS is proposed as a standard MR spectroscopy data format, which is implemented as an extension to the neuroimaging informatics technology initiative (NIfTI) format. Using this standardised format will facilitate data sharing, ease algorithm development, and encourage the integration of MRS analysis with other imaging modalities. Methods: A file format based on the NIfTI header extension framework was designed to incorporate essential spectroscopic metadata and additional encoding dimensions. A detailed description of the specification is provided. An open-source command-line conversion program is implemented to enable conversion of single-voxel and spectroscopic imaging data to NIfTI-MRS. To provide visualisation of data in NIfTI-MRS, a dedicated plugin is implemented for FSLeyes, the FSL image viewer. Results: Alongside online documentation, ten example datasets are provided in the proposed format. In addition, minimal examples of NIfTI-MRS readers have been implemented. The conversion software, spec2nii, currently converts fourteen formats to NIfTI-MRS, including DICOM and vendor proprietary formats. Conclusion: The proposed format aims to solve the issue of multiple data formats being used in the MRS community. By providing a single conversion point, it aims to simplify the processing and analysis of MRS data, thereby lowering the barrier to use of MRS. Furthermore, it can serve as the basis for open data sharing, collaboration, and interoperability of analysis programs. It also opens possibility of greater standardisation and harmonisation. By aligning with the dominant format in neuroimaging, NIfTI-MRS enables the use of mature tools present in the imaging community, demonstrated in this work by using a dedicated imaging tool, FSLeyes, as a viewer.

Download Full-text

Quo Vadis Open data?

Masaryk University Journal of Law and Technology ◽

10.5817/mujlt2018-2-4 ◽

2018 ◽

Vol 12 (2) ◽

pp. 179-220

Author(s):

Jozef Andraško ◽

Matúš Mesarčík

Keyword(s):

Public Sector ◽

Data Protection ◽

New Technologies ◽

Open Data ◽

Legal Order ◽

Free Access ◽

Access To Information ◽

European Union Law ◽

Public Sector Information ◽

Quo Vadis

New technologies have irreversibly changed the nature of the traditional way of exercising the right to free access to information. In the current information society, the information available to public authorities is not just a tool for controlling the public administration and increasing its transparency. Information has become an asset that individuals and legal entities also seek to use for business purposes. PSI particularly in form of open data create new opportunities for developing and improving the performance of public administration.In that regard, authors analyze the term open data and its legal framework from the perspective of European Union law, Slovak legal order and Czech legal order. Furthermore, authors focus is on the relation between open data regime, public sector information re-use regime and free access to information regime.New data protection regime represented by General Data Protection Regulation poses several challenges when it comes to processing of public sector information in form of open data. The article highlights the most important challenges of new regime being compliance with purpose specification, selection of legal ground and other important issues.

Download Full-text

Supporting evidence-based analysis for modified risk tobacco products through a toxicology data-sharing infrastructure

F1000Research ◽

10.12688/f1000research.10493.2 ◽

2017 ◽

Vol 6 ◽

pp. 12 ◽

Cited By ~ 6

Author(s):

Stéphanie Boué ◽

Thomas Exner ◽

Samik Ghosh ◽

Vincenzo Belcastro ◽

Joh Dokler ◽

...

Keyword(s):

Data Sharing ◽

Data Science ◽

Disease Risk ◽

Open Data ◽

Supporting Evidence ◽

Tobacco Products ◽

Us Fda ◽

Using Data

The US FDA defines modified risk tobacco products (MRTPs) as products that aim to reduce harm or the risk of tobacco-related disease associated with commercially marketed tobacco products. Establishing a product’s potential as an MRTP requires scientific substantiation including toxicity studies and measures of disease risk relative to those of cigarette smoking. Best practices encourage verification of the data from such studies through sharing and open standards. Building on the experience gained from the OpenTox project, a proof-of-concept database and website (INTERVALS) has been developed to share results from both in vivo inhalation studies and in vitro studies conducted by Philip Morris International R&D to assess candidate MRTPs. As datasets are often generated by diverse methods and standards, they need to be traceable, curated, and the methods used well described so that knowledge can be gained using data science principles and tools. The data-management framework described here accounts for the latest standards of data sharing and research reproducibility. Curated data and methods descriptions have been prepared in ISA-Tab format and stored in a database accessible via a search portal on the INTERVALS website. The portal allows users to browse the data by study or mechanism (e.g., inflammation, oxidative stress) and obtain information relevant to study design, methods, and the most important results. Given the successful development of the initial infrastructure, the goal is to grow this initiative and establish a public repository for 21st-century preclinical systems toxicology MRTP assessment data and results that supports open data principles.

Download Full-text

Open Data Protection

10.17875/gup2017-1061 ◽

2017 ◽

Keyword(s):

Data Protection ◽

Open Data

Download Full-text

International data-sharing norms: from the OECD to the General Data Protection Regulation (GDPR)

Human Genetics ◽

10.1007/s00439-018-1919-7 ◽

2018 ◽

Vol 137 (8) ◽

pp. 575-582 ◽

Cited By ~ 26

Author(s):

Mark Phillips

Keyword(s):

Data Sharing ◽

Data Protection ◽

General Data Protection Regulation ◽

International Data ◽

General Data

Download Full-text

Data ownership and data publishing

ARPHA Conference Abstracts ◽

10.3897/aca.2.e39250 ◽

2019 ◽

Vol 2 ◽

Author(s):

Lyubomir Penev

Keyword(s):

Data Protection ◽

Open Data ◽

Data Publishing ◽

Supplementary File ◽

Biodiversity Data ◽

Biodiversity Knowledge ◽

Data Ownership ◽

Data Hoarding ◽

Data Elements ◽

Access To Data

"Data ownership" is actually an oxymoron, because there could not be a copyright (ownership) on facts or ideas, hence no data onwership rights and law exist. The term refers to various kinds of data protection instruments: Intellectual Property Rights (IPR) (mostly copyright) asserted to indicate some kind of data ownership, confidentiality clauses/rules, database right protection (in the European Union only), or personal data protection (GDPR) (Scassa 2018). Data protection is often realised via different mechanisms of "data hoarding", that is witholding access to data for various reasons (Sieber 1989). Data hoarding, however, does not put the data into someone's ownership. Nonetheless, the access to and the re-use of data, and biodiversuty data in particular, is hampered by technical, economic, sociological, legal and other factors, although there should be no formal legal provisions related to copyright that may prevent anyone who needs to use them (Egloff et al. 2014, Egloff et al. 2017, see also the Bouchout Declaration). One of the best ways to provide access to data is to publish these so that the data creators and holders are credited for their efforts. As one of the pioneers in biodiversity data publishing, Pensoft has adopted a multiple-approach data publishing model, resulting in the ARPHA-BioDiv toolbox and in extensive Strategies and Guidelines for Publishing of Biodiversity Data (Penev et al. 2017a, Penev et al. 2017b). ARPHA-BioDiv consists of several data publishing workflows: Deposition of underlying data in an external repository and/or its publication as supplementary file(s) to the related article which are then linked and/or cited in-tex. Supplementary files are published under their own DOIs to increase citability). Description of data in data papers after they have been deposited in trusted repositories and/or as supplementary files; the systme allows for data papers to be submitted both as plain text or converted into manuscripts from Ecological Metadata Language (EML) metadata. Import of structured data into the article text from tables or via web services and their susequent download/distribution from the published article as part of the integrated narrative and data publishing workflow realised by the Biodiversity Data Journal. Publication of data in structured, semanticaly enriched, full-text XMLs where data elements are machine-readable and easy-to-harvest. Extraction of Linked Open Data (LOD) from literature, which is then converted into interoperable RDF triples (in accordance with the OpenBiodiv-O ontology) (Senderov et al. 2018) and stored in the OpenBiodiv Biodiversity Knowledge Graph Deposition of underlying data in an external repository and/or its publication as supplementary file(s) to the related article which are then linked and/or cited in-tex. Supplementary files are published under their own DOIs to increase citability). Description of data in data papers after they have been deposited in trusted repositories and/or as supplementary files; the systme allows for data papers to be submitted both as plain text or converted into manuscripts from Ecological Metadata Language (EML) metadata. Import of structured data into the article text from tables or via web services and their susequent download/distribution from the published article as part of the integrated narrative and data publishing workflow realised by the Biodiversity Data Journal. Publication of data in structured, semanticaly enriched, full-text XMLs where data elements are machine-readable and easy-to-harvest. Extraction of Linked Open Data (LOD) from literature, which is then converted into interoperable RDF triples (in accordance with the OpenBiodiv-O ontology) (Senderov et al. 2018) and stored in the OpenBiodiv Biodiversity Knowledge Graph In combination with text and data mining (TDM) technologies for legacy literature (PDF) developed by Plazi, these approaches show different angles to the future of biodiversity data publishing and, lay the foundations of an entire data publishing ecosystem in the field, while also supplying FAIR (Findable, Accessible, Interoperable and Reusable) data to several interoperable overarching infrastructures, such as Global Biodiversity Information Facility (GBIF), Biodiversity Literature Repository (BLR), Plazi TreatmentBank, OpenBiodiv, as well as to various end users.

Download Full-text