Code Convention Adherence in Research Data Infrastructure Software: An Exploratory Study

Author(s):  
Michael Smit
2020 ◽  
Vol 4 (1) ◽  
pp. 122-142
Author(s):  
Inna Kouper ◽  
Anjanette H Raymond ◽  
Stacey Giroux

AbstractMaking decisions regarding data and the overall credibility of research constitutes research data governance. In this paper, we present results of an exploratory study of the stakeholders of research data governance. The study was conducted among individuals who work in academic and research institutions in the US, with the goal of understanding what entities are perceived as making decisions regarding data and who researchers believe should be responsible for governing research data. Our results show that there is considerable diversity and complexity across stakeholders, both in terms of who they are and their ideas about data governance. To account for this diversity, we propose to frame research data governance in the context of polycentric governance of a knowledge commons. We argue that approaching research data from the commons perspective will allow for a governance framework that can balance the goals of science and society, allow us to shift the discussion toward protection from enclosure and knowledge resilience, and help to ensure that multiple voices are included in all levels of decision-making.


2020 ◽  
Author(s):  
Pierre Tremouilhac ◽  
Chia-Lin Lin ◽  
Pei-Chi Huang ◽  
Yu-Chieh Huang ◽  
An Nguyen ◽  
...  

<p>We describe the development of a repository for chemistry research data (called Chemotion) that provides solutions for current challenges to store research data in a feasible manner, allowing the conservation of domain specific information in a machine readable format. A main advantage of the repository Chemotion is the comprehensive functionality, which offers options to collect, prepare and reuse data using discipline specific methods and data processing tools. For selected analytical data, automated procedures are implemented to facilitate the curation of the data. Chemotion provides functions to facilitate the publishing process of data and the citation of the deposited data. It supports automated Digital Object Identifier (DOI) generation, the comparison of the submissions with PubChem instances, and workflows for peer reviewing of the submissions including embargo settings. The described developments were used to establish a research data infrastructure that is hosted at the Karlsruhe Institute of Technology (KIT), including the necessary storage and support to build a new community-driven repository as a comprehensive alternative to commercial databases. </p>


2020 ◽  
Vol 6 ◽  
Author(s):  
Christoph Steinbeck ◽  
Oliver Koepler ◽  
Felix Bach ◽  
Sonja Herres-Pawlis ◽  
Nicole Jung ◽  
...  

The vision of NFDI4Chem is the digitalisation of all key steps in chemical research to support scientists in their efforts to collect, store, process, analyse, disclose and re-use research data. Measures to promote Open Science and Research Data Management (RDM) in agreement with the FAIR data principles are fundamental aims of NFDI4Chem to serve the chemistry community with a holistic concept for access to research data. To this end, the overarching objective is the development and maintenance of a national research data infrastructure for the research domain of chemistry in Germany, and to enable innovative and easy to use services and novel scientific approaches based on re-use of research data. NFDI4Chem intends to represent all disciplines of chemistry in academia. We aim to collaborate closely with thematically related consortia. In the initial phase, NFDI4Chem focuses on data related to molecules and reactions including data for their experimental and theoretical characterisation. This overarching goal is achieved by working towards a number of key objectives: Key Objective 1: Establish a virtual environment of federated repositories for storing, disclosing, searching and re-using research data across distributed data sources. Connect existing data repositories and, based on a requirements analysis, establish domain-specific research data repositories for the national research community, and link them to international repositories. Key Objective 2: Initiate international community processes to establish minimum information (MI) standards for data and machine-readable metadata as well as open data standards in key areas of chemistry. Identify and recommend open data standards in key areas of chemistry, in order to support the FAIR principles for research data. Finally, develop standards, if there is a lack. Key Objective 3: Foster cultural and digital change towards Smart Laboratory Environments by promoting the use of digital tools in all stages of research and promote subsequent Research Data Management (RDM) at all levels of academia, beginning in undergraduate studies curricula. Key Objective 4: Engage with the chemistry community in Germany through a wide range of measures to create awareness for and foster the adoption of FAIR data management. Initiate processes to integrate RDM and data science into curricula. Offer a wide range of training opportunities for researchers. Key Objective 5: Explore synergies with other consortia and promote cross-cutting development within the NFDI. Key Objective 6: Provide a legally reliable framework of policies and guidelines for FAIR and open RDM.


2020 ◽  
Author(s):  
Pierre Tremouilhac ◽  
Chia-Lin Lin ◽  
Pei-Chi Huang ◽  
Yu-Chieh Huang ◽  
An Nguyen ◽  
...  

<p>We describe the development of a repository for chemistry research data (called Chemotion) that provides solutions for current challenges to store research data in a feasible manner, allowing the conservation of domain specific information in a machine readable format. A main advantage of the repository Chemotion is the comprehensive functionality, which offers options to collect, prepare and reuse data using discipline specific methods and data processing tools. For selected analytical data, automated procedures are implemented to facilitate the curation of the data. Chemotion provides functions to facilitate the publishing process of data and the citation of the deposited data. It supports automated Digital Object Identifier (DOI) generation, the comparison of the submissions with PubChem instances, and workflows for peer reviewing of the submissions including embargo settings. The described developments were used to establish a research data infrastructure that is hosted at the Karlsruhe Institute of Technology (KIT), including the necessary storage and support to build a new community-driven repository as a comprehensive alternative to commercial databases. </p>


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11140
Author(s):  
Sheeba Samuel ◽  
Birgitta König-Ries

Scientific experiments and research practices vary across disciplines. The research practices followed by scientists in each domain play an essential role in the understandability and reproducibility of results. The “Reproducibility Crisis”, where researchers find difficulty in reproducing published results, is currently faced by several disciplines. To understand the underlying problem in the context of the reproducibility crisis, it is important to first know the different research practices followed in their domain and the factors that hinder reproducibility. We performed an exploratory study by conducting a survey addressed to researchers representing a range of disciplines to understand scientific experiments and research practices for reproducibility. The survey findings identify a reproducibility crisis and a strong need for sharing data, code, methods, steps, and negative and positive results. Insufficient metadata, lack of publicly available data, and incomplete information in study methods are considered to be the main reasons for poor reproducibility. The survey results also address a wide number of research questions on the reproducibility of scientific results. Based on the results of our explorative study and supported by the existing published literature, we offer general recommendations that could help the scientific community to understand, reproduce, and reuse experimental data and results in the research data lifecycle.


Neuroforum ◽  
2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Carsten M. Klingner ◽  
Petra Ritter ◽  
Stefan Brodoehl ◽  
Christian Gaser ◽  
André Scherag ◽  
...  

Abstract In clinical neuroscience, there are considerable difficulties in translating basic research into clinical applications such as diagnostic tools or therapeutic interventions. This gap, known as the “valley of death,” was mainly attributed to the problem of “small numbers” in clinical neuroscience research, i.e. sample sizes that are too small (Hutson et al., 2017). As a possible solution, it has been repeatedly suggested to systematically manage research data to provide long-term storage, accessibility, and federate data. This goal is supported by a current call of the DFG for a national research data infrastructure (NFDI). This article will review current challenges and possible solutions specific to clinical neuroscience and discuss them in the context of other national and international health data initiatives. A successful NFDI consortium will help to overcome not only the “valley of death” but also promises a path to individualized medicine by enabling big data to produce generalizable results based on artificial intelligence and other methods.


Author(s):  
Daniel Fuß ◽  
Jutta von Maurice ◽  
Hans-Günther Roßbach

AbstractThe article provides an insight into the conceptual and methodological framework as well as the research data infrastructure of the German National Educational Panel Study (NEPS). The NEPS study has been set up to build a profound empirical basis for the description and analysis of educational processes and competence development across the life span. Its large-scale database consists of longitudinal information from more than 60,000 target respondents – distributed over six different starting cohorts ranging from newborns to adults – and from relevant context persons such as parents or teachers. The complex multicohort sequence design schedules annual or even semiannual survey waves including a broad spectrum of competence assessments. All data are thoroughly prepared, documented, and disseminated free of charge in the form of regularly expanded Scientific Use Files. In addition to some background information about NEPS in general, this paper primarily focuses on issues of data collection, data structure, data availability, and the requirements for different types of data access. The number of more than 1,000 data users involved in over 700 research projects so far serves to highlight the potential of NEPS as a unique research data infrastructure for educational research and beyond.


2016 ◽  
Vol 77 (4) ◽  
pp. 500-519 ◽  
Author(s):  
Yi Shen

The data landscape study at Virginia Tech addresses the changing modes of faculty scholarship and supports the development of a user-centric data infrastructure, management, and curation system. The study investigates faculty researchers’ current practices in organizing, describing, and preserving data and the emerging needs for services and education. The results demonstrate the changing nature of faculty demands regarding data documentation, storage, and archiving and identify opportunities for libraries to develop a coherent service, research, and education system to address the evolving needs.


Sign in / Sign up

Export Citation Format

Share Document