scholarly journals A short guide to increase FAIRness of atmospheric model data

2020 ◽  
Vol 29 (6) ◽  
pp. 483-491
Author(s):  
Anette Ganske ◽  
Daniel Heydebreck ◽  
Heinke Höck ◽  
Angelina Kraft ◽  
Johannes Quaas ◽  
...  
Keyword(s):  
1999 ◽  
Vol 217 (3-4) ◽  
pp. 303-313 ◽  
Author(s):  
G.W Kite ◽  
U Haberlandt
Keyword(s):  

2021 ◽  
Author(s):  
Anette Ganske ◽  
Amandine Kaiser ◽  
Angelina Kraft ◽  
Daniel Heydebreck ◽  
Andrea Lammert ◽  
...  

<p>As in many scientific disciplines, there are a variety of activities in Earth system sciences that address the important aspects of good research data management. What has not been sufficiently investigated and dealt with so far is the easy discoverability and re-use of quality-checked data. This aspect is taken up by the EASYDAB label.</p><p>EASYDAB<sup>1</sup> is a currently developed branding for FAIR and open data from the Earth System Sciences. The branding can be adopted by institutions running a data repository which stores data from the Earth System Sciences. EASYDAB is always connected to a research data publication with DataCite DOIs. Data published under EASYDAB are characterized by a high maturity, extensive metadata information and compliance with a comprehensive discipline-specific standard. For these datasets, the EASYDAB logo is added to the landing page of the data repository. Thereby, repositories can indicate their efforts to publish data with high maturity.</p><p>The first standard made for EASYDAB is the ATMODAT standard<sup>2</sup>, which has been developed within the AtMoDat<sup>3</sup> project (Atmospheric Model Data). It incorporates concrete recommendations and requirements related to the maturity, publication and enhanced FAIRness of atmospheric model data. The requirements are for rich metadata with controlled vocabularies, structured landing pages, file formats (netCDF) and the structure within files. Human- and machine-readable landing pages are a core element of the ATMODAT standard and should hold and present discipline-specific metadata on simulation and variable level. </p><p>The ATMODAT standard includes checklists for the data producer and the data curator so that the compliance with the standard can easily be obtained by both sides. To facilitate automatic checking of the netCDF files headers, a checker program will also be provided and published with DOI. Moreover, a checker for the compliance with the requirements for the DOI Metadata will be developed and made openly available. </p><p>The integration of standards from other disciplines in the Earth System Sciences, such as oceanography, into EASYDAB is helpful and desirable to improve the re-use of reviewed, high-quality data. </p><p> <sup>1</sup>www.easydab.de</p><p><sup>2</sup>https://cera-www.dkrz.de/WDCC/ui/cerasearch/entry?acronym=atmodat_standard_en_v3_0</p><p><sup>3</sup>www.atmodat.de</p>


2021 ◽  
pp. 331-333
Author(s):  
Vivien Voss ◽  
K. Heinke Schlünzen ◽  
David Grawe
Keyword(s):  

2021 ◽  
Author(s):  
Andrea Lammert ◽  
Anette Ganske ◽  
Amandine Kaiser ◽  
Angelina Kraft

<p>Due to the increasing amount of data produced in science, concepts for data reusability are of immense importance. One aspect is the publication of data in a way that ensures that it is findable, reusable, traceable and comparable (FAIR<sup>1</sup> principles). However, putting these principles into practice often causes significant difficulties for researchers. Therefore some repositories accept datasets described only with the minimum metadata required for DOI allocation. Unfortunately, this contains not  enough information to conform to the FAIR principles - many research data cannot be reused despite having a DOI. In contrast, other repositories aid the researchers by providing advice and strictly controlling the data and their metadata. To simplify the process of defining the needed amount of metadata and of controlling the data and metadata, the AtMoDat<sup>2</sup> (Atmospheric Model Data) project developed a detailed standard for the FAIR publication of atmospheric model data.</p><p>For this purpose we have developed a concept for the “ideal” description of atmospheric model data. A prerequisite for this is the data publication with a DataCite DOI. The ATMODAT standard<sup>3</sup> was developed to implement this concept. The standard defines the data format as NetCDF, mandatory metadata (for DOI, landing page and data header), and naming conventions used in climate research - the Climate and Forecast conventions (CF-conventions<sup>4</sup>). However, many variable names used in urban climate research, for example, are not part of the CF-conventions. For this, standard names have to be defined together with the community and the inclusion in the list of CF-conventions has to be requested. Furthermore we developed and published Python routines which allow data producers as well as repositories to check model output data against the standard. </p><p>The ATMODAT standard will first be applied by the project partners of the two participating universities (University of Hamburg and Leipzig). Here, climate model data are processed with a post-processor in preparation for publication. Subsequently, the files including the specified metadata for the DataCite metadata schema will be published by the World Data Center for Climate<sup>5</sup> (WDCC). Data fulfilling the AtMoDat standard will be marked at the landing page by a special EASYDAB<sup>6</sup> (Earth System Data Branding) logo. EASYDAB is a currently developed branding for FAIR and open data from the Earth System Sciences. This indicates to future data users that the dataset is a verified dataset that can be easily reused. The standardization of the data and the further steps are easily transferable to data from other disciplines.</p><p>1 Wilkinson, M., Dumontier, M., Aalbersberg, I. et al.: The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016). https://doi.org/10.1038/sdata.2016.18 </p><p>2 https://www.atmodat.de/</p><p>3 https://cera-www.dkrz.de/WDCC/ui/cerasearch/entry?acronym=atmodat_standard_en_v3_0</p><p>4 https://cfconventions.org/</p><p>5 https://cera-www.dkrz.de/WDCC/ui/cerasearch/</p><p>6 https://www.easydab.de/</p><p> </p>


2020 ◽  
Author(s):  
Vivien Voss ◽  
K. Heinke Schlünzen ◽  
David Grawe
Keyword(s):  

<p>Mit Hilfe der numerischen Modellierung werden komplexe Prozesse auch in kleinräumig gegliederten Gebieten und innerhalb von Städten darstellbar. Durch geringe Gitterweiten und numerischen Restriktionen benötigen derartig hochauflösende Untersuchungen sehr viele Ressourcen; die Nachnutzung der Modellergebnisse ist aber noch wenig ausgeprägt. Vergleiche von Modelldaten, Weitergabe der Ergebnisse oder die Reproduzierung von Simulationen werden gegenwärtig durch uneinheitliche Datenstrukturen, nicht standardisierte Variablennamen und fehlende Angaben zum Modell-Setup erschwert. In der Meteorologie verwendete Datenkonventionen, wie z.B. der CMIP6 Standard, sind auf großskalige Modelldaten angepasst, beinhalten aber bisher nicht die Größen und Variablen, die in der mikroskaligen hindernisauflösenden Modellierung relevant sind. Dies betrifft sowohl die numerische als auch die physikalische Modellierung.</p> <p>In dem Projekt AtMoDat (Atmospheric Model Data) soll der bisherigen Standard auf die Bedürfnisse der mikroskaligen Modellierung erweitert werden. Bisher wurde mit Hilfe einer Online Umfrage (uhh.de/orm-survey) einige mikroskalige Modelle und deren Eigenschaften erfasst und Vorstellungen und Anregungen zu dem Standard gesammelt. Ergebnisse daraus werden vorgestellt. Vorläufige Variablennamen für die Mikroskala werden für den AtMoDat-Standard vorgeschlagen. Die Methodik der Erweiterung wird vorgestellt und diskutiert.</p>


2020 ◽  
Author(s):  
Daniel Neumann ◽  
Anette Ganske ◽  
Vivien Voss ◽  
Angelina Kraft ◽  
Heinke Höck ◽  
...  

<p>The generation of high quality research data is expensive. The FAIR principles were established to foster the reuse of such data for the benefit of the scientific community and beyond. Publishing research data with metadata and DataCite DOIs in public repositories makes them findable and accessible (FA of FAIR). However, DOIs and basic metadata do not guarantee the data are actually reusable without discipline-specific knowledge: if data are saved in proprietary or undocumented file formats, if detailed discipline-specific metadata are missing and if quality information on the data and metadata are not provided. In this contribution, we present ongoing work in the AtMoDat project, -a consortium of atmospheric scientists and infrastructure providers, which aims on improving the reusability of atmospheric model data.<br>  <br>Consistent standards are necessary to simplify the reuse of research data. Although standardization of file structure and metadata is well established for some subdomains of the earth system modeling community – e.g. CMIP –, several other subdomains are lacking such standardization. Hence, scientists from the Universities of Hamburg and Leipzig and infrastructure operators cooperate in the AtMoDat project in order to advance standardization for model output files in specific subdomains of the atmospheric modeling community. Starting from the demanding CMIP6 standard, the aim is to establish an easy-to-use standard that is at least compliant with the Climate and Forecast (CF) conventions. In parallel, an existing netCDF file convention checker is extended to check for the new standards. This enhanced checker is designed to support the creation of compliant files and thus lower the hurdle for data producers to comply with the new standard. The transfer of this approach to further sub-disciplines of the earth system modeling community will be supported by a best-practice guide and other documentation. A showcase of a standard for the urban atmospheric modeling community will be presented in this session. The standard is based on CF Conventions and adapts several global attributes and controlled vocabularies from the well-established CMIP6 standard.<br>  <br>Additionally, the AtMoDat project aims on introducing a generic quality indicator into the DataCite metadata schema to foster further reuse of data. This quality indicator should require a discipline-specific implementation of a quality standard linked to the indicator. We will present the concept of the generic quality indicator in general and in the context of urban atmospheric modeling data. </p>


1980 ◽  
Vol 14 (3) ◽  
pp. 134-139 ◽  
Author(s):  
Mary desJardins ◽  
A. Frederick Hasler
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document