scholarly journals Enabling petascale science: data management, troubleshooting, and scalable science services

2008 ◽  
Vol 125 ◽  
pp. 012068 ◽  
Author(s):  
A Baranovski ◽  
K Beattie ◽  
S Bharathi ◽  
J Boverhof ◽  
J Bresnahan ◽  
...  
Keyword(s):  
2016 ◽  
Vol 12 (S325) ◽  
pp. 385-388
Author(s):  
P. de Teodoro ◽  
S. Nieto ◽  
B. Altieri

AbstractEuclid is the ESA M2 mission and a milestone in the understanding of the geometry of the Universe. In total Euclid will produce up to 26 PB per year of observations. The Science Archive Systems (SAS) belongs to the Euclid Archive System (EAS) that sits in the core of the Euclid Science Ground Segment (SGS). The SAS is being built at the ESAC Science Data Centre (ESDC), which is responsible for the development and operations of the scientific archives for the Astronomy, Planetary and Heliophysics missions of ESA. The SAS is focused on the needs of the scientific community and is intended to provide access to the most valuable scientific metadata from the Euclid mission. In this paper we describe the architectural design of the system, implementation progress and the main challenges from the data management point of view in the building of the SAS.


1986 ◽  
Vol 56 ◽  
pp. 31-44 ◽  
Author(s):  
W CAMPBELL ◽  
P SMITH ◽  
R PRICE ◽  
L ROELOFS

2010 ◽  
Vol 33 (1) ◽  
pp. 23 ◽  
Author(s):  
Jesse M. Blum ◽  
Guy C. Warner ◽  
Simon B. Jones ◽  
Paul S. Lambert ◽  
Alison S. F. Dawson

Metadata Creation, Transformation and Discovery for Social Science Data Management: The DAMES Project Infrastructure


Author(s):  
Joyce Gosata Maphanyane ◽  
Read Brown Mthanganyika Mapeo ◽  
Sethunya Simela

Chapter 19 and Chapter 20 are on the emerging approaches to data management for a new geospatial science research. This chapter gives the essences and the methodologies of data matter perspective, and it has two sections; Section A highlights the subject theme; the essences of geospatial science data matter; while Section B expands that into the geospatial science data methodologies. Chapter 20 is the about the data management optimization perspective. It has only one section; Section C, that develops further the essence and methodologies of geospatial data cultivated in these two previous sections. The whole analytical discussion is in the emerging fields and how they had optimized and totally changed the geospatial science data management panorama.


Big Data ◽  
2016 ◽  
pp. 2199-2225
Author(s):  
Chris A. Mattmann ◽  
Andrew Hart ◽  
Luca Cinquini ◽  
Joseph Lazio ◽  
Shakeh Khudikyan ◽  
...  

Big data as a paradigm focuses on data volume, velocity, and on the number and complexity of various data formats and metadata, a set of information that describes other data types. This is nowhere better seen than in the development of the software to support next generation astronomical instruments including the MeerKAT/KAT-7 Square Kilometre Array (SKA) precursor in South Africa, in the Low Frequency Array (LOFAR) in Europe, in two instruments led in part by the U.S. National Radio Astronomy Observatory (NRAO) with its Expanded Very Large Array (EVLA) in Socorro, NM, and Atacama Large Millimeter Array (ALMA) in Chile, and in other instruments such as the Large Synoptic Survey Telescope (LSST) to be built in northern Chile. This chapter highlights the big data challenges in constructing data management systems for these astronomical instruments, specifically the challenge of integrating legacy science codes, handling data movement and triage, building flexible science data portals and user interfaces, allowing for flexible technology deployment scenarios, and in automatically and rapidly mitigating the difference in science data formats and metadata models. The authors discuss these challenges and then suggest open source solutions to them based on software from the Apache Software Foundation including Apache Object-Oriented Data Technology (OODT), Tika, and Solr. The authors have leveraged these solutions to effectively and expeditiously build many precursor and operational software systems to handle data from these astronomical instruments and to prepare for the coming data deluge from those not constructed yet. Their solutions are not specific to the astronomical domain and they are already applicable to a number of science domains including Earth, planetary, and biomedicine.


Sign in / Sign up

Export Citation Format

Share Document