An integrated storage and data management system for a high energy physics experiment

Author(s):
Paolo Calafiura
Gerhard Wirrer
Bernd Panzer-Steindel

1990
Author(s):
A. S. Johnson
M. Breidenbach
H. Hissen
P. F. Kunz
D. J. Sherden
...

2020
Vol 245
pp. 11006
Author(s):  
Mario Lassnig ◽  
Martin Barisits ◽  
Paul J Laycock ◽  
Cédric Serfon ◽  
Eric W Vaandering ◽  
...  

For many scientific projects, data management is an increasingly complicated challenge. The number of data-intensive instruments generating unprecedented volumes of data is growing and their accompanying workflows are becoming more complex. Their storage and computing resources are heterogeneous and are distributed at numerous geographical locations belonging to different administrative domains and organisations. These locations do not necessarily coincide with the places where data is produced nor where data is stored, analysed by researchers, or archived for safe long-term storage. To fulfil these needs, the data management system Rucio has been developed to allow the high-energy physics experiment ATLAS at LHC to manage its large volumes of data in an efficient and scalable way. But ATLAS is not alone, and several diverse scientific projects have started evaluating, adopting, and adapting the Rucio system for their own needs. As the Rucio community has grown, many improvements have been introduced, customisations have been added, and many bugs have been fixed. Additionally, new dataflows have been investigated and operational experiences have been documented. In this article we collect and compare the common successes, pitfalls, and oddities that arose in the evaluation efforts of multiple diverse experiments, and compare them with the ATLAS experience. This includes the high-energy physics experiments Belle II and CMS, the neutrino experiment DUNE, the scattering radar experiment EISCAT3D, the gravitational wave observatories LIGO and VIRGO, the SKA radio telescope, and the dark matter search experiment XENON.
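At the heart of Rucio's scalability is its declarative model: instead of issuing individual transfers, users state a replication rule (how many copies of a dataset must exist, on which classes of storage endpoints), and the system works out the transfers needed to converge. A toy sketch of that idea follows; all names are illustrative and this is not the actual Rucio client API.

```python
# Toy model of declarative replica management, loosely inspired by
# Rucio's replication rules. Function and endpoint names here are
# illustrative only; the real Rucio API differs.

def plan_transfers(replicas, copies, allowed_rses):
    """Given the current replica locations of a dataset, the desired
    number of copies, and the storage endpoints (RSEs) permitted by the
    rule, return the destinations still needed to satisfy the rule."""
    current = [rse for rse in replicas if rse in allowed_rses]
    missing = copies - len(current)
    if missing <= 0:
        return []  # rule already satisfied, nothing to transfer
    # pick destinations from the allowed endpoints that lack a replica
    candidates = [rse for rse in allowed_rses if rse not in current]
    return candidates[:missing]

# Example: one replica exists at CERN; the rule asks for two copies
# spread across three permitted sites.
replicas = ["CERN-PROD_DATADISK"]
allowed = ["CERN-PROD_DATADISK", "BNL-OSG2_DATADISK", "FZK-LCG2_DATADISK"]
print(plan_transfers(replicas, copies=2, allowed_rses=allowed))
# -> ['BNL-OSG2_DATADISK']
```

The real system layers much more on top (RSE expressions, quotas, lifetimes, transfer retries), but the declarative rule-to-transfer-plan step is the core loop that lets one model serve experiments as different as ATLAS, DUNE, and SKA.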


2019
Vol 214
pp. 04020
Author(s):  
Martin Barisits ◽  
Fernando Barreiro ◽  
Thomas Beermann ◽  
Karan Bhatia ◽  
Kaushik De ◽  
...  

Transparent use of commercial cloud resources for scientific experiments is a hard problem. In this article, we describe the first steps of the Data Ocean R&D collaboration between the high-energy physics experiment ATLAS and Google Cloud Platform, to allow seamless use of Google Compute Engine and Google Cloud Storage for physics analysis. We start by describing the three preliminary use cases that were identified at the beginning of the project. The following sections then detail the work done in the data management system Rucio and the workflow management systems PanDA and Harvester to interface Google Cloud Platform with the ATLAS distributed computing environment, and show the results of the integration tests. Afterwards, we describe the setup and results from a full ATLAS user analysis that was executed natively on Google Cloud Platform, and give estimates on projected costs. We close with a summary and an outlook on future work.


2020
Vol 226
pp. 01007
Author(s):  
Alexei Klimentov ◽  
Douglas Benjamin ◽  
Alessandro Di Girolamo ◽  
Kaushik De ◽  
Johannes Elmsheuser ◽  
...  

The ATLAS experiment at CERN’s Large Hadron Collider uses the Worldwide LHC Computing Grid, the WLCG, for its distributed computing infrastructure. Through the workload management system PanDA and the distributed data management system Rucio, ATLAS provides seamless access to hundreds of WLCG grid and cloud based resources that are distributed worldwide, to thousands of physicists. PanDA annually processes more than an exabyte of data using an average of 350,000 distributed batch slots, to enable hundreds of new scientific results from ATLAS. However, the resources available to the experiment have been insufficient to meet ATLAS simulation needs over the past few years as the volume of data from the LHC has grown. The problem will be even more severe for the next LHC phases. High Luminosity LHC will be a multi-exabyte challenge where the envisaged storage and compute needs are a factor of 10 to 100 above the expected technology evolution. The High Energy Physics (HEP) community needs to evolve its current computing and data organization models in order to change the way it uses and manages the infrastructure, with a focus on optimizations that improve performance and efficiency while also simplifying operations. In this paper we highlight recent R&D projects in HEP related to data lake prototypes, federated data storage, and the data carousel.
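The data carousel mentioned above addresses the storage gap by processing a large tape-resident dataset through a sliding window of disk: only a bounded slice is staged at any time, processed, and then released before the next slice is recalled, so disk usage stays constant regardless of dataset size. A minimal conceptual sketch of that loop, with illustrative names only (the real ATLAS implementation coordinates Rucio staging with PanDA job brokering):

```python
# Conceptual sketch of the data carousel pattern: keep disk usage
# bounded at `window` files while working through a tape-resident
# dataset. Illustrative only; not the ATLAS/Rucio implementation.

def carousel(files, window):
    """Yield successive staging windows over `files`. Each window
    models one carousel cycle:
      1) stage the batch from tape to disk,
      2) run jobs against the staged copies,
      3) release the disk space before the next recall."""
    for start in range(0, len(files), window):
        yield files[start:start + window]

# Example: a 10-file dataset processed with room for 4 files on disk.
dataset = [f"AOD.{i:05d}.root" for i in range(10)]
print([len(batch) for batch in carousel(dataset, window=4)])
# -> [4, 4, 2]
```

The design trade-off is throughput versus footprint: a larger window keeps tape drives streaming and jobs busy, while a smaller window frees scarce disk for other workflows, which is exactly the balance the carousel R&D explores.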


2019
Vol 214
pp. 04054
Author(s):  
Martin Barisits ◽  
Thomas Beermann ◽  
Joaquin Bogado ◽  
Vincent Garonne ◽  
Tomas Javurek ◽  
...  

Rucio, the distributed data management system of the ATLAS experiment, already manages more than 400 petabytes of physics data on the grid. Rucio was incrementally improved throughout LHC Run-2 and is currently being prepared for the HL-LHC era of the experiment. Alongside these improvements, the system is evolving into a full-scale generic data management system for applications beyond ATLAS, or even beyond high-energy physics. This contribution focuses on the development roadmap of Rucio for LHC Run-3, covering event-level data management, generic metadata support, and increased usage of networks and tapes. At the same time, Rucio is evolving beyond the original ATLAS requirements. This includes additional authentication mechanisms, generic database compatibility, deployment and packaging of the software stack in containers, and a paradigm shift to a full-scale open source project.


2019
Author(s):  
Juan Carlos Cabanillas Noris ◽  
Ildefonso León Monzón ◽  
Mario Iván Martínez Hernández ◽  
Solangel Rojas Torres
