Reaching new peaks for the future of the CMS HTCondor Global Pool

2021, Vol. 251, pp. 02055
Author(s): A. Pérez-Calero Yzquierdo, M. Mascheroni, M. Acosta Flechas, J. Dost, S. Haleem, ...

The CMS experiment at CERN employs a distributed computing infrastructure to satisfy its data processing and simulation needs. The CMS Submission Infrastructure team manages a dynamic HTCondor pool, aggregating mainly Grid clusters worldwide, but also HPC, Cloud and opportunistic resources. This CMS Global Pool, which currently involves over 70 computing sites and peaks at 350k CPU cores, is employed to successfully manage the simultaneous execution of up to 150k tasks. While the present infrastructure is sufficient to harness the current computing power scales, the latest CMS estimates predict a noticeable expansion in the amount of CPU that will be required to cope with the massive data increase of the High-Luminosity LHC (HL-LHC) era, planned to start in 2027. This contribution presents the latest results of the CMS Submission Infrastructure team in exploring and expanding the scalability reach of our Global Pool, in order to detect and overcome in advance any barriers to the HL-LHC goals, while maintaining high efficiency in our workload scheduling and resource utilization.
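To make the notion of a dynamic HTCondor pool concrete, the sketch below queries a pool's central collector for its machine (startd) ads and tallies the advertised CPU cores, roughly the quantity behind the 350k-core peak cited above. It is only a minimal illustration assuming the htcondor Python bindings are installed; the collector hostname is hypothetical and not an actual CMS Global Pool endpoint.

```python
# Minimal sketch: summarizing an HTCondor pool's aggregate CPU capacity.
# Assumes the htcondor Python bindings; the collector address is hypothetical.
import htcondor

collector = htcondor.Collector("collector.example.org:9618")  # hypothetical pool collector

# Fetch the machine (startd) ads, projecting only the attributes we need.
slots = collector.query(
    htcondor.AdTypes.Startd,
    projection=["Name", "Cpus", "State"],
)

total_cpus = sum(ad.get("Cpus", 0) for ad in slots)
claimed_cpus = sum(ad.get("Cpus", 0) for ad in slots if ad.get("State") == "Claimed")

print(f"slots: {len(slots)}, total CPU cores: {total_cpus}, claimed: {claimed_cpus}")
```

At the scale of a production pool, such queries would typically add constraints and stricter projections to limit the load on the collector; the snippet only shows the shape of the API.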

2017
Author(s): Rommel Cruz, Lucia Drummond, Esteban Clua, Cristiana Bentes

GPUs have established a new baseline for power efficiency and computing power, delivering higher bandwidth and more computing units with each new generation. Modern GPUs support the concurrent execution of kernels to maximize resource utilization, allowing kernels to exploit resources left idle by others. However, the decision to execute different kernels simultaneously is made by the hardware, and the GPU sometimes declines to schedule blocks from another kernel even when sufficient resources are available. In this work, we present an in-depth study of the simultaneous execution of kernels on the GPU. We establish the necessary conditions for executing kernels simultaneously, identify the factors that influence their competition for resources, and describe a model that can determine the resulting performance degradation. Finally, we validate the model using synthetic and real-world kernels with different computation and memory requirements.
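The necessary conditions mentioned above amount to blocks from different kernels fitting within a streaming multiprocessor's resource budget at the same time. The following Python sketch illustrates that idea with hypothetical per-SM limits; it is an illustrative co-residency check, not the degradation model proposed by the authors.

```python
# Illustrative sketch (not the authors' model): checking whether a block from a
# second kernel can co-reside on a streaming multiprocessor (SM) alongside
# already-resident blocks. Per-SM limits below are hypothetical round numbers.
from dataclasses import dataclass

@dataclass
class KernelBlock:
    threads: int      # threads per block
    registers: int    # registers per thread
    shared_mem: int   # shared memory per block, in bytes

# Hypothetical per-SM capacities (actual values vary by GPU architecture).
SM_THREADS = 2048
SM_REGISTERS = 65536
SM_SHARED_MEM = 96 * 1024
SM_MAX_BLOCKS = 32

def fits_alongside(resident: list, candidate: KernelBlock) -> bool:
    """Return True if one block of `candidate` still fits on the SM
    after the `resident` blocks have claimed their resources."""
    used_threads = sum(b.threads for b in resident)
    used_regs = sum(b.threads * b.registers for b in resident)
    used_smem = sum(b.shared_mem for b in resident)
    return (
        len(resident) + 1 <= SM_MAX_BLOCKS
        and used_threads + candidate.threads <= SM_THREADS
        and used_regs + candidate.threads * candidate.registers <= SM_REGISTERS
        and used_smem + candidate.shared_mem <= SM_SHARED_MEM
    )

# Example: a resource-heavy block and a lighter block sharing one SM.
heavy = KernelBlock(threads=1024, registers=48, shared_mem=48 * 1024)
light = KernelBlock(threads=256, registers=32, shared_mem=8 * 1024)
print(fits_alongside([heavy], light))  # True: threads, registers and shared memory all still fit
```

Even when such a check passes, the hardware scheduler may still serialize the kernels, which is precisely the gap between nominal resource availability and observed concurrency that the study investigates.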

