ELS: Emulation system for debugging and tuning large-scale parallel programs on small clusters

This chapter describes experiences using Cloud infrastructures for scientific computing, both for serial and parallel computing. Amazon’s High Performance Computing (HPC) Cloud computing resources were compared to traditional HPC resources to quantify performance as well as assessing the complexity and cost of using the Cloud. Furthermore, a shared Cloud infrastructure is compared to standard desktop resources for scientific simulations. Whilst this is only a small scale evaluation these Cloud offerings, it does allow some conclusions to be drawn, particularly that the Cloud can currently not match the parallel performance of dedicated HPC machines for large scale parallel programs but can match the serial performance of standard computing resources for serial and small scale parallel programs. Also, the shared Cloud infrastructure cannot match dedicated computing resources for low level benchmarks, although for an actual scientific code, performance is comparable.

Download Full-text

Re-Running Large-Scale Parallel Programs Using Two Nodes

2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Ubiquitous Computing & Communications, Big Data & Cloud Computing, Social Computing & Networking, Sustainable Computing & Communications (ISPA/IUCC/BDCloud/SocialCom/SustainCom) ◽

10.1109/bdcloud.2018.00079 ◽

2018 ◽

Author(s):

Yayu Guo ◽

Fang Lin ◽

Yi Liu ◽

Depei Qian

Keyword(s):

Large Scale ◽

Parallel Programs

Download Full-text

Error detection in large-scale parallel programs with long runtimes

Future Generation Computer Systems ◽

10.1016/s0167-739x(02)00178-4 ◽

2003 ◽

Vol 19 (5) ◽

pp. 689-700

Author(s):

Dieter Kranzlmüller ◽

Nam Thoai ◽

Jens Volkert

Keyword(s):

Error Detection ◽

Large Scale ◽

Parallel Programs

Download Full-text

Annai scalable run-time support for interactive debugging and performance analysis of large-scale parallel programs

Lecture Notes in Computer Science - Euro-Par'96 Parallel Processing ◽

10.1007/3-540-61626-8_6 ◽

1996 ◽

pp. 64-69 ◽

Cited By ~ 3

Author(s):

Christian Clémençon ◽

Akiyoshi Endo ◽

Josef Fritscher ◽

Andreas Müller ◽

Brian J. N. Wylie

Keyword(s):

Performance Analysis ◽

Large Scale ◽

Parallel Programs ◽

Run Time ◽

And Performance

Download Full-text

Improving Execution Time of Parallel Programs on Large Scale Chip Multiprocessors with Constant Average Power Processing

2017 IEEE International Conference on Computer Design (ICCD) ◽

10.1109/iccd.2017.113 ◽

2017 ◽

Author(s):

Kramer Straube ◽

Christopher Nitta ◽

Raj Amirtharajah ◽

Matthew Farrens ◽

Venkatesh Akella

Keyword(s):

Execution Time ◽

Large Scale ◽

Chip Multiprocessors ◽

Average Power ◽

Parallel Programs

Download Full-text

Identifying scalability bottlenecks for large-scale parallel programs with graph analysis

Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming ◽

10.1145/3332466.3374518 ◽

2020 ◽

Author(s):

Yuyang Jin ◽

Haojie Wang ◽

Xiongchao Tang ◽

Torsten Hoefler ◽

Xu Liu ◽

...

Keyword(s):

Large Scale ◽

Parallel Programs ◽

Graph Analysis

Download Full-text

Staggering and blocking: a technique to optimize the parallel programs for large scale parallel processing systems

Proceedings of TENCON'94 - 1994 IEEE Region 10's 9th Annual International Conference on: 'Frontiers of Computer Technology' ◽

10.1109/tencon.1994.369252 ◽

2002 ◽

Author(s):

Guohua Jin ◽

Fujie Chen

Keyword(s):

Parallel Processing ◽

Large Scale ◽

Parallel Programs

Download Full-text

Deconvolute individual genomes from metagenome sequences through read clustering

10.1101/620666 ◽

2019 ◽

Cited By ~ 1

Author(s):

Kexue Li ◽

Lili Wang ◽

Lizhen Shi ◽

Li Deng ◽

Zhong Wang

Keyword(s):

Large Scale ◽

False Negative ◽

Next Generation Sequencing Data ◽

Clustering Methods ◽

Sequencing Data ◽

Clustering Problem ◽

Sequencing Coverage ◽

Metagenome Assembly ◽

Almost All ◽

Small Clusters

ABSTRACTMotivationMetagenome assembly from short next-generation sequencing data is a challenging process due to its large scale and computational complexity. Clustering short reads before assembly offers a unique opportunity for parallel downstream assembly of genomes with individualized optimization. However, current read clustering methods suffer either false negative (under-clustering) or false positive (over-clustering) problems.ResultsBased on a previously developed scalable read clustering method on Apache Spark, SpaRC, that has very low false positives, here we extended its capability by adding a new method to further cluster small clusters. This method exploits statistics derived from multiple samples in a dataset to reduce the under-clustering problem. Using a synthetic dataset from mouse gut microbiomes we show that this method has the potential to cluster almost all of the reads from genomes with sufficient sequencing coverage. We also explored several clustering parameters that deferentially affect genomes with various sequencing coverage.Availabilityhttps://bitbucket.org/berkeleylab/jgi-sparc/[email protected]

Download Full-text

Practical simulation of large-scale parallel programs and its performance analysis of the NAS Parallel Benchmarks

Euro-Par’98 Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/bfb0057859 ◽

1998 ◽

pp. 244-254 ◽

Cited By ~ 2

Author(s):

Kazuto Kubota ◽

Ken’ichi Itakura ◽

Mitsuhisa Sato ◽

Taisuke Boku

Keyword(s):

Performance Analysis ◽

Large Scale ◽

Parallel Programs

Download Full-text