Machine Learning for Performance Prediction of Spark Cloud Applications

Hardware architectures become increasingly complex as compute capabilities grow to exascale. We present the Analytical Memory Model with Pipelines (AMMP) of the Performance Prediction Toolkit (PPT). PPT-AMMP takes high-level source code and hardware architecture parameters as input and predicts the runtime of that code on the target hardware platform defined by those parameters. PPT-AMMP transforms the code into an architecture-independent intermediate representation, then (i) analyzes the basic block structure of the code, (ii) processes architecture-independent virtual memory access patterns, which it uses to build memory reuse distance distribution models for each basic block, and (iii) runs detailed basic-block-level simulations to determine hardware pipeline usage. PPT-AMMP uses machine learning and regression techniques to build the prediction models from small instances of the input code, then integrates into a higher-order discrete-event simulation model of PPT running on the Simian PDES engine. We validate PPT-AMMP on four standard computational physics benchmarks and present a use case of hardware parameter sensitivity analysis that identifies bottleneck hardware resources for different code inputs. We further extend PPT-AMMP to predict the performance of a scientific application code, namely the radiation transport mini-app SNAP. To this end, we analyze multivariate regression models that accurately predict the reuse profiles and the basic block counts. We validate the predicted SNAP runtimes against measured times.
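
The reuse distance distributions mentioned in step (ii) can be illustrated with a minimal Python sketch. It is not PPT-AMMP's implementation: the function names `reuse_distances` and `reuse_profile` and the toy trace are hypothetical, and the definition assumed here is the common one, where the reuse distance of an access is the number of distinct addresses touched since the previous access to the same address (infinite on first use).

```python
import math
from collections import Counter


def reuse_distances(trace):
    """Return the reuse distance of every access in an address trace.

    Assumed definition: number of distinct addresses accessed since the
    previous access to the same address; math.inf for a first access.
    """
    stack = []      # LRU stack, most recently used address at the end
    distances = []
    for addr in trace:
        if addr in stack:
            idx = stack.index(addr)
            # Everything above idx was touched since the last use of addr.
            distances.append(len(stack) - 1 - idx)
            stack.pop(idx)
        else:
            distances.append(math.inf)
        stack.append(addr)
    return distances


def reuse_profile(trace):
    """Return the empirical distribution {distance: probability}."""
    dists = reuse_distances(trace)
    counts = Counter(dists)
    return {d: c / len(dists) for d, c in counts.items()}


if __name__ == "__main__":
    toy_trace = ["a", "b", "c", "a", "b", "b"]  # hypothetical trace
    print(reuse_distances(toy_trace))
    print(reuse_profile(toy_trace))
```

The list-based stack keeps the sketch short at quadratic cost; tree-based structures are the usual choice for long traces.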

Performance Prediction and Optimization of the Air‐Cooled Condenser in a Large‐Scale Power Plant Using Machine Learning

Energy Technology ◽

10.1002/ente.202100045 ◽

2021 ◽

pp. 2100045

Author(s):

Heng Chen ◽

Weike Peng ◽

Chunming Nie ◽

Gang Xu ◽

Jing Lei

Keyword(s):

Machine Learning ◽

Power Plant ◽

Performance Prediction ◽

Large Scale

Application of machine learning algorithms to performance prediction of rocking shallow foundations during earthquake loading

Soil Dynamics and Earthquake Engineering ◽

10.1016/j.soildyn.2021.106965 ◽

2021 ◽

Vol 151 ◽

pp. 106965

Author(s):

Sivapalan Gajan

Keyword(s):

Machine Learning ◽

Performance Prediction ◽

Learning Algorithms ◽

Shallow Foundations ◽

Machine Learning Algorithms ◽

Earthquake Loading

Synthesis, characterization and machine learning based performance prediction of straw activated carbon

Journal of Cleaner Production ◽

10.1016/j.jclepro.2018.12.093 ◽

2019 ◽

Vol 212 ◽

pp. 1210-1223 ◽

Cited By ~ 17

Author(s):

Wen Jiang ◽

Xianjun Xing ◽

Shan Li ◽

Xianwen Zhang ◽

Wenquan Wang

Keyword(s):

Machine Learning ◽

Activated Carbon ◽

Performance Prediction

Machine learning-based management of cloud applications in hybrid clouds: A Hadoop case study

2017 IEEE 16th International Symposium on Network Computing and Applications (NCA) ◽

10.1109/nca.2017.8171352 ◽

2017 ◽

Cited By ~ 1

Author(s):

D. R. Avresky ◽

Alessandro Pellegrini ◽

Pierangelo Di Sanzo

Keyword(s):

Machine Learning ◽

Hybrid Clouds ◽

Cloud Applications

Novel Framework for Performance Prediction of Small and Medium Scale Enterprises: A Machine Learning Approach

2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI) ◽

10.1109/icacci.2018.8554747 ◽

2018 ◽

Author(s):

Nishant Jain ◽

Abhinav Tomar ◽

Prasanta K. Jana

Keyword(s):

Machine Learning ◽

Performance Prediction ◽

Learning Approach ◽

Medium Scale ◽

Machine Learning Approach

Development of machine-learning performance prediction models for asphalt mixtures

Advances in Materials and Pavement Performance Prediction II ◽

10.1201/9781003027362-9 ◽

2020 ◽

pp. 36-39

Author(s):

E. Omer ◽

S. Saadeh

Keyword(s):

Machine Learning ◽

Performance Prediction ◽

Prediction Models ◽

Asphalt Mixtures ◽

Learning Performance

A study secure multi authentication based data classification model in cloud based system

International Journal of Advances in Applied Sciences ◽

10.11591/ijaas.v9.i3.pp240-254 ◽

2020 ◽

Vol 9 (3) ◽

pp. 240-254

Author(s):

Sakshi Kaushal ◽

Bala Buksh

Keyword(s):

Machine Learning ◽

Cloud Computing ◽

Data Classification ◽

Classification Model ◽

Sensitive Data ◽

Learning Technique ◽

Mathematical Algorithms ◽

Encryption Algorithms ◽

Cloud Applications ◽

Technology Resources

Cloud computing is among the most popular terms in enterprises and the news. The concept has become practical because of fast internet bandwidth and advanced cooperation technology. Resources on the cloud can be accessed over the internet without self-built infrastructure. Cloud computing must manage security effectively in cloud applications. Data classification is a machine learning technique used to predict the class of unclassified data. Data mining uses different tools to discover unknown, valid patterns and relationships in a dataset. These tools are mathematical algorithms, statistical models, and machine learning (ML) algorithms. In this paper, the authors use an improved Bayesian technique to classify the data and encrypt the sensitive data using hybrid steganography. The encrypted and non-encrypted sensitive data are sent to the cloud environment, and the parameters are evaluated with different encryption algorithms.
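
As a rough illustration of Bayesian data classification, the Python sketch below trains a plain multinomial naive Bayes classifier with Laplace smoothing to label records as sensitive or public. This is a textbook baseline, not the improved Bayesian technique of the paper, whose details the abstract does not give. The function names, labels, and training records are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(samples):
    """Fit a multinomial naive Bayes model.

    samples: iterable of (tokens, label) pairs (hypothetical data format).
    Returns (class_counts, token_counts, vocab).
    """
    class_counts = Counter()
    token_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in samples:
        class_counts[label] += 1
        token_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, token_counts, vocab


def classify(model, tokens):
    """Return the label with the highest log-posterior for the tokens."""
    class_counts, token_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in class_counts.items():
        score = math.log(n / total)  # log prior
        denom = sum(token_counts[label].values()) + len(vocab)
        for tok in tokens:
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log((token_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


if __name__ == "__main__":
    # Hypothetical training records, for illustration only.
    training = [
        (["password", "account"], "sensitive"),
        (["salary", "account"], "sensitive"),
        (["weather", "news"], "public"),
        (["sports", "news"], "public"),
    ]
    model = train_naive_bayes(training)
    print(classify(model, ["account", "password"]))
    print(classify(model, ["news"]))
```

In a pipeline like the one the abstract describes, records labeled sensitive would then be passed to the encryption step, which this sketch does not cover.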
