High performance computing for machine learning

2021 · Vol 13(62) (2) · pp. 705-714
Author(s): Arpad Kerestely

Efficient High Performance Computing for Machine Learning has become a necessity in the past few years. Data is growing exponentially in domains such as healthcare, government and economics, driven by the development of IoT, smartphones and gadgets. This large volume of data needs storage space that no traditional computing system can offer, and it must be fed to Machine Learning algorithms so that useful information can be extracted from it. The larger the dataset fed to a Machine Learning algorithm, the more precise the results will be, but the time to compute those results will also increase. Hence the need for efficient High Performance Computing in aid of faster and better Machine Learning algorithms. This paper aims to unveil how one benefits from the other, what research has achieved so far and where it is heading.

2021 · Vol 2069 (1) · pp. 012153
Author(s): Rania Labib

Abstract: Architects often investigate the daylighting performance of hundreds of design solutions and configurations to ensure an energy-efficient solution for their designs. To shorten the time required for daylighting simulations, architects usually reduce the number of variables or parameters of the building and facade design. This practice usually results in the elimination of design variables that could contribute to an energy-optimized design configuration. Therefore, recent research has focused on incorporating machine learning algorithms that require the execution of only a relatively small subset of the simulations to predict the daylighting and energy performance of buildings. Although machine learning has been shown to be accurate, it remains a time-consuming process because of the set of simulations that must be executed to produce training and validation data. Furthermore, to save time, designers often decide to use a small simulation subset, which leads to a poorly trained machine learning algorithm that produces inaccurate results. Therefore, this study introduces an automated framework that utilizes high performance computing (HPC) to execute the simulations necessary for the machine learning algorithm while saving time and effort. High performance computing facilitates the execution of thousands of tasks simultaneously for a time-efficient simulation process, therefore allowing designers to increase the size of the simulation subset. Pairing high performance computing with machine learning allows for accurate and nearly instantaneous building performance predictions.
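To make the pairing concrete, the sketch below fans a grid of facade-design simulations out across worker processes and fits a surrogate model on the results. The `run_daylight_sim` stub, its parameters, and the random-forest surrogate are illustrative assumptions, not the paper's framework; on a real HPC cluster the fan-out would span thousands of nodes rather than local cores.

```python
# Toy stand-in for an HPC-backed simulation campaign feeding a surrogate model.
import itertools
from concurrent.futures import ProcessPoolExecutor

import numpy as np
from sklearn.ensemble import RandomForestRegressor

def run_daylight_sim(params):
    """Pretend simulation mapping (window ratio, overhang depth, orientation)
    to a daylight metric. A real framework would invoke a simulator here."""
    window_ratio, overhang_depth, orientation = params
    return (window_ratio * np.cos(np.radians(orientation))
            - 0.5 * overhang_depth * window_ratio)

if __name__ == "__main__":
    # Parameter grid: many independent tasks, ideal for HPC-style fan-out.
    grid = list(itertools.product(
        np.linspace(0.1, 0.9, 9),   # window-to-wall ratio
        np.linspace(0.0, 1.5, 7),   # overhang depth (m)
        np.arange(0, 360, 45),      # facade orientation (deg)
    ))

    # Execute the simulations in parallel (local cores standing in for a cluster).
    with ProcessPoolExecutor() as pool:
        results = list(pool.map(run_daylight_sim, grid))

    # Train a surrogate on the simulated subset so later design variants
    # can be predicted nearly instantaneously.
    X, y = np.array(grid), np.array(results)
    surrogate = RandomForestRegressor(n_estimators=200).fit(X, y)
    print("Surrogate R^2 on training data:", surrogate.score(X, y))
```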


2019 · Vol 16 (2) · pp. 541-564
Author(s): Mathias Longo, Ana Rodriguez, Cristian Mateos, Alejandro Zunino

In-silico research has grown considerably. Today's scientific code involves long-running computer simulations, and hence powerful computing infrastructures are needed. Traditionally, research in high-performance computing has focused on executing code as fast as possible, while energy has only recently been recognized as another goal to consider. Yet energy-driven research has mostly focused on the hardware and middleware layers; few efforts target the application level, where many energy-aware optimizations are possible. We revisit a catalog of Java primitives commonly used in object-oriented scientific programming, in the form of micro-benchmarks, to identify energy-friendly versions of each primitive. We then apply the micro-benchmarks to classical scientific application kernels and machine learning algorithms, for both single-thread and multi-thread implementations, on a server. Energy usage reductions at the micro-benchmark level are substantial, while the reductions obtained for full applications range from 3.90% to 99.18%.
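The paper's catalog targets Java primitives; as a language-neutral illustration of the micro-benchmark methodology, the sketch below compares two implementations of one primitive (string building) and uses wall-clock time as a crude proxy for energy. The harness shape is an assumption; the 3.90%-99.18% figures come only from the paper's Java experiments.

```python
# Illustrative micro-benchmark harness: time two versions of the same
# primitive to identify the more resource-friendly variant.
import timeit

def concat_loop(n=10_000):
    s = ""
    for _ in range(n):
        s += "x"          # repeated reallocation: the "naive" primitive
    return s

def concat_join(n=10_000):
    return "".join("x" for _ in range(n))   # the "friendly" variant

for fn in (concat_loop, concat_join):
    t = timeit.timeit(fn, number=200)
    print(f"{fn.__name__}: {t:.3f}s for 200 runs")
```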


2020 · pp. 1-11
Author(s): Jie Liu, Lin Lin, Xiufang Liang

The online English teaching system places certain requirements on the intelligent scoring system, and the most difficult stage of intelligent scoring in the English test is scoring English compositions with an intelligent model. In order to improve the intelligence of English composition scoring, this study combines machine learning algorithms with intelligent image recognition technology and proposes an improved MSER-based character candidate region extraction algorithm and a convolutional neural network-based pseudo-character region filtering algorithm. In addition, in order to verify whether the algorithm model proposed in this paper meets the requirements of the task, that is, to verify the feasibility of the algorithm, the performance of the proposed model is analyzed through designed experiments. Moreover, the basic conditions for composition scoring are fed into the model as constraints. The research results show that the algorithm proposed in this paper has practical value and can be applied to English assessment systems and online homework evaluation systems.
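A minimal sketch of MSER-based character candidate extraction with OpenCV is shown below; the aspect-ratio and size filter is a crude stand-in for the paper's CNN-based pseudo-character filtering, not the authors' exact pipeline.

```python
# MSER candidate extraction sketch; a trained CNN would replace the
# geometric filter used here to reject pseudo-character regions.
import cv2

def extract_character_candidates(image_path):
    img = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)
    mser = cv2.MSER_create()               # default MSER parameters
    _, bboxes = mser.detectRegions(img)    # stable regions + bounding boxes
    candidates = []
    for (x, y, w, h) in bboxes:
        # Keep plausibly character-shaped boxes; a CNN would score each
        # crop instead of this heuristic.
        if h >= 8 and 0.1 < w / h < 10:
            candidates.append((x, y, w, h))
    return candidates

# Usage: boxes = extract_character_candidates("essay_scan.png")
```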


Sensors · 2021 · Vol 21 (2) · pp. 656
Author(s): Xavier Larriva-Novo, Víctor A. Villagrá, Mario Vega-Barbas, Diego Rivera, Mario Sanz Rodrigo

Security in IoT networks is currently mandatory, due to the large amount of data these networks have to handle. Such systems are vulnerable to numerous cybersecurity attacks, which are increasing in number and sophistication. For this reason, new intrusion detection techniques have to be developed that are as accurate as possible for these scenarios. Intrusion detection systems based on machine learning algorithms have already shown high performance in terms of accuracy. This research proposes the study and evaluation of several preprocessing techniques based on traffic categorization for a machine learning neural network algorithm. For its evaluation, this research uses two benchmark datasets, namely UGR16 and UNSW-NB15, as well as one of the most widely used datasets, KDD99. The preprocessing techniques were evaluated with scaling and normalization functions. All of these preprocessing models were applied to different sets of characteristics based on a categorization composed of four groups of features: basic connection features, content characteristics, statistical characteristics and, finally, a group composed of traffic-based features and connection direction-based traffic characteristics. The objective of this research is to evaluate this categorization by using various data preprocessing techniques to obtain the most accurate model. Our proposal shows that, by applying the categorization of network traffic and several preprocessing techniques, accuracy can be enhanced by up to 45%. Preprocessing a specific group of characteristics yields greater accuracy, allowing the machine learning algorithm to correctly classify the parameters related to possible attacks.
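A minimal sketch of the group-wise preprocessing idea follows, assuming hypothetical column indices for the four feature groups and synthetic data in place of KDD99/UNSW-NB15/UGR16; the scaler-per-group assignment is illustrative, not the combination the authors found best.

```python
# Group-wise preprocessing feeding a neural-network classifier.
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler, MinMaxScaler
from sklearn.pipeline import Pipeline
from sklearn.neural_network import MLPClassifier
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Hypothetical column indices for the four feature groups.
BASIC, CONTENT, STATS, TRAFFIC = [0, 1, 2], [3, 4, 5], [6, 7], [8, 9]

pre = ColumnTransformer([
    ("basic",   StandardScaler(), BASIC),    # basic connection features
    ("content", MinMaxScaler(),   CONTENT),  # content characteristics
    ("stats",   StandardScaler(), STATS),    # statistical characteristics
    ("traffic", MinMaxScaler(),   TRAFFIC),  # traffic/direction features
])

model = Pipeline([
    ("pre", pre),
    ("mlp", MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=300)),
])

# Synthetic stand-in for a labeled traffic dataset.
X, y = make_classification(n_samples=2000, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
model.fit(X_tr, y_tr)
print("Held-out accuracy:", model.score(X_te, y_te))
```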


Risks · 2020 · Vol 9 (1) · pp. 4
Author(s): Christopher Blier-Wong, Hélène Cossette, Luc Lamontagne, Etienne Marceau

In the past 25 years, computer scientists and statisticians have developed machine learning algorithms capable of modeling highly nonlinear transformations and interactions of input features. While actuaries use generalized linear models (GLMs) frequently in practice, only in the past few years have they begun studying these newer algorithms to tackle insurance-related tasks. In this work, we aim to review the applications of machine learning in actuarial science and present the current state of the art in ratemaking and reserving. We first give an overview of neural networks, then briefly outline applications of machine learning algorithms to actuarial science tasks. Finally, we summarize future trends of machine learning for the insurance industry.
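As a toy illustration of the GLM-versus-machine-learning contrast the review draws, the sketch below fits a Poisson GLM and a gradient-boosted model to synthetic claim counts and compares their Poisson deviance; the data and features are invented for the example, not drawn from the paper.

```python
# Ratemaking-style frequency task: linear GLM vs. nonlinear boosted trees.
import numpy as np
from sklearn.linear_model import PoissonRegressor
from sklearn.ensemble import HistGradientBoostingRegressor
from sklearn.metrics import mean_poisson_deviance

rng = np.random.default_rng(0)
n = 5000
X = rng.normal(size=(n, 3))   # invented risk features (e.g. age, vehicle, region)
# True rate includes an interaction term a GLM cannot capture without
# hand-crafted features.
true_rate = np.exp(0.3 * X[:, 0] - 0.2 * X[:, 1] + 0.4 * X[:, 0] * X[:, 2])
y = rng.poisson(true_rate)    # observed claim counts

glm = PoissonRegressor().fit(X, y)                         # linear in the log-rate
gbm = HistGradientBoostingRegressor(loss="poisson").fit(X, y)

for name, model in [("GLM", glm), ("GBM", gbm)]:
    pred = model.predict(X)   # both losses yield strictly positive rates
    print(name, "mean Poisson deviance:", mean_poisson_deviance(y, pred))
```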

