Automatic generation of high-performance quantized machine learning kernels

Statistical and machine learning models for optimizing energy in parallel applications

Rising power costs and constraints are driving a growing focus on the energy efficiency of high performance computing systems. The unique characteristics of a particular system and workload and their effect on performance and energy efficiency are typically difficult for application users to assess and to control. Settings for optimum performance and energy efficiency can also diverge, so we need to identify trade-off options that guide a suitable balance between energy use and performance. We present statistical and machine learning models that only require a small number of runs to make accurate Pareto-optimal trade-off predictions using parameters that users can control. We study model training and validation using several parallel kernels and more complex workloads, including Algebraic Multigrid (AMG), Large-scale Atomic Molecular Massively Parallel Simulator, and Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. We demonstrate that we can train the models using as few as 12 runs, with prediction error of less than 10%. Our AMG results identify trade-off options that provide up to 45% improvement in energy efficiency for around 10% performance loss. We reduce the sample measurement time required for AMG by 90%, from 13 h to 74 min.

Use of Machine Learning to Investigate the Quantitative Checklist for Autism in Toddlers (Q-CHAT) towards Early Autism Screening

Diagnostics ◽

10.3390/diagnostics11030574 ◽

2021 ◽

Vol 11 (3) ◽

pp. 574

Author(s):

Gennaro Tartarisco ◽

Giovanni Cicceri ◽

Davide Di Pietro ◽

Elisa Leonardi ◽

Stefania Aiello ◽

...

Keyword(s):

Machine Learning ◽

High Performance ◽

Behavioral Science ◽

Autistic Traits ◽

Classification Performance ◽

Recursive Feature Elimination ◽

Diagnostic Tools ◽

Support Vector ◽

K Nearest Neighbors ◽

Autism Screening

In the past two decades, several screening instruments were developed to detect toddlers who may be autistic both in clinical and unselected samples. Among others, the Quantitative CHecklist for Autism in Toddlers (Q-CHAT) is a quantitative and normally distributed measure of autistic traits that demonstrates good psychometric properties in different settings and cultures. Recently, machine learning (ML) has been applied to behavioral science to improve the classification performance of autism screening and diagnostic tools, but mainly in children, adolescents, and adults. In this study, we used ML to investigate the accuracy and reliability of the Q-CHAT in discriminating young autistic children from those without. Five different ML algorithms (random forest (RF), naïve Bayes (NB), support vector machine (SVM), logistic regression (LR), and K-nearest neighbors (KNN)) were applied to investigate the complete set of Q-CHAT items. Our results showed that ML achieved an overall accuracy of 90%, and the SVM was the most effective, being able to classify autism with 95% accuracy. Furthermore, using the SVM–recursive feature elimination (RFE) approach, we selected a subset of 14 items ensuring 91% accuracy, while 83% accuracy was obtained from the 3 best discriminating items in common to ours and the previously reported Q-CHAT-10. This evidence confirms the high performance and cross-cultural validity of the Q-CHAT, and supports the application of ML to create shorter and faster versions of the instrument, maintaining high classification accuracy, to be used as a quick, easy, and high-performance tool in primary-care settings.

An IoT-Focused Intrusion Detection System Approach Based on Preprocessing Characterization for Cybersecurity Datasets

Sensors ◽

10.3390/s21020656 ◽

2021 ◽

Vol 21 (2) ◽

pp. 656

Author(s):

Xavier Larriva-Novo ◽

Víctor A. Villagrá ◽

Mario Vega-Barbas ◽

Diego Rivera ◽

Mario Sanz Rodrigo

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

High Performance ◽

Learning Algorithm ◽

Detection System ◽

Machine Learning Algorithms ◽

Statistical Characteristics ◽

Detection Techniques ◽

Traffic Characteristics ◽

Benchmark Datasets

Security in IoT networks is currently mandatory, due to the high amount of data that has to be handled. These systems are vulnerable to several cybersecurity attacks, which are increasing in number and sophistication. Due to this reason, new intrusion detection techniques have to be developed, being as accurate as possible for these scenarios. Intrusion detection systems based on machine learning algorithms have already shown a high performance in terms of accuracy. This research proposes the study and evaluation of several preprocessing techniques based on traffic categorization for a machine learning neural network algorithm. This research uses for its evaluation two benchmark datasets, namely UGR16 and the UNSW-NB15, and one of the most used datasets, KDD99. The preprocessing techniques were evaluated in accordance with scalar and normalization functions. All of these preprocessing models were applied through different sets of characteristics based on a categorization composed by four groups of features: basic connection features, content characteristics, statistical characteristics and finally, a group which is composed by traffic-based features and connection direction-based traffic characteristics. The objective of this research is to evaluate this categorization by using various data preprocessing techniques to obtain the most accurate model. Our proposal shows that, by applying the categorization of network traffic and several preprocessing techniques, the accuracy can be enhanced by up to 45%. The preprocessing of a specific group of characteristics allows for greater accuracy, allowing the machine learning algorithm to correctly classify these parameters related to possible attacks.

Chemometrics‐based models hyphenated with ensemble machine learning for retention time simulation of Isoquercitrin in Coriander sativum L. using high performance liquid chromatography

Journal of Separation Science ◽

10.1002/jssc.202000890 ◽

2020 ◽

Author(s):

Abdullahi Garba Usman ◽

Selin Işik ◽

Sani Isah Abba ◽

Filiz Meriçli

Keyword(s):

Machine Learning ◽

High Performance Liquid Chromatography ◽

Liquid Chromatography ◽

Retention Time ◽

High Performance ◽

Ensemble Machine Learning ◽

Time Simulation

An explainable machine learning model to predict and elucidate the compressive behavior of high-performance concrete

Results in Engineering ◽

10.1016/j.rineng.2021.100245 ◽

2021 ◽

pp. 100245

Author(s):

Debaditya Chakraborty ◽

Ibukun Awolusi ◽

Lilianna Gutierrez

Keyword(s):

Machine Learning ◽

High Performance ◽

High Performance Concrete ◽

Learning Model ◽

Compressive Behavior ◽

Machine Learning Model

High-performance computing and machine learning applied in thermal systems analysis

Journal of Thermal Analysis and Calorimetry ◽

10.1007/s10973-021-10952-7 ◽

2021 ◽

Author(s):

Mostafa Safdari Shadloo ◽

Amin Rahmat ◽

Larry K. B. Li ◽

Omid Mahian ◽

Avinash Alagumalai

Keyword(s):

Machine Learning ◽

High Performance Computing ◽

Systems Analysis ◽

High Performance ◽

Thermal Systems ◽

Performance Computing

High-performance Machine Learning in Enabling Large-scale Load Analysis Considering Class Imbalance and Frequency Domain Characteristics

2020 IEEE Sustainable Power and Energy Conference (iSPEC) ◽

10.1109/ispec50848.2020.9350922 ◽

2020 ◽

Author(s):

Xi Wang ◽

Quan Tang ◽

Haiyan Wang ◽

Ruiguang Ma ◽

Zizhuo Tang

Keyword(s):

Machine Learning ◽

Frequency Domain ◽

High Performance ◽

Large Scale ◽

Class Imbalance ◽

Load Analysis

Automatic Generation of 3D Natural Anime-like Non-Player Characters with Machine Learning

2020 International Conference on Cyberworlds (CW) ◽

10.1109/cw49994.2020.00023 ◽

2020 ◽

Author(s):

Ruizhe Li ◽

Masanori Nakayama ◽

Issei Fujishiro

Keyword(s):

Machine Learning ◽

Automatic Generation

Design and Development of Lubricating Material Database and Research on Performance Prediction Method of Machine Learning

Scientific Reports ◽

10.1038/s41598-019-56776-2 ◽

2019 ◽

Vol 9 (1) ◽

Cited By ~ 3

Author(s):

Dan Jia ◽

Haitao Duan ◽

Shengpeng Zhan ◽

Yongliang Jin ◽

Bingxue Cheng ◽

...

Keyword(s):

Machine Learning ◽

High Performance ◽

Prediction Method ◽

Lubricating Oil ◽

Physical Parameters ◽

First Principle ◽

Vast Number ◽

First Principle Calculation ◽

Lubricating Material ◽

Machine Learning Model

Long developing period and cumbersome evaluation for the lubricating materials performance seriously jeopardize the successful development and application of any database system in tribological field. Such major setback can be solved effectively by implementing approaches with high throughput calculation. However, it often involves with vast number of output files, which are computed on the basis of first principle computation, having different data format from that of their experimental counterparts. Commonly, the input, storage and management of first principle calculation files and their individually test counterparts, implementing fast query and display in the database, adding to the use of physical parameters, as predicted with the performance estimated by first principle approach, may solve such setbacks. Investigation is thus performed for establishing database website specifically for lubricating materials, which satisfies both data: (i) as calculated on the basis of first principles and (ii) as obtained by practical experiment. It further explores preliminarily the likely relationship between calculated physical parameters of lubricating oil and its respectively tribological and anti-oxidative performance as predicted by lubricant machine learning model. Success of the method facilitates in instructing the obtainment of optimal design, preparation and application for any new lubricating material so that accomplishment of high performance is possible.

Service-Aware Two-Level Partitioning for Machine Learning-based Network Intrusion Detection with High Performance and High Scalability

IEEE Access ◽

10.1109/access.2020.3048900 ◽

2021 ◽

pp. 1-1

Author(s):

Yeongje Uhm ◽

Wooguil Pak

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

High Performance ◽

Network Intrusion Detection ◽

Network Intrusion ◽

High Scalability
