Applying neural networks to predict HPC-I/O bandwidth over seismic data on lustre file system for ExSeisDat

AbstractHPC or super-computing clusters are designed for executing computationally intensive operations that typically involve large scale I/O operations. This most commonly involves using a standard MPI library implemented in C/C++. The MPI-I/O performance in HPC clusters tends to vary significantly over a range of configuration parameters that are generally not taken into account by the algorithm. It is commonly left to individual practitioners to optimise I/O on a case by case basis at code level. This can often lead to a range of unforeseen outcomes. The ExSeisDat utility is built on top of the native MPI-I/O library comprising of Parallel I/O and Workflow Libraries to process seismic data encapsulated in SEG-Y file format. The SEG-Y File data structure is complex in nature, due to the alternative arrangement of trace header and trace data. Its size scales to petabytes and the chances of I/O performance degradation are further increased by ExSeisDat. This research paper presents a novel study of the changing I/O performance in terms of bandwidth, with the use of parallel plots against various MPI-I/O, Lustre (Parallel) File System and SEG-Y File parameters. Another novel aspect of this research is the predictive modelling of MPI-I/O behaviour over SEG-Y File benchmarks using Artificial Neural Networks (ANNs). The accuracy ranges from 62.5% to 96.5% over the set of trained ANN models. The computed Mean Square Error (MSE), Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) values further support the generalisation of the prediction models. This paper demonstrates that by using our ANNs prediction technique, the configurations can be tuned beforehand to avoid poor I/O performance.

Download Full-text

Prediction of Blended Yarn Evenness and Tensile Properties by Using Artificial Neural Network and Multiple Linear Regression

Autex Research Journal ◽

10.1515/aut-2015-0018 ◽

2016 ◽

Vol 16 (2) ◽

pp. 43-50 ◽

Cited By ~ 8

Author(s):

Samander Ali Malik ◽

Assad Farooq ◽

Thomas Gereke ◽

Chokri Cherif

Keyword(s):

Neural Networks ◽

Linear Regression ◽

Multiple Linear Regression ◽

Tensile Properties ◽

Regression Models ◽

Prediction Models ◽

Research Work ◽

Absolute Error ◽

Knowledge Domain ◽

Artificial Neural

Abstract The present research work was carried out to develop the prediction models for blended ring spun yarn evenness and tensile parameters using artificial neural networks (ANNs) and multiple linear regression (MLR). Polyester/cotton blend ratio, twist multiplier, back roller hardness and break draft ratio were used as input parameters to predict yarn evenness in terms of CVm% and yarn tensile properties in terms of tenacity and elongation. Feed forward neural networks with Bayesian regularisation support were successfully trained and tested using the available experimental data. The coefficients of determination of ANN and regression models indicate that there is a strong correlation between the measured and predicted yarn characteristics with an acceptable mean absolute error values. The comparative analysis of two modelling techniques shows that the ANNs perform better than the MLR models. The relative importance of input variables was determined using rank analysis through input saliency test on optimised ANN models and standardised coefficients of regression models. These models are suitable for yarn manufacturers and can be used within the investigated knowledge domain.

Download Full-text

Large-scale, high-resolution comparison of the core visual object recognition behavior of humans, monkeys, and state-of-the-art deep artificial neural networks

10.1101/240614 ◽

2018 ◽

Cited By ~ 8

Author(s):

Rishi Rajalingham ◽

Elias B. Issa ◽

Pouya Bashivan ◽

Kohitij Kar ◽

Kailyn Schmidt ◽

...

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

High Resolution ◽

Large Scale ◽

State Of The Art ◽

Neural Mechanisms ◽

Mechanistic Models ◽

Object Discrimination ◽

Ann Models ◽

Artificial Neural

ABSTRACTPrimates—including humans—can typically recognize objects in visual images at a glance even in the face of naturally occurring identity-preserving image transformations (e.g. changes in viewpoint). A primary neuroscience goal is to uncover neuron-level mechanistic models that quantitatively explain this behavior by predicting primate performance for each and every image. Here, we applied this stringent behavioral prediction test to the leading mechanistic models of primate vision (specifically, deep, convolutional, artificial neural networks; ANNs) by directly comparing their behavioral signatures against those of humans and rhesus macaque monkeys. Using high-throughput data collection systems for human and monkey psychophysics, we collected over one million behavioral trials for 2400 images over 276 binary object discrimination tasks. Consistent with previous work, we observed that state-of-the-art deep, feed-forward convolutional ANNs trained for visual categorization (termed DCNNIC models) accurately predicted primate patterns of object-level confusion. However, when we examined behavioral performance for individual images within each object discrimination task, we found that all tested DCNNIC models were significantly non-predictive of primate performance, and that this prediction failure was not accounted for by simple image attributes, nor rescued by simple model modifications. These results show that current DCNNIC models cannot account for the image-level behavioral patterns of primates, and that new ANN models are needed to more precisely capture the neural mechanisms underlying primate object vision. To this end, large-scale, high-resolution primate behavioral benchmarks—such as those obtained here—could serve as direct guides for discovering such models.SIGNIFICANCE STATEMENTRecently, specific feed-forward deep convolutional artificial neural networks (ANNs) models have dramatically advanced our quantitative understanding of the neural mechanisms underlying primate core object recognition. In this work, we tested the limits of those ANNs by systematically comparing the behavioral responses of these models with the behavioral responses of humans and monkeys, at the resolution of individual images. Using these high-resolution metrics, we found that all tested ANN models significantly diverged from primate behavior. Going forward, these high-resolution, large-scale primate behavioral benchmarks could serve as direct guides for discovering better ANN models of the primate visual system.

Download Full-text

Prediction of Horizontal Daily Global Solar Irradiation Using Artificial Neural Networks (ANNs) in the Castile and León Region, Spain

Agronomy ◽

10.3390/agronomy10010096 ◽

2020 ◽

Vol 10 (1) ◽

pp. 96

Author(s):

Francisco J. Diez ◽

Luis M. Navas-Gracia ◽

Leticia Chico-Santamarta ◽

Adriana Correa-Guimaraes ◽

Andrés Martínez-Rodríguez

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Predictive Modelling ◽

Solar Irradiation ◽

Data Series ◽

Markov Analysis ◽

Global Solar Irradiation ◽

Ann Models ◽

Artificial Neural ◽

Hidden Layer

This article evaluates horizontal daily global solar irradiation predictive modelling using artificial neural networks (ANNs) for its application in agricultural sciences and technologies. An eight year data series (i.e., training networks period between 2004–2010, with 2011 as the validation year) was measured at an agrometeorological station located in Castile and León, Spain, owned by the irrigation advisory system SIAR. ANN models were designed and evaluated with different neuron numbers in the input and hidden layers. The only neuron used in the outlet layer was the global solar irradiation simulated the day after. Evaluated values of the input data were the horizontal daily global irradiation of the current day [H(t)] and two days before [H(t−1), H(t−2)], the day of the year [J(t)], and the daily clearness index [Kt(t)]. Validated results showed that best adjustment models are the ANN 7 model (RMSE = 3.76 MJ/(m2·d), with two inputs ([H(t), Kt(t)]) and four neurons in the hidden layer) and the ANN 4 model (RMSE = 3.75 MJ/(m2·d), with two inputs ([H(t), J(t)]) and two neurons in the hidden layer). Thus, the studied ANN models had better results compared to classic methods (CENSOLAR typical year, weighted moving mean, linear regression, Fourier and Markov analysis) and are practically easier as they need less input variables.

Download Full-text

Body Condition Score Estimation Based on Regression Analysis Using a 3D Camera

Sensors ◽

10.3390/s20133705 ◽

2020 ◽

Vol 20 (13) ◽

pp. 3705

Author(s):

Thi Thi Zin ◽

Pann Thinzar Seint ◽

Pyke Tin ◽

Yoichiro Horii ◽

Ikuo Kobayashi

Keyword(s):

Body Condition ◽

Large Scale ◽

Body Condition Score ◽

Absolute Error ◽

The Body ◽

Analytical Models ◽

Percentage Error ◽

Time Saving ◽

3D Camera ◽

Condition Score

The Body Condition Score (BCS) for cows indicates their energy reserves, the scoring for which ranges from very thin to overweight. These measurements are especially useful during calving, as well as early lactation. Achieving a correct BCS helps avoid calving difficulties, losses and other health problems. Although BCS can be rated by experts, it is time-consuming and often inconsistent when performed by different experts. Therefore, the aim of our system is to develop a computerized system to reduce inconsistencies and to provide a time-saving solution. In our proposed system, the automatic body condition scoring system is introduced by using a 3D camera, image processing techniques and regression models. The experimental data were collected on a rotary parlor milking station on a large-scale dairy farm in Japan. The system includes an application platform for automatic image selection as a primary step, which was developed for smart monitoring of individual cows on large-scale farms. Moreover, two analytical models are proposed in two regions of interest (ROI) by extracting 3D surface roughness parameters. By applying the extracted parameters in mathematical equations, the BCS is automatically evaluated based on measurements of model accuracy, with one of the two models achieving a mean absolute percentage error (MAPE) of 3.9%, and a mean absolute error (MAE) of 0.13.

Download Full-text

Application of artificial neural networks to predict the COVID-19 outbreak

Global Health Research and Policy ◽

10.1186/s41256-020-00175-y ◽

2020 ◽

Vol 5 (1) ◽

Cited By ~ 2

Author(s):

Hamid Reza Niazkar ◽

Majid Niazkar

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Incubation Period ◽

Prediction Models ◽

Time Interval ◽

Policy Makers ◽

Ann Models ◽

Artificial Neural ◽

Relative Errors ◽

Mean Square Errors

Abstract Background Millions of people have been infected worldwide in the COVID-19 pandemic. In this study, we aim to propose fourteen prediction models based on artificial neural networks (ANN) to predict the COVID-19 outbreak for policy makers. Methods The ANN-based models were utilized to estimate the confirmed cases of COVID-19 in China, Japan, Singapore, Iran, Italy, South Africa and United States of America. These models exploit historical records of confirmed cases, while their main difference is the number of days that they assume to have impact on the estimation process. The COVID-19 data were divided into a train part and a test part. The former was used to train the ANN models, while the latter was utilized to compare the purposes. The data analysis shows not only significant fluctuations in the daily confirmed cases but also different ranges of total confirmed cases observed in the time interval considered. Results Based on the obtained results, the ANN-based model that takes into account the previous 14 days outperforms the other ones. This comparison reveals the importance of considering the maximum incubation period in predicting the COVID-19 outbreak. Comparing the ranges of determination coefficients indicates that the estimated results for Italy are the best one. Moreover, the predicted results for Iran achieved the ranges of [0.09, 0.15] and [0.21, 0.36] for the mean absolute relative errors and normalized root mean square errors, respectively, which were the best ranges obtained for these criteria among different countries. Conclusion Based on the achieved results, the ANN-based model that takes into account the previous fourteen days for prediction is suggested to predict daily confirmed cases, particularly in countries that have experienced the first peak of the COVID-19 outbreak. This study has not only proved the applicability of ANN-based model for prediction of the COVID-19 outbreak, but also showed that considering incubation period of SARS-COV-2 in prediction models may generate more accurate estimations.

Download Full-text

Prediction by using Artificial Neural Networks and Box-Jenkins methodologies: Comparison Study

Journal of Al-Qadisiyah for Computer Science and Mathematics ◽

10.29304/jqcm.2017.9.2.325 ◽

2017 ◽

Vol 9 (2) ◽

Author(s):

Mohammed Habib Al- Sharoot ◽

Emaan Yousif Abdoon

Keyword(s):

Neural Networks ◽

Exchange Rate ◽

Absolute Error ◽

Percentage Error ◽

Neural Network Approach ◽

Decision Making Processes ◽

The Mean ◽

Us Dollar ◽

Stable Economy ◽

Better Than

The variations in exchange rate, especially the sudden unexpected increases and decreases, have significant impact on the national economy of any country. Iraq is no exception; therefore, the accurate forecasting of exchange rate of Iraqi dinar to US dollar plays an important role in the planning and decision-making processes as well as the maintenance of a stable economy in Iraq. This research aims to compare Box-Jenkins methodology to neural networks in terms of forecasting the exchange rate of Iraqi dinar to US dollar based on data provided by the Iraqi Central Bank for the period 30/01/2004 and 30/12/2014. Based on the Mean Square Error (MSE), the Mean Absolute Error (MAE), and the Mean Absolute Percentage Error (MAPE) as criteria to compare the two methodologies, it was concluded that Box-Jenkins is better than neural network approach in forecasting.

Download Full-text

Use of Artificial Neural Networks and Arima to Forecasting Consumption Sawnwood of Pinus sp. in Brazil

International Forestry Review ◽

10.1505/146554819825863735 ◽

2019 ◽

Vol 21 (1) ◽

pp. 51-61 ◽

Cited By ~ 1

Author(s):

D.A. Buratto ◽

R. Timofeiczyk Junior ◽

J.C.G.L. Silva ◽

J.R. Frega ◽

M.S.S.A. Wiecheteck ◽

...

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Mean Squared Error ◽

Arima Model ◽

Secondary Data ◽

Information Criterion ◽

Absolute Error ◽

Percentage Error ◽

Explanatory Variables ◽

Artificial Neural

The objective of this study was to analyze the application of an artificial neural networks model and an ARIMA model to predict the consumption of sawnwood of pine. For this, we use real and secondary data collected and obtained from a historical data source, corresponding to the period from 1997 to 2016, which were later tested to generate the forecast models. Based on economic and statistical criteria, six explanatory variables were used to fit the best model. The choice of the model was made based on Mean Squared Error, Mean Absolute Error, Theil U metric, Percentage Error of Forecast and Akaike value information criterion. The results indicated that the models generated through the ARIMA model presented better performance when compared to the artificial neural network. The best adjusted model estimated a reduction of 1.33% in consumption of sawnwood of pine in Brazil for the period between 2017 and 2020.

Download Full-text

A Hybridly Optimized LSTM-Based Data Flow Prediction Model for Dependable Online Ticketing

Wireless Communications and Mobile Computing ◽

10.1155/2021/9951607 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Chunmei Fan ◽

Jiansheng Zhu ◽

Haroon Elahi ◽

Lipeng Yang ◽

Beibei Li

Keyword(s):

Service Delivery ◽

Network Flow ◽

Prediction Models ◽

Short Term Memory ◽

Real Data ◽

Communication Technologies ◽

Absolute Error ◽

Percentage Error ◽

Fifth Generation ◽

Proposed Model

Fifth-generation (5G) communication technologies and artificial intelligence enable the design and deployment of sophisticated solutions for enhanced user experience and superior network-based service delivery. However, the performance of the systems offering 5G-based services depends on various factors. In this paper, we consider the case of the online railway ticketing system in China that serves the needs of hundreds of millions of people daily. This system’s online access rates vary over time, and fluctuations are experienced, affecting its overall dependability and service quality. We use long short-term memory network, particle swarm optimization, and differential evolution to construct DP-LSTM—a hybridly optimized model to predict network flow for dependable and quality-enhanced service delivery. We evaluate the proposed model using real data collected over six months from the “12306 online ticketing” system. We compare the performance of the proposed model with mainstream network traffic prediction models. We use mean absolute percentage error, mean absolute error, and root mean square error for performance evaluation. Experimental results show the superiority of the proposed model.

Download Full-text

Toward a State-of-the-Art of Fly-Rock Prediction Technology in Open-Pit Mines Using EANNs Model

Applied Sciences ◽

10.3390/app9214554 ◽

2019 ◽

Vol 9 (21) ◽

pp. 4554 ◽

Cited By ~ 10

Author(s):

Hoang Nguyen ◽

Xuan-Nam Bui ◽

Trung Nguyen-Thoi ◽

Prashanth Ragam ◽

Hossein Moayedi

Keyword(s):

State Of The Art ◽

Promising Result ◽

Absolute Error ◽

Original Data ◽

Training Data ◽

Open Pit ◽

Training Dataset ◽

Percentage Error ◽

Ann Model ◽

Ann Models

Fly-rock induced by blasting is an undesirable phenomenon in quarries. It can be dangerous for humans, equipment, and buildings. To minimize its undesirable hazards, we proposed a state-of-the-art technology of fly-rock prediction based on artificial neural network (ANN) models and their robust combination, called EANNs model (ensemble of ANN models); 210 fly-rock events were recorded to develop and test the ANN and EANNs models. Of thi sample, 80% of the whole dataset was assigned to develop the models, the remaining 20% was assigned to confirm the models developed. Accordingly, five ANN models were designed and developed using the training dataset (i.e., 80% of the whole original data) first; then, their predictions on the training dataset were ensembled to generate a new training dataset. Subsequently, another ANN model was developed based on the new set of training data (i.e., EANNs model). Its performance was evaluated through a variety of performance indices, such as MAE (mean absolute error), MAPE (mean absolute percentage error), RMSE (root-mean-square error), R2 (correlation coefficient), and VAF (variance accounted for). A promising result was found for the proposed EANNs model in predicting blast-induced fly-rock with a MAE = 2.777, MAPE = 0.017, RMSE = 4.346, R2 = 0.986, and VAF = 98.446%. To confirm the performance of the proposed EANNs model, another ANN model with the same structure was developed and tested on the training and testing datasets. The findings also indicated that the proposed EANNs model yielded better performance than those of the ANN model with the same structure.

Download Full-text

Machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic: a multisource Internet data approach

Journal of Tourism Futures ◽

10.1108/jtf-10-2021-0239 ◽

2022 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Dinda Thalia Andariesta ◽

Meditya Wasesa

Keyword(s):

Machine Learning ◽

Prediction Models ◽

Absolute Error ◽

Percentage Error ◽

Support Vector ◽

Tourism Demand ◽

Learning Models ◽

Content Type ◽

International Tourist ◽

Machine Learning Models

PurposeThis research presents machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using multisource Internet data.Design/methodology/approachTo develop the prediction models, this research utilizes multisource Internet data from TripAdvisor travel forum and Google Trends. Temporal factors, posts and comments, search queries index and previous tourist arrivals records are set as predictors. Four sets of predictors and three distinct data compositions were utilized for training the machine learning models, namely artificial neural networks (ANNs), support vector regression (SVR) and random forest (RF). To evaluate the models, this research uses three accuracy metrics, namely root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE).FindingsPrediction models trained using multisource Internet data predictors have better accuracy than those trained using single-source Internet data or other predictors. In addition, using more training sets that cover the phenomenon of interest, such as COVID-19, will enhance the prediction model's learning process and accuracy. The experiments show that the RF models have better prediction accuracy than the ANN and SVR models.Originality/valueFirst, this study pioneers the practice of a multisource Internet data approach in predicting tourist arrivals amid the unprecedented COVID-19 pandemic. Second, the use of multisource Internet data to improve prediction performance is validated with real empirical data. Finally, this is one of the few papers to provide perspectives on the current dynamics of Indonesia's tourism demand.

Download Full-text