A New Hyper-Parameter Optimization Method for Power Load Forecast Based on Recurrent Neural Networks

Algorithms ◽  
2021 ◽  
Vol 14 (6) ◽  
pp. 163
Author(s):  
Yaru Li ◽  
Yulai Zhang ◽  
Yongping Cai

The selection of hyper-parameters plays a critical role in prediction tasks based on recurrent neural networks (RNN). Traditionally, the hyper-parameters of machine learning models are selected through simulations and human experience. In recent years, multiple algorithms based on Bayesian optimization (BO) have been developed to determine the optimal values of the hyper-parameters. Most of these methods require gradients to be calculated. In this work, particle swarm optimization (PSO) is used under the BO framework to develop a new method for hyper-parameter optimization. The proposed algorithm (BO-PSO) is free of gradient calculation, and the particles can naturally be optimized in parallel, so the computational complexity is effectively reduced, meaning better hyper-parameters can be obtained for the same amount of computation. Experiments on real-world power load data show that the proposed method outperforms the existing state-of-the-art algorithms, BO with limited-memory BFGS with bounds (BO-L-BFGS-B) and BO with truncated Newton (BO-TNC), in terms of prediction accuracy. The prediction errors across different models show that BO-PSO is an effective hyper-parameter optimization method.
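A minimal sketch of the BO-PSO idea: a Gaussian-process surrogate whose acquisition function is maximized by a gradient-free particle swarm instead of L-BFGS-B or TNC. The objective, bounds, and all PSO constants below are illustrative assumptions, not the paper's settings.

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

def expected_improvement(X, gp, y_best):
    """Standard EI acquisition; gradient-free, so PSO can optimize it."""
    mu, sigma = gp.predict(X, return_std=True)
    sigma = np.maximum(sigma, 1e-9)
    z = (y_best - mu) / sigma
    return (y_best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

def pso_maximize(f, bounds, n_particles=30, iters=50, w=0.7, c1=1.5, c2=1.5):
    """Plain global-best PSO; all particles are evaluated together (vectorized)."""
    dim = len(bounds)
    lo, hi = bounds[:, 0], bounds[:, 1]
    x = np.random.uniform(lo, hi, size=(n_particles, dim))
    v = np.zeros_like(x)
    pbest, pbest_val = x.copy(), f(x)
    gbest = pbest[np.argmax(pbest_val)]
    for _ in range(iters):
        r1, r2 = np.random.rand(n_particles, dim), np.random.rand(n_particles, dim)
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)
        x = np.clip(x + v, lo, hi)
        val = f(x)
        improved = val > pbest_val
        pbest[improved], pbest_val[improved] = x[improved], val[improved]
        gbest = pbest[np.argmax(pbest_val)]
    return gbest

def bo_pso(objective, bounds, n_init=5, n_iter=20):
    """Minimize `objective` over a box; bounds is an (dim, 2) numpy array."""
    X = np.random.uniform(bounds[:, 0], bounds[:, 1], size=(n_init, len(bounds)))
    y = np.array([objective(x) for x in X])
    gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True)
    for _ in range(n_iter):
        gp.fit(X, y)
        x_next = pso_maximize(lambda P: expected_improvement(P, gp, y.min()), bounds)
        X = np.vstack([X, x_next])
        y = np.append(y, objective(x_next))
    return X[np.argmin(y)], y.min()
```

In this setting the RNN hyper-parameters (say, learning rate and hidden size) would be the dimensions of `bounds`, and `objective` would return the validation error of a model trained with those values.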

2022 ◽  
pp. 166-201
Author(s):  
Asha Gowda Karegowda ◽  
Devika G.

Artificial neural networks (ANN) are often well suited to classification problems. Even so, training an ANN remains a challenging task for large, high-dimensional search spaces. These difficulties are compounded in applications that involve fine-tuning the ANN control parameters: the weights and biases. No single search and optimization method suits the weights and biases of an ANN for all problems. Traditional heuristic approaches fall short because of their poor convergence speed and their tendency to end up in local optima. In this connection, meta-heuristic algorithms have proven to provide consistent solutions for optimizing ANN training parameters. This chapter provides a critique of the existing heuristic and meta-heuristic literature on training neural networks, covering the algorithms, their applicability, and their reliability for parameter optimization. In addition, real-time applications of ANN are presented. Finally, future directions to be explored in the field of ANN are presented, which will be of potential interest to upcoming researchers.
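To make the metaheuristic-training idea concrete, here is a minimal sketch: the weights and biases of a one-hidden-layer network are flattened into a single vector and tuned by differential evolution (one gradient-free metaheuristic among many the chapter surveys) instead of backpropagation. The XOR task and all sizes are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import differential_evolution

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0.0, 1.0, 1.0, 0.0])  # XOR labels

N_IN, N_HID = 2, 4
N_PARAMS = N_IN * N_HID + N_HID + N_HID + 1  # W1, b1, w2, b2

def forward(params, X):
    """Decode the flat parameter vector and run the network."""
    i = 0
    W1 = params[i:i + N_IN * N_HID].reshape(N_IN, N_HID); i += N_IN * N_HID
    b1 = params[i:i + N_HID]; i += N_HID
    w2 = params[i:i + N_HID]; i += N_HID
    b2 = params[i]
    h = np.tanh(X @ W1 + b1)
    return 1.0 / (1.0 + np.exp(-(h @ w2 + b2)))  # sigmoid output

def loss(params):
    """Mean squared error: the fitness the metaheuristic minimizes."""
    return np.mean((forward(params, X) - y) ** 2)

result = differential_evolution(loss, bounds=[(-5, 5)] * N_PARAMS,
                                maxiter=300, seed=0, tol=1e-8)
print("loss:", result.fun, "predictions:", forward(result.x, X).round(2))
```

The same flatten-evaluate-update pattern applies to any population-based metaheuristic (GA, PSO, and so on): only the update rule over the parameter vectors changes.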


2020 ◽  
Vol 34 (04) ◽  
pp. 5150-5157
Author(s):  
Fandong Meng ◽  
Jinchao Zhang ◽  
Yang Liu ◽  
Jie Zhou

Recurrent neural networks (RNNs) have been widely used to deal with sequence learning problems. The input-dependent transition function, which folds new observations into hidden states to sequentially construct fixed-length representations of arbitrary-length sequences, plays a critical role in RNNs. Based on single space composition, transition functions in existing RNNs often have difficulty in capturing complicated long-range dependencies. In this paper, we introduce a new Multi-zone Unit (MZU) for RNNs. The key idea is to design a transition function that is capable of modeling multiple space composition. The MZU consists of three components: zone generation, zone composition, and zone aggregation. Experimental results on multiple datasets of the character-level language modeling task and the aspect-based sentiment analysis task demonstrate the superiority of the MZU.
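The paper's exact equations are not reproduced here; the following is a speculative numpy sketch of the three-stage structure the abstract names (zone generation, zone composition, zone aggregation), with an entirely illustrative parameterization.

```python
import numpy as np

rng = np.random.default_rng(0)
D, K = 8, 3  # hidden size and number of zones (illustrative)

# One projection pair per zone: each zone is one candidate composition space.
Wx = rng.normal(scale=0.1, size=(K, D, D))  # input projections
Wh = rng.normal(scale=0.1, size=(K, D, D))  # hidden-state projections
Wa = rng.normal(scale=0.1, size=(K, D))     # aggregation scoring weights

def mzu_step(x_t, h_prev):
    # 1) Zone generation: K candidate states from multiple composition spaces.
    zones = np.tanh(np.einsum('kij,j->ki', Wx, x_t) +
                    np.einsum('kij,j->ki', Wh, h_prev))   # (K, D)
    # 2) Zone composition: let the zones interact (a simple mixing step here).
    zones = np.tanh(zones + zones.mean(axis=0, keepdims=True))
    # 3) Zone aggregation: softmax-weighted combination into one hidden state.
    scores = np.einsum('kd,kd->k', Wa, zones)             # (K,)
    alpha = np.exp(scores - scores.max()); alpha /= alpha.sum()
    return alpha @ zones                                  # (D,)

h = np.zeros(D)
for x_t in rng.normal(size=(5, D)):  # a toy length-5 input sequence
    h = mzu_step(x_t, h)
print(h.shape)  # (8,)
```

The point of the structure is that the transition from `h_prev` to `h` passes through several composition spaces at once rather than a single one, which is what the abstract credits for better long-range dependency modeling.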


2021 ◽  
Author(s):  
◽  
Mashall Aryan

The solution to many science and engineering problems includes identifying the minimum or maximum of an unknown continuous function whose evaluation inflicts non-negligible costs in terms of resources such as money, time, human attention or computational processing. In such a case, the choice of new points to evaluate is critical. A successful approach has been to choose these points by considering a distribution over plausible surfaces, conditioned on all previous points and their evaluations. In this sequential bi-step strategy, also known as Bayesian Optimization, a prior is first defined over possible functions and updated to a posterior in the light of available observations. Then, using this posterior, namely the surrogate model, an infill criterion is formed and utilized to find the next location to sample from. By far the most common prior distribution and infill criterion are the Gaussian Process and Expected Improvement, respectively.

The popularity of Gaussian Processes in Bayesian optimization is partially due to their ability to represent the posterior in closed form. Nevertheless, the Gaussian Process is afflicted with several shortcomings that directly affect its performance. For example, inference scales poorly with the amount of data, numerical stability degrades with the number of data points, and strong assumptions about the observation model are required, which might not be consistent with reality. These drawbacks encourage us to seek better alternatives. This thesis studies the application of Neural Networks to enhance Bayesian Optimization. It proposes several Bayesian optimization methods that use neural networks either as their surrogates or in the infill criterion.

This thesis introduces a novel Bayesian Optimization method in which Bayesian Neural Networks are used as a surrogate. This reduces the computational complexity of inference in the surrogate from cubic (in the number of observations) for the GP to linear. Different variations of Bayesian Neural Networks (BNN) are put into practice and inferred using Monte Carlo sampling. The results show that the Monte Carlo Bayesian Neural Network surrogate can perform better than, or at least comparably to, Gaussian Process-based Bayesian optimization methods on a set of benchmark problems.

This work also develops a fast Bayesian Optimization method with an efficient surrogate-building process. This new Bayesian Optimization algorithm uses Bayesian Random-Vector Functional Link Networks as the surrogate. In this family of models, inference is performed on only a small subset of the model parameters, while the rest are randomly drawn from a prior. The proposed methods are tested on a set of benchmark continuous functions and hyperparameter optimization problems, and the results show they are competitive with state-of-the-art Bayesian Optimization methods.

This study further proposes a novel neural network-based infill criterion. In this method, locations to sample from are found by minimizing the joint conditional likelihood of the new point and the parameters of a neural network. The results show that in Bayesian Optimization methods with Bayesian Neural Network surrogates, this new infill criterion outperforms expected improvement.

Finally, this thesis presents order-preserving generative models and uses them in a variational Bayesian context to infer Implicit Variational Bayesian Neural Network (IVBNN) surrogates for a new Bayesian Optimization. This new inference mechanism is more efficient and scalable than Monte Carlo sampling. The results show that IVBNN can outperform Monte Carlo BNN in Bayesian optimization of the hyperparameters of machine learning models.
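A minimal sketch of the random-feature idea behind an RVFL-style Bayesian surrogate: hidden weights are drawn once from a prior and frozen, and exact Bayesian linear regression is done only on the output weights, giving a closed-form predictive mean and variance whose cost in the number of observations is linear. The class name, priors, and feature sizes are illustrative assumptions, not the thesis's construction.

```python
import numpy as np

class RVFLSurrogate:
    def __init__(self, dim, n_features=100, alpha=1.0, noise=1e-3, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(size=(dim, n_features))  # frozen random hidden weights
        self.b = rng.uniform(-1.0, 1.0, n_features)  # frozen random hidden biases
        self.alpha, self.noise = alpha, noise        # prior precision, noise variance

    def _phi(self, X):
        return np.tanh(X @ self.W + self.b)          # random hidden features

    def fit(self, X, y):
        # Conjugate Gaussian posterior over the output weights only.
        Phi = self._phi(X)
        A = self.alpha * np.eye(Phi.shape[1]) + Phi.T @ Phi / self.noise
        self.S = np.linalg.inv(A)                    # posterior covariance
        self.m = self.S @ Phi.T @ y / self.noise     # posterior mean

    def predict(self, X):
        Phi = self._phi(X)
        mu = Phi @ self.m
        var = self.noise + np.einsum('nf,fg,ng->n', Phi, self.S, Phi)
        return mu, np.sqrt(var)
```

Because `fit`/`predict` return a mean and standard deviation, such a surrogate can drop into any expected-improvement loop (like the BO-PSO sketch earlier) in place of a Gaussian process.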


2014 ◽  
Vol 24 (1) ◽  
pp. 165-181 ◽  
Author(s):  
Pawel Plawiak ◽  
Ryszard Tadeusiewicz

Abstract This paper presents two innovative evolutionary-neural systems based on feed-forward and recurrent neural networks used for quantitative analysis. These systems have been applied to the approximation of phenol concentration. Their performance was compared against conventional methods of artificial intelligence (artificial neural networks, fuzzy logic and genetic algorithms). The proposed systems combine data preprocessing methods, genetic algorithms and the Levenberg-Marquardt (LM) algorithm used for learning feed-forward and recurrent neural networks. The initial weights and biases of the neural networks, chosen by means of a genetic algorithm, are then tuned with the LM algorithm. The evaluation is made on the basis of accuracy and complexity criteria. The main advantage of the proposed systems is the elimination of the random selection of network weights and biases, resulting in increased efficiency of the systems.
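A minimal sketch of this hybrid scheme: a genetic-style population search (selection plus Gaussian mutation; the paper's full GA operators are not reproduced) picks promising initial weights, which Levenberg-Marquardt, here via scipy's 'lm' least-squares solver, then fine-tunes. The sine-fitting task, network size and GA settings are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import least_squares

rng = np.random.default_rng(0)
x = np.linspace(-np.pi, np.pi, 50)
y = np.sin(x)

N_HID = 3
N_PARAMS = N_HID + N_HID + N_HID + 1  # w1, b1, w2, b2 for a 1-D-input net

def residuals(p):
    w1, b1, w2, b2 = p[:N_HID], p[N_HID:2*N_HID], p[2*N_HID:3*N_HID], p[-1]
    h = np.tanh(np.outer(x, w1) + b1)   # (50, N_HID)
    return h @ w2 + b2 - y              # 50 residuals >= N_PARAMS, as 'lm' requires

def fitness(p):
    return np.sum(residuals(p) ** 2)

# Genetic-style initialization: keep the fittest of a random population and
# mutate them for a few generations.
pop = rng.normal(scale=2.0, size=(40, N_PARAMS))
for _ in range(30):
    scores = np.array([fitness(p) for p in pop])
    elite = pop[np.argsort(scores)[:10]]                      # selection
    children = elite[rng.integers(0, 10, 30)] \
        + rng.normal(scale=0.3, size=(30, N_PARAMS))          # mutation
    pop = np.vstack([elite, children])                        # elitism

best = pop[np.argmin([fitness(p) for p in pop])]
tuned = least_squares(residuals, best, method='lm')           # LM fine-tuning
print("GA loss:", fitness(best), "-> GA+LM loss:", fitness(tuned.x))
```

The division of labor mirrors the abstract: the population search removes the dependence on random initialization, and LM supplies fast local convergence from the chosen starting point.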


2019 ◽  
Vol 52 (7-8) ◽  
pp. 888-895
Author(s):  
Heping Chen ◽  
Seth Bowels ◽  
Biao Zhang ◽  
Thomas Fuhlbrigge

Proportional-integral-derivative (PID) control systems have been widely used in industrial applications. For complex systems, tuning controller parameters to satisfy the process requirements is very challenging. Different methods have been proposed to solve the problem. However, these methods suffer from several problems, such as dealing with system complexity, minimizing tuning effort and balancing different performance indices including rise time, settling time, steady-state error and overshoot. In this paper, we develop an automatic controller parameter optimization method based on a Gaussian process regression Bayesian optimization algorithm. A non-parametric model is constructed using Gaussian process regression. By combining Gaussian process regression with the Bayesian optimization algorithm, potential candidates can be predicted and applied to guide the optimization process. Both experiments and simulations were performed to demonstrate the effectiveness of the proposed method.
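A minimal sketch of GP-based Bayesian optimization of PID gains: a discrete simulation of a first-order plant yields a scalar cost mixing tracking error and overshoot, and a GP surrogate with expected improvement picks the next (Kp, Ki, Kd) to try. The plant dynamics, cost weights and bounds are illustrative assumptions, not the paper's setup.

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

def pid_cost(gains, dt=0.01, T=5.0):
    """Step response of a first-order plant dy/dt = -y + u under PID control."""
    kp, ki, kd = gains
    y, integ, prev_err, cost = 0.0, 0.0, 1.0, 0.0
    for _ in range(int(T / dt)):
        err = 1.0 - y                       # unit-step setpoint
        integ += err * dt
        u = kp * err + ki * integ + kd * (err - prev_err) / dt
        prev_err = err
        y += (-y + u) * dt                  # Euler step of the plant
        cost += (abs(err) + 10.0 * max(y - 1.0, 0.0)) * dt  # error + overshoot penalty
    return cost

bounds = np.array([[0.0, 10.0], [0.0, 5.0], [0.0, 1.0]])  # Kp, Ki, Kd
rng = np.random.default_rng(0)
X = rng.uniform(bounds[:, 0], bounds[:, 1], size=(5, 3))
y = np.array([pid_cost(g) for g in X])
gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True)

for _ in range(25):
    gp.fit(X, y)
    cand = rng.uniform(bounds[:, 0], bounds[:, 1], size=(2000, 3))
    mu, sd = gp.predict(cand, return_std=True)
    z = (y.min() - mu) / np.maximum(sd, 1e-9)
    ei = (y.min() - mu) * norm.cdf(z) + sd * norm.pdf(z)   # expected improvement
    x_next = cand[np.argmax(ei)]                           # GP-suggested candidate
    X = np.vstack([X, x_next]); y = np.append(y, pid_cost(x_next))

print("best gains:", X[np.argmin(y)], "cost:", y.min())
```

Each iteration spends one plant evaluation where the surrogate expects the most improvement, which is what lets the method balance the competing indices (rise time, overshoot, steady-state error) with few tuning trials.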

