Two Projection Neural Networks With Reduced Model Complexity for Nonlinear Programming

2020, Vol. 31(6), pp. 2020-2029
Author(s):  
Youshen Xia ◽  
Jun Wang ◽  
Wenzhong Guo


1991, Vol. 02(04), pp. 331-339
Author(s):  
Jiahan Chen ◽  
Michael A. Shanblatt ◽  
Chia-Yiu Maa

A method for improving the performance of artificial neural networks for linear and nonlinear programming is presented. By analyzing the behavior of the conventional penalty function, the cause of its inherent accuracy degradation is identified. Based on this analysis, a new combination penalty function is proposed that ensures the equilibrium point lies acceptably close to the optimal point. A known neural network model is modified to use the new penalty function, and the corresponding circuit scheme is given. Simulation results show that the new method substantially reduces the relative error for both linear and nonlinear programming.
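
The abstract's mechanism is easy to demonstrate numerically. Below is a minimal, hypothetical Python sketch (not the paper's network or circuit model): a toy linear program is solved by gradient descent on a penalized energy function. The problem data, the gain K, and the use of an added l1 (exact) penalty term as a stand-in for the proposed "combination" penalty are all illustrative assumptions. With the quadratic penalty alone, the equilibrium sits a distance of order 1/K outside the feasible set, which is exactly the kind of inherent accuracy degradation described above; the combination pulls the equilibrium essentially onto the optimum.

    import numpy as np

    # Toy LP: minimize c^T x subject to A x <= b (data chosen for illustration).
    # True optimum is x* = (0, 1).
    c = np.array([-1.0, -2.0])
    A = np.array([[1.0, 1.0],    # x1 + x2 <= 1
                  [-1.0, 0.0],   # x1 >= 0
                  [0.0, -1.0]])  # x2 >= 0
    b = np.array([1.0, 0.0, 0.0])

    def grad(x, K, s):
        v = np.maximum(A @ x - b, 0.0)  # constraint violations
        # The quadratic term (gain K) alone leaves an O(1/K) equilibrium offset;
        # the l1 term (weight s) is an assumed "combination" component that
        # removes the offset once s exceeds the largest dual multiplier.
        return c + K * (A.T @ v) + s * (A.T @ np.sign(v))

    def solve(K, s, lr=1e-3, steps=50_000):
        x = np.zeros(2)
        for _ in range(steps):
            x = x - lr * grad(x, K, s)
        return np.round(x, 3)

    print("quadratic penalty only:", solve(K=10.0, s=0.0))  # about (-0.1, 1.3)
    print("combination penalty:   ", solve(K=10.0, s=3.0))  # about (0.0, 1.0)

Raising K shrinks the quadratic-only error but worsens conditioning, which is why a combination penalty is attractive: it reaches the optimum at a finite, moderate gain.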


2021, pp. 1-17
Author(s):  
Luis Sa-Couto ◽  
Andreas Wichert

Convolutional neural networks (CNNs) evolved from Fukushima's neocognitron model, which is based on Hubel and Wiesel's ideas about the early stages of the visual cortex. Unlike other branches of neocognitron-based models, the typical CNN relies on end-to-end supervised learning by backpropagation and removes the focus from built-in invariance mechanisms, using pooling not as a way to tolerate small shifts but as a regularization tool that decreases model complexity. These properties of end-to-end supervision and structural flexibility allow the typical CNN to become highly tuned to the training data, leading to extremely high accuracies on typical visual pattern recognition data sets. In this work, however, we hypothesize that this capability has a flip side: a hidden form of overfitting. More concretely, a supervised, backpropagation-based CNN will outperform a neocognitron/map transformation cascade (MTCCXC) when trained and tested on the same data set; yet if we take both trained models and test them on the same task on another data set, without retraining, the overfitting appears. Other neocognitron descendants, such as the What-Where model, go in a different direction: learning remains unsupervised, but more structure is added to capture invariance to typical changes. We therefore further hypothesize that if we repeat the same experiments with this model, the lack of supervision may make it worse than the typical CNN within the same data set, but the added structure will make it generalize even better to another one. To put our hypotheses to the test, we choose the simple task of handwritten digit classification and take two well-known data sets for it: MNIST and ETL-1. To make the two data sets as similar as possible, we experiment with several types of preprocessing; regardless of the type in question, the results align exactly with our expectations.
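
The cross-data-set protocol described above (train on one digit data set, evaluate on another without retraining) can be sketched in a few lines of Python. The architecture, hyperparameters, and the ETL-1 placeholder below are illustrative assumptions, not the authors' exact setup; torchvision ships no ETL-1 loader, so that step is left as a comment.

    import torch
    import torch.nn as nn
    from torch.utils.data import DataLoader
    from torchvision import datasets, transforms

    # A small, generic CNN for 28x28 grayscale digits (illustrative only).
    class SmallCNN(nn.Module):
        def __init__(self):
            super().__init__()
            self.features = nn.Sequential(
                nn.Conv2d(1, 16, 5), nn.ReLU(), nn.MaxPool2d(2),
                nn.Conv2d(16, 32, 5), nn.ReLU(), nn.MaxPool2d(2))
            self.classifier = nn.Linear(32 * 4 * 4, 10)

        def forward(self, x):
            return self.classifier(self.features(x).flatten(1))

    def accuracy(model, dataset):
        model.eval()
        correct = 0
        with torch.no_grad():
            for x, y in DataLoader(dataset, batch_size=256):
                correct += (model(x).argmax(1) == y).sum().item()
        return correct / len(dataset)

    tfm = transforms.ToTensor()
    train_set = datasets.MNIST("data", train=True, download=True, transform=tfm)
    model, loss_fn = SmallCNN(), nn.CrossEntropyLoss()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    for _ in range(2):  # a couple of epochs suffices for the illustration
        for x, y in DataLoader(train_set, batch_size=128, shuffle=True):
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()

    # In-distribution test: same data set the model was trained on.
    print(accuracy(model, datasets.MNIST("data", train=False, transform=tfm)))
    # Cross-data-set test: swap in digits from another source (e.g. ETL-1,
    # preprocessed to 28x28 grayscale) WITHOUT retraining; the gap between
    # the two numbers is the "hidden overfitting" the abstract hypothesizes.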


Author(s):  
Trevor J. Bihl ◽  
William A. Young II ◽  
Gary R. Weckman

Despite the natural advantage humans have for recognizing and interpreting patterns, large and complex datasets, as in big data, preclude efficient human analysis. Artificial neural networks (ANNs) provide a family of pattern recognition approaches for prediction, clustering, and classification applicable to knowledge discovery in databases (KDD), with ANN model complexity ranging from simple (for small problems) to highly complex (for large issues). To provide a starting point for readers, this chapter first describes foundational concepts that relate to ANNs. A listing of commonly used ANN methods, heuristics, and criteria for initializing ANNs is then discussed. Common pre- and post-processing methods for dimensionality reduction and data quality issues are then described. The authors then provide a tutorial example of ANN analysis. Finally, the authors list and describe applications of ANNs to specific business-related endeavors for further reading.
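
As a minimal illustration of the workflow the chapter walks through (pre-processing for dimensionality reduction feeding an ANN), here is a hedged scikit-learn sketch. The data set, component count, and network size are arbitrary choices made for the example, not the authors' tutorial values.

    from sklearn.datasets import load_digits
    from sklearn.decomposition import PCA
    from sklearn.model_selection import train_test_split
    from sklearn.neural_network import MLPClassifier
    from sklearn.pipeline import Pipeline
    from sklearn.preprocessing import StandardScaler

    X, y = load_digits(return_X_y=True)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    # Pre-processing (scaling, then PCA for dimensionality reduction)
    # feeds a small multilayer perceptron, mirroring the chapter's pipeline.
    model = Pipeline([
        ("scale", StandardScaler()),
        ("pca", PCA(n_components=20)),  # 64 pixel features -> 20 components
        ("ann", MLPClassifier(hidden_layer_sizes=(32,), max_iter=500,
                              random_state=0))])
    model.fit(X_tr, y_tr)
    print("held-out accuracy:", model.score(X_te, y_te))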

