A recurrent neural network framework for flexible and adaptive decision making based on sequence learning

The brain makes flexible and adaptive responses in a complicated and ever-changing environment for an organism’s survival. To achieve this, the brain needs to understand the contingencies between its sensory inputs, actions, and rewards. This is analogous to the statistical inference that has been extensively studied in the natural language processing field, where recent developments of recurrent neural networks have found many successes. We wonder whether these neural networks, the gated recurrent unit (GRU) networks in particular, reflect how the brain solves the contingency problem. Therefore, we build a GRU network framework inspired by the statistical learning approach of NLP and test it with four exemplar behavior tasks previously used in empirical studies. The network models are trained to predict future events based on past events, both comprising sensory, action, and reward events. We show the networks can successfully reproduce animal and human behavior. The networks generalize the training, perform Bayesian inference in novel conditions, and adapt their choices when event contingencies vary. Importantly, units in the network encode task variables and exhibit activity patterns that match previous neurophysiology findings. Our results suggest that the neural network approach based on statistical sequence learning may reflect the brain’s computational principle underlying flexible and adaptive behaviors and serve as a useful approach to understand the brain.
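
The abstract gives no architectural detail, so the following is only a minimal sketch of one standard GRU update applied to a one-hot coded sequence of events. The event vocabulary, layer sizes, random weights, and the helper names `init_params` and `gru_step` are assumptions made for illustration and are not the authors' model; training and the next-event readout are omitted.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def init_params(n_in, n_hid, seed=0, scale=0.1):
    # Hypothetical initialisation: small uniform weights, zero biases.
    rng = random.Random(seed)
    def mat(rows, cols):
        return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]
    p = {}
    for gate in ("z", "r", "h"):
        p["W" + gate] = mat(n_hid, n_in)   # input weights
        p["U" + gate] = mat(n_hid, n_hid)  # recurrent weights
        p["b" + gate] = [0.0] * n_hid      # biases
    return p

def gru_step(x, h, p):
    # Update gate z and reset gate r, both in (0, 1).
    z = [sigmoid(a + b + c) for a, b, c in zip(matvec(p["Wz"], x), matvec(p["Uz"], h), p["bz"])]
    r = [sigmoid(a + b + c) for a, b, c in zip(matvec(p["Wr"], x), matvec(p["Ur"], h), p["br"])]
    # Candidate state computed from the input and the reset-gated previous state.
    rh = [ri * hi for ri, hi in zip(r, h)]
    cand = [math.tanh(a + b + c) for a, b, c in zip(matvec(p["Wh"], x), matvec(p["Uh"], rh), p["bh"])]
    # Convention used here: z = 1 fully replaces the old state with the candidate.
    return [(1.0 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]

# Assumed event vocabulary: each time step is one sensory, action, or reward event.
EVENTS = ["stimulus", "action", "reward"]

def one_hot(name):
    return [1.0 if e == name else 0.0 for e in EVENTS]

params = init_params(n_in=len(EVENTS), n_hid=4)
state = [0.0] * 4
for event in ["stimulus", "action", "reward"]:
    state = gru_step(one_hot(event), state, params)
print(state)  # hidden state summarising the sequence seen so far
```

In a full model, a readout layer on top of `state` would be trained to predict the next event; that part is left out here because the abstract does not specify it.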

Material Demands for Optical Neural Networks

MRS Bulletin ◽

10.1557/s0883769400064654 ◽

1988 ◽

Vol 13 (8) ◽

pp. 30-35 ◽

Cited By ~ 5

Author(s):

Dana Z. Anderson

Keyword(s):

Neural Network ◽

Neural Networks ◽

Speech Processing ◽

Visual Processing ◽

Network Models ◽

Human Memory ◽

Information Storage ◽

Physical Damage ◽

Neural Network Models ◽

The Brain

From the time of their conception, holography and holograms have evolved as a metaphor for human memory. Holograms can be made so that the information they contain is distributed throughout the holographic medium; destroy part of the hologram and the stored information remains wholly intact, except for a loss of detail. In this property holograms evidently have something in common with human memory, which is to some extent resilient against physical damage to the brain. There is much more to the metaphor than simply that information is stored in a distributed manner. Research in the optics community is now looking to holography, in particular dynamic holography, not only for information storage, but for information processing as well. The ideas are based upon neural network models. Neural networks are models for processing that are inspired by the apparent architecture of the brain. This is a processing paradigm that is new to optics. From within this network paradigm we look to build machines that can store and recall information associatively, play back a chain of recorded events, undergo learning and possibly forgetting, make decisions, adapt to a particular environment, and self-organize to evolve some desirable behavior. We hope that neural network models will give rise to optical machines for memory, speech processing, visual processing, language acquisition, motor control, and so on.

Opening the black box of neural networks: methods for interpreting neural network models in clinical applications

Annals of Translational Medicine ◽

10.21037/atm.2018.05.32 ◽

2018 ◽

Vol 6 (11) ◽

pp. 216-216 ◽

Cited By ~ 45

Author(s):

Zhongheng Zhang ◽

Marcus W. Beck ◽

David A. Winkler ◽

Bin Huang ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Models ◽

Black Box ◽

Clinical Applications ◽

Neural Network Models

Generalization of Patterns by Identification with Polynomial Neural Network

Journal of Electrical Engineering ◽

10.2478/v10187-010-0017-4 ◽

2010 ◽

Vol 61 (2) ◽

pp. 120-124 ◽

Cited By ~ 3

Author(s):

Ladislav Zjavka

Keyword(s):

Neural Network ◽

Neural Networks ◽

Artificial Neural Networks ◽

Functional Dependence ◽

Polynomial Neural Network ◽

New Type ◽

Input Variables ◽

The Brain ◽

Functional Output ◽

Simplified Form

Artificial neural networks (ANN) in general classify patterns according to their relationship: they respond to related patterns with a similar output. Polynomial neural networks (PNN) are capable of organizing themselves in response to some features (relations) of the data. The polynomial neural network for dependence of variables identification (D-PNN) describes a functional dependence of input variables (not entire patterns). It approximates a hyper-surface of this function with multi-parametric particular polynomials, forming its functional output as a generalization of input patterns. This new type of neural network is based on the GMDH polynomial neural network and was designed by the author. D-PNN operates in a way closer to brain learning than the ANN does. The ANN is in principle a simplified form of the PNN in which the combinations of input variables are missing.

Estimation of Annual Average Daily Traffic on Low-Volume Roads: Factor Approach Versus Neural Networks

Transportation Research Record Journal of the Transportation Research Board ◽

10.3141/1719-13 ◽

2000 ◽

Vol 1719 (1) ◽

pp. 103-111 ◽

Cited By ~ 17

Author(s):

Satish C. Sharma ◽

Pawan Lingras ◽

Guo X. Liu ◽

Fei Xu

Keyword(s):

Neural Network ◽

Neural Networks ◽

Traffic Monitoring ◽

Network Approach ◽

Annual Average ◽

Neural Network Approach ◽

The Neural Network ◽

Low Volume Roads ◽

Annual Average Daily Traffic ◽

Low Volume

Estimation of the annual average daily traffic (AADT) for low-volume roads is investigated. Artificial neural networks are compared with the traditional factor approach for estimating AADT from short-period traffic counts. Fifty-five automatic traffic recorder (ATR) sites located on low-volume rural roads in Alberta, Canada, are used as study samples. The results of this study indicate that, when a single 48-h count is used for AADT estimation, the factor approach can yield better results than the neural networks if the ATR sites are grouped appropriately and the sample sites are correctly assigned to various ATR groups. Unfortunately, the current recommended practice offers little guidance on how to achieve the assignment accuracy that may be necessary to obtain reliable AADT estimates from a single 48-h count. The neural network approach can be particularly suitable for estimating AADT from two 48-h counts taken at different times during the counting season. In fact, the 95th percentile error values of about 25 percent as obtained in this study for the neural network models compare favorably with the values reported in the literature for low-volume roads using the traditional factor approach. The advantage of the neural network approach is that classification of ATR sites and sample site assignments to ATR groups are not required. The analysis of various groups of low-volume roads presented also leads to a conclusion that, when defining low-volume roads from a traffic monitoring point of view, it is not likely to matter much whether the AADT on the facility is less than 500 vehicles, less than 750 vehicles, or less than 1,000 vehicles.
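
As a rough illustration of the factor approach mentioned above, the sketch below scales a single 48-h count by an expansion factor taken from the ATR group to which the site is assigned. The formula (mean daily volume from the short count multiplied by the ratio of the group's AADT to its average daily traffic in the counting period) is a common textbook form and is an assumption here, since the abstract does not state the exact factors used. The function names and all numbers are hypothetical.

```python
def expansion_factor(group_aadt, group_period_adt):
    # Ratio of the ATR group's annual average to its average daily
    # traffic during the period in which the short count was taken.
    return group_aadt / group_period_adt

def estimate_aadt(count_48h, factor):
    # Convert the 48-h total to a mean daily volume, then scale it.
    mean_daily = count_48h / 2.0
    return mean_daily * factor

def percent_error(estimated, actual):
    # Absolute percentage error of an AADT estimate.
    return 100.0 * abs(estimated - actual) / actual

# Invented example: a summer count on a road assigned to a group
# whose summer traffic runs above its annual average.
factor = expansion_factor(group_aadt=600.0, group_period_adt=750.0)
estimate = estimate_aadt(count_48h=1000, factor=factor)
print(factor, estimate, percent_error(estimate, actual=500.0))
```

The estimate is only as good as the group assignment: if the site is placed in the wrong ATR group, the factor is wrong, which is the difficulty the abstract points to.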

Prediction of Emergency Department Hospital Admission Based on Natural Language Processing and Neural Networks

Methods of Information in Medicine ◽

10.3414/me17-01-0024 ◽

2017 ◽

Vol 56 (05) ◽

pp. 377-389 ◽

Cited By ~ 21

Author(s):

Xingyu Zhang ◽

Joyce Kim ◽

Rachel E. Patzer ◽

Stephen R. Pitts ◽

Aaron Patzer ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Emergency Department ◽

Logistic Regression ◽

Natural Language Processing ◽

Natural Language ◽

Hospital Admission ◽

Language Processing ◽

Predictive Accuracy ◽

Free Text

Summary. Objective: To describe and compare logistic regression and neural network modeling strategies to predict hospital admission or transfer following initial presentation to Emergency Department (ED) triage with and without the addition of natural language processing elements. Methods: Using data from the National Hospital Ambulatory Medical Care Survey (NHAMCS), a cross-sectional probability sample of United States EDs from 2012 and 2013 survey years, we developed several predictive models with the outcome being admission to the hospital or transfer vs. discharge home. We included patient characteristics immediately available after the patient has presented to the ED and undergone a triage process. We used this information to construct logistic regression (LR) and multilayer neural network models (MLNN) which included natural language processing (NLP) and principal component analysis from the patient’s reason for visit. Ten-fold cross validation was used to test the predictive capacity of each model, and the area under the receiver operating characteristic curve (AUC) was then calculated for each model. Results: Of the 47,200 ED visits from 642 hospitals, 6,335 (13.42%) resulted in hospital admission (or transfer). A total of 48 principal components were extracted by NLP from the reason for visit fields, which explained 75% of the overall variance for hospitalization. In the model including only structured variables, the AUC was 0.824 (95% CI 0.818-0.830) for logistic regression and 0.823 (95% CI 0.817-0.829) for MLNN. Models including only free-text information generated AUC of 0.742 (95% CI 0.731-0.753) for logistic regression and 0.753 (95% CI 0.742-0.764) for MLNN. When both structured variables and free text variables were included, the AUC reached 0.846 (95% CI 0.839-0.853) for logistic regression and 0.844 (95% CI 0.836-0.852) for MLNN. Conclusions: The predictive accuracy of hospital admission or transfer for patients who presented to ED triage overall was good, and was improved with the inclusion of free text data from a patient’s reason for visit regardless of modeling approach. Natural language processing and neural networks that incorporate patient-reported outcome free text may increase predictive accuracy for hospital admission.
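
The models above are compared by AUC. As a minimal sketch of what that number measures, the function below computes it as the probability that a randomly chosen admitted visit receives a higher predicted score than a randomly chosen discharged visit, counting ties as one half. The function name and the toy labels and scores are assumptions for illustration; they are not NHAMCS data, and the study's cross-validation and confidence intervals are not reproduced.

```python
def auc(labels, scores):
    # labels: 1 = admitted or transferred, 0 = discharged home.
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive case ranked above negative case
            elif p == n:
                wins += 0.5   # a tie counts as half
    return wins / (len(pos) * len(neg))

# Toy example: four visits with predicted admission probabilities.
print(auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))  # → 0.75
```

The pairwise loop is quadratic in the number of visits; for a sample of this study's size a rank-based formula or a library routine would be the practical choice.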

Statistical and Artificial Neural Networks Models for Electricity Consumption Forecasting in the Brazilian Industrial Sector

Energies ◽

10.3390/en15020588 ◽

2022 ◽

Vol 15 (2) ◽

pp. 588

Author(s):

Felipe Leite Coelho da Silva ◽

Kleyton da Costa ◽

Paulo Canas Rodrigues ◽

Rodrigo Salas ◽

Javier Linkolk López-Gonzales

Keyword(s):

Neural Network ◽

Neural Networks ◽

Artificial Neural Networks ◽

Statistical Approach ◽

Electricity Consumption ◽

Industrial Sector ◽

Neural Network Approach ◽

Artificial Neural ◽

The One ◽

Mlp Model

Forecasting the industry’s electricity consumption is essential for energy planning in a given country or region. Thus, this study aims to apply time-series forecasting models (statistical approach and artificial neural network approach) to the industrial electricity consumption in the Brazilian system. For the statistical approach, the Holt–Winters, SARIMA, Dynamic Linear Model, and TBATS (Trigonometric Box–Cox transform, ARMA errors, Trend, and Seasonal components) models were considered. For the approach of artificial neural networks, the NNAR (neural network autoregression) and MLP (multilayer perceptron) models were considered. The results indicate that the MLP model was the one that obtained the best forecasting performance for the electricity consumption of the Brazilian industry under analysis.

Robotic grasp detection using a novel two-stage approach

ASP Transactions on Internet of Things ◽

10.52810/tiot.2021.100031 ◽

2021 ◽

Vol 1 (1) ◽

pp. 19-29

Author(s):

Zhe Chu ◽

Mengkai Hu ◽

Xiangyu Chen

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Models ◽

Particle Swarm Optimizer ◽

Neural Network Models ◽

Two Stage ◽

The Neural Network ◽

End To End ◽

Small Change ◽

Robotic Grasp

Recently, deep learning has been successfully applied to robotic grasp detection. Based on convolutional neural networks (CNNs), there have been many end-to-end detection approaches. However, end-to-end approaches place strict requirements on the dataset used for training the neural network models, and these requirements are hard to meet in practical use. Therefore, we proposed a two-stage approach using a particle swarm optimizer (PSO) candidate estimator and a CNN to detect the most likely grasp. Our approach achieved an accuracy of 92.8% on the Cornell Grasp Dataset, which places it among the best existing approaches, and it is able to run at real-time speeds. After a small change to the approach, we can predict multiple grasps per object at the same time so that an object can be grasped in a variety of ways.
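
The abstract does not describe how its PSO candidate estimator is set up, so the following is only a generic particle swarm optimizer minimizing a toy objective, meant to show the kind of search the first stage relies on. The function name `pso_minimize`, the inertia and acceleration coefficients, the swarm size, and the sphere objective are all assumptions; in the paper's setting the objective would instead score candidate grasps.

```python
import random

def pso_minimize(f, dim, bounds, n_particles=20, n_iters=100,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    rng = random.Random(seed)
    lo, hi = bounds
    # Random initial positions inside the bounds, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]          # best position seen by each particle
    pbest_val = [f(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]  # best position seen by the swarm
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia plus pulls toward the personal and global bests.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Move and clamp to the search bounds.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = f(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val

def sphere(p):
    # Toy objective with its minimum of 0 at the origin.
    return sum(x * x for x in p)

best, best_val = pso_minimize(sphere, dim=2, bounds=(-5.0, 5.0))
print(best, best_val)
```

In a two-stage pipeline of the kind described, the positions returned by such a search would serve as candidates that a CNN then scores; that second stage is not sketched here.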

Part-of-Speech Tagging via Deep Neural Networks for Northern-Ethiopic Languages

Information Technology And Control ◽

10.5755/j01.itc.49.4.26808 ◽

2020 ◽

Vol 49 (4) ◽

pp. 482-494

Author(s):

Jurgita Kapočiūtė-Dzikienė ◽

Senait Gebremichael Tesfagergish

Keyword(s):

Neural Network ◽

Neural Networks ◽

Language Processing ◽

Deep Neural Networks ◽

Short Term Memory ◽

Parameter Tuning ◽

Feed Forward Neural Network ◽

Pos Tagging ◽

Part Of Speech ◽

Pos Tagger

Deep Neural Networks (DNNs) have proven to be especially successful in the area of Natural Language Processing (NLP) and Part-Of-Speech (POS) tagging, which is the process of mapping words to their corresponding POS labels depending on the context. Despite recent development of language technologies, low-resourced languages (such as the East African language Tigrinya) have received too little attention. We investigate the effectiveness of Deep Learning (DL) solutions for the low-resourced Tigrinya language of the Northern-Ethiopic branch. We have selected Tigrinya as the testbed example and have tested state-of-the-art DL approaches seeking to build the most accurate POS tagger. We have evaluated DNN classifiers (Feed Forward Neural Network – FFNN, Long Short-Term Memory method – LSTM, Bidirectional LSTM, and Convolutional Neural Network – CNN) on top of neural word2vec word embeddings with a small training corpus known as the Nagaoka Tigrinya Corpus. To determine the best DNN classifier type, architecture, and hyper-parameter set, both manual and automatic hyper-parameter tuning were performed. The BiLSTM method proved to be the most suitable for our task: it achieved the highest accuracy, 92%, which is 65% above the random baseline.

Language Semantics Interpretation with an Interaction-Based Recurrent Neural Network

Machine Learning and Knowledge Extraction ◽

10.3390/make3040046 ◽

2021 ◽

Vol 3 (4) ◽

pp. 922-945

Author(s):

Shaw-Hwa Lo ◽

Yiqiao Yin

Keyword(s):

Neural Network ◽

Neural Networks ◽

Language Processing ◽

Text Classification ◽

Search Algorithm ◽

Greedy Search ◽

Text Documents ◽

Engineering Technique ◽

Language Semantics ◽

Sequential Models

Text classification is a fundamental language task in Natural Language Processing. A variety of sequential models are capable of making good predictions, yet there is a lack of connection between language semantics and prediction results. This paper proposes a novel influence score (I-score), a greedy search algorithm called the Backward Dropping Algorithm (BDA), and a novel feature engineering technique called the “dagger technique”. First, the paper proposes to use the I-score to detect and search for the important language semantics in text documents that are useful for making good predictions in text classification tasks. Next, the Backward Dropping Algorithm is proposed to handle long-term dependencies in the dataset. Moreover, the “dagger technique” is proposed as a way to fully preserve the relationship between the explanatory variable and the response variable. The proposed techniques can be further generalized to feed-forward Artificial Neural Networks (ANNs), Convolutional Neural Networks (CNNs), and any other neural network. A real-world application on the Internet Movie Database (IMDB) is used, and the proposed methods improve prediction performance with an 81% error reduction compared with popular peer methods that do not implement the I-score and the “dagger technique”.

The relational processing limits of classic and contemporary neural network models of language processing

10.32470/ccn.2019.1022-0 ◽

2019 ◽

Author(s):

Guillermo Puebla ◽

Andrea Martin ◽

Leonidas Doumas

Keyword(s):

Neural Network ◽

Language Processing ◽

Network Models ◽

Relational Processing ◽

Neural Network Models
