Reward-Modulated Hebbian Learning of Decision Making

We introduce a framework for decision making in which the learning of decision making is reduced to its simplest and biologically most plausible form: Hebbian learning on a linear neuron. We cast our Bayesian-Hebb learning rule as reinforcement learning in which certain decisions are rewarded and prove that each synaptic weight will on average converge exponentially fast to the log-odd of receiving a reward when its pre- and postsynaptic neurons are active. In our simple architecture, a particular action is selected from the set of candidate actions by a winner-take-all operation. The global reward assigned to this action then modulates the update of each synapse. Apart from this global reward signal, our reward-modulated Bayesian Hebb rule is a pure Hebb update that depends only on the coactivation of the pre- and postsynaptic neurons, not on the weighted sum of all presynaptic inputs to the postsynaptic neuron as in the perceptron learning rule or the Rescorla-Wagner rule. This simple approach to action-selection learning requires that information about sensory inputs be presented to the Bayesian decision stage in a suitably preprocessed form resulting from other adaptive processes (acting on a larger timescale) that detect salient dependencies among input features. Hence our proposed framework for fast learning of decisions also provides interesting new hypotheses regarding neural nodes and computational goals of cortical areas that provide input to the final decision stage.

Download Full-text

Confidence-Controlled Hebbian Learning Efficiently Extracts Category Membership From Stimuli Encoded in View of a Categorization Task

Neural Computation ◽

10.1162/neco_a_01452 ◽

2021 ◽

pp. 1-33

Author(s):

Kevin Berlemont ◽

Jean-Pierre Nadal

Keyword(s):

Decision Making ◽

Hebbian Learning ◽

Learning Rule ◽

Descent Method ◽

Category Membership ◽

Gradient Descent Method ◽

Perceptual Decision Making ◽

Categorization Task ◽

Attractor Network ◽

Categorical Information

Abstract In experiments on perceptual decision making, individuals learn a categorization task through trial-and-error protocols. We explore the capacity of a decision-making attractor network to learn a categorization task through reward-based, Hebbian-type modifications of the weights incoming from the stimulus encoding layer. For the latter, we assume a standard layer of a large number of stimu lus-specific neurons. Within the general framework of Hebbian learning, we have hypothesized that the learning rate is modulated by the reward at each trial. Surprisingly, we find that when the coding layer has been optimized in view of the categorization task, such reward-modulated Hebbian learning (RMHL) fails to extract efficiently the category membership. In previous work, we showed that the attractor neural networks' nonlinear dynamics accounts for behavioral confidence in sequences of decision trials. Taking advantage of these findings, we propose that learning is controlled by confidence, as computed from the neural activity of the decision-making attractor network. Here we show that this confidence-controlled, reward-based Hebbian learning efficiently extracts categorical information from the optimized coding layer. The proposed learning rule is local and, in contrast to RMHL, does not require storing the average rewards obtained on previous trials. In addition, we find that the confidence-controlled learning rule achieves near-optimal performance. In accordance with this result, we show that the learning rule approximates a gradient descent method on a maximizing reward cost function.

Download Full-text

Stable thalamocortical learning between medial-dorsal thalamus and cortical attractor networks captures cognitive flexibility

10.1101/2021.01.15.426814 ◽

2021 ◽

Author(s):

Siwei Qiu

Keyword(s):

Decision Making ◽

Prefrontal Cortex ◽

Distributed Computing ◽

Neural Circuit ◽

Hebbian Learning ◽

Learning Rule ◽

Computing System ◽

Synaptic Scaling ◽

Dorsal Thalamus ◽

Medial Dorsal

AbstractPrimates and rodents are able to continually acquire, adapt, and transfer knowledge and skill, and lead to goal-directed behavior during their lifespan. For the case when context switches slowly, animals learn via slow processes. For the case when context switches rapidly, animals learn via fast processes. We build a biologically realistic model with modules similar to a distributed computing system. Specifically, we are emphasizing the role of thalamocortical learning on a slow time scale between the prefrontal cortex (PFC) and medial dorsal thalamus (MD). Previous work [1] has already shown experimental evidence supporting classification of cell ensembles in the medial dorsal thalamus, where each class encodes a different context. However, the mechanism by which such classification is learned is not clear. In this work, we show that such learning can be self-organizing in the manner of an automaton (a distributed computing system), via a combination of Hebbian learning and homeostatic synaptic scaling. We show that in the simple case of two contexts, the network with hierarchical structure can do context-based decision making and smooth switching between different contexts. Our learning rule creates synaptic competition [2] between the thalamic cells to create winner-take-all activity. Our theory shows that the capacity of such a learning process depends on the total number of task-related hidden variables, and such a capacity is limited by system size N. We also theoretically derived the effective functional connectivity as a function of an order parameter dependent on the thalamo-cortical coupling structure.Significance StatementAnimals need to adapt to dynamically changing environments and make decisions based on changing contexts. Here we propose a combination of neural circuit structure with learning mechanisms to account for such behaviors. Specifically, we built a reservoir computing network improved by a Hebbian learning rule together with a synaptic scaling learning mechanism between the prefrontal cortex and the medial-dorsal (MD) thalamus. This model shows that MD thalamus is crucial in such context-based decision making. I also make use of dynamical mean field theory to predict the effective neural circuit. Furthermore, theoretical analysis provides a prediction that the capacity of such a network increases with the network size and the total number of tasks-related latent variables.

Download Full-text

EFFECTS OF DILATION AND TRANSLATION ON A PERCEPTRON-TYPE LEARNING RULE FOR HIGHER ORDER HOPFIELD NEURAL NETWORKS

International Journal of Neural Systems ◽

10.1142/s0129065702001072 ◽

2002 ◽

Vol 12 (02) ◽

pp. 83-93 ◽

Cited By ~ 1

Author(s):

BURKHARD LENZE ◽

JÖRG RADDATZ

Keyword(s):

Neural Networks ◽

Hebbian Learning ◽

Recall Performance ◽

Learning Rule ◽

Higher Order ◽

Hopfield Neural Networks ◽

Pattern Recognition Problem ◽

Random Patterns ◽

Highly Correlated ◽

Perceptron Learning

In this paper, we will take a further look at a generalized perceptron-like learning rule which uses dilation and translation parameters in order to enhance the recall performance of higher order Hopfield neural networks without significantly increasing their complexity. We will practically study the influence of these parameters on the perceptron learning and recall process, using a generalized version of the Hebbian learning rule for initialization. Our analysis will be based on a pattern recognition problem with random patterns. We will see that in case of a highly correlated set of patterns, there can be gained some improvements concerning the learning and recall performance. On the other hand, we will show that the dilation and translation parameters have to be chosen carefully for a positive result.

Download Full-text

Confidence-Controlled Hebbian Learning Efficiently Extracts Category Membership from Stimuli Encoded in View of a Categorization Task

10.1101/2020.08.06.239533 ◽

2020 ◽

Author(s):

Kevin Berlemont ◽

Jean-Pierre Nadal

Keyword(s):

Decision Making ◽

Hebbian Learning ◽

Learning Rule ◽

Category Membership ◽

Trial And Error ◽

Perceptual Decision Making ◽

Categorization Task ◽

Attractor Network ◽

Categorical Information ◽

Average Rewards

AbstractIn experiments on perceptual decision-making, individuals learn a categorization task through trial-and-error protocols. We explore the capacity of a decision-making attractor network to learn a categorization task through reward-based, Hebbian type, modifications of the weights incoming from the stimulus encoding layer. For the latter, we assume a standard layer of a large number of stimulus specific neurons. Within the general framework of Hebbian learning, authors have hypothesized that the learning rate is modulated by the reward at each trial. Surprisingly, we find that, when the coding layer has been optimized in view of the categorization task, such reward-modulated Hebbian learning (RMHL) fails to extract efficiently the category membership. In a previous work we showed that the attractor neural networks nonlinear dynamics accounts for behavioral confidence in sequences of decision trials. Taking advantage of these findings, we propose that learning is controlled by confidence, as computed from the neural activity of the decision-making attractor network. Here we show that this confidence-controlled, reward-based, Hebbian learning efficiently extracts categorical information from the optimized coding layer. The proposed learning rule is local, and, in contrast to RMHL, does not require to store the average rewards obtained on previous trials. In addition, we find that the confidence-controlled learning rule achieves near optimal performance.

Download Full-text

LPCD framework: Analytical tool or psychological model?

Behavioral and Brain Sciences ◽

10.1017/s0140525x18001383 ◽

2018 ◽

Vol 41 ◽

Author(s):

David Danks

Keyword(s):

Decision Making ◽

Analytical Tool ◽

Mathematical Framework ◽

Bayesian Decision ◽

Perceptual Decision Making ◽

Psychological Model ◽

Psychological Reality ◽

Psychological Theories ◽

Target Article ◽

Bayesian Decision Making

AbstractThe target article uses a mathematical framework derived from Bayesian decision making to demonstrate suboptimal decision making but then attributes psychological reality to the framework components. Rahnev & Denison's (R&D) positive proposal thus risks ignoring plausible psychological theories that could implement complex perceptual decision making. We must be careful not to slide from success with an analytical tool to the reality of the tool components.

Download Full-text

How multidisciplinary are multidisciplinary case reviews in cancer care? Feasibility analysis of a theory-driven team decision-making fidelity framework

10.31234/osf.io/3xwnd ◽

2019 ◽

Author(s):

Tayana Soukup ◽

Ged Murtagh ◽

Ben W Lamb ◽

James Green ◽

Nick Sevdalis

Keyword(s):

Decision Making ◽

Cancer Care ◽

Recent Decade ◽

Individual Case ◽

Multidisciplinary Teams ◽

Final Decision ◽

Political Climate ◽

Implementation Framework ◽

Feasibility Evaluation ◽

The Uk

Background Multidisciplinary teams (MDTs) are a standard cancer care policy in many countries worldwide. Despite an increase in research in a recent decade on MDTs and their care planning meetings, the implementation of MDT-driven decision-making (fidelity) remains unstudied. We report a feasibility evaluation of a novel method for assessing cancer MDT decision-making fidelity. We used an observational protocol to assess (1) the degree to which MDTs adhere to the stages of group decision-making as per the ‘Orientation-Discussion-Decision-Implementation’ framework, and (2) the degree of multidisciplinarity underpinning individual case reviews in the meetings. MethodsThis is a prospective observational study. Breast, colorectal and gynaecological cancer MDTs in the Greater London and Derbyshire (United Kingdom) areas were video recorded over 12-weekly meetings encompassing 822 case reviews. Data were coded and analysed using frequency counts.Results Eight interaction formats during case reviews were identified. case reviews were not always multi-disciplinary: only 8% of overall reviews involved all five clinical disciplines present, and 38% included four of five. The majority of case reviews (i.e. 54%) took place between two (25%) or three (29%) disciplines only. Surgeons (83%) and oncologists (8%) most consistently engaged in all stages of decision-making. While all patients put forward for MDT review were actually reviewed, a small percentage of them (4%) either bypassed the orientation (case presentation) and went straight into discussing the patient, or they did not articulate the final decision to the entire team (8%). Conclusions Assessing fidelity of MDT decision-making at the point of their weekly meetings is feasible. We found that despite being a set policy, case reviews are not entirely MDT-driven. We discuss implications in relation to the current eco-political climate, and the quality and safety of care. Our findings are in line with the current national initiatives in the UK on streamlining MDT meetings, and could help decide how to re-organise them to be most efficient.

Download Full-text

Bayesian Decision-Making Based Recommendation Trust Revision Model in Ad Hoc Networks

Journal of Software ◽

10.3724/sp.j.1001.2009.00579 ◽

2009 ◽

Vol 20 (9) ◽

pp. 2574-2586 ◽

Cited By ~ 12

Author(s):

Yu-Xing SUN ◽

Song-Hua HUANG ◽

Li-Jun CHEN ◽

Li XIE

Keyword(s):

Decision Making ◽

Ad Hoc Networks ◽

Ad Hoc ◽

Bayesian Decision ◽

Hoc Networks ◽

Bayesian Decision Making

Download Full-text

Mate-Choice Copying as Bayesian Decision Making

The American Naturalist ◽

10.2307/3473415 ◽

2005 ◽

Vol 165 (3) ◽

pp. 403

Author(s):

Uehara ◽

Yokomizo ◽

Iwasa

Keyword(s):

Decision Making ◽

Mate Choice ◽

Bayesian Decision ◽

Mate Choice Copying ◽

Bayesian Decision Making

Download Full-text

What’s in a (Change of) Name? Much—but Not That Much—and Not What Wiebe Claims

Method & Theory in the Study of Religion ◽

10.1163/15700682-12341478 ◽

2020 ◽

Vol 32 (2) ◽

pp. 159-184 ◽

Cited By ~ 1

Author(s):

Satoko Fujiwara ◽

Tim Jensen

Keyword(s):

Decision Making ◽

Executive Committee ◽

International Association ◽

International Committee ◽

Academic Standards ◽

General Assembly ◽

Final Decision ◽

Points Of View ◽

The Difference ◽

Consultative Body

Abstract Donald Wiebe claims that the IAHR leadership (already before an Extended Executive Committee (EEC) meeting in Delphi) had decided to water down the academic standards of the IAHR with a proposal to change its name to “International Association for the Study of Religions.” His criticism, we argue, is based on a series of misunderstandings as regards: 1) the difference between the consultative body (EEC) and the decision-making body (EC), 2) the difference between the preliminary points of view of individuals and final proposals by the EC, 3) personal conversations, 4) the link between the proposal to change the name and the wish to tighten up the academic profile of the IAHR. Moreover, if the final decision-making bodies, the International Committee and the General Assembly, adopt the proposal, the new name as little as the old can make the IAHR more or less scientific. Tightening up the academic, scientific profile of the IAHR takes more than a change of name.

Download Full-text

Quantum Bayesian Decision-Making

Foundations of Science ◽

10.1007/s10699-021-09781-6 ◽

2021 ◽

Author(s):

Michael de Oliveira ◽

Luis Soares Barbosa

Keyword(s):

Decision Making ◽

Bayesian Decision ◽

Bayesian Decision Making

Download Full-text