Integration of reinforcement learning and optimal decision-making theories of the basal ganglia.

Bogacz R

Larsen T

Scientific Abstract

This article seeks to integrate two sets of theories describing action selection in the basal ganglia: reinforcement learning theories, which describe learning which actions to select to maximize reward, and decision-making theories, which propose that the basal ganglia select actions on the basis of sensory evidence accumulated in the cortex. In particular, we present a model that integrates the actor-critic model of reinforcement learning and a model assuming that the cortico-basal-ganglia circuit implements a statistically optimal decision-making procedure. The values of cortico-striatal weights required for optimal decision making in our model differ from those provided by standard reinforcement learning models. Nevertheless, we show that an actor-critic model converges to the weights required for optimal decision making when biologically realistic limits on synaptic weights are introduced. We also describe the model's predictions concerning reaction times and neural responses during learning, and we discuss directions for further integration of reinforcement learning and optimal decision-making theories.

Citation

2011. Neural Comput, 23(4):817-51.

Similar content

Normative Networks for Source Separation via Local Plasticity and Dendritic Computation

On the Infinite Width and Depth Limits of Predictive Coding Networks

Understanding Sample Efficiency in Predictive Coding

Dithering suppresses half-harmonic neural synchronisation to photic stimulation in humans.
