A learning rule that explains how rewards teach attention

Jaldert O. Rombouts; Sander M. Bohte; Julio Martinez-Trujillo; Pieter R. Roelfsema

doi:https://doi.org/10.1080/13506285.2015.1010462

Research output: Contribution to journal › Article › Academic › peer-review

17 Citations (Scopus)

Abstract

Many theories propose that top-down attentional signals control processing in sensory cortices by modulating neural activity. But who controls the controller? Here we investigate how a biologically plausible neural reinforcement learning scheme can create higher order representations and top-down attentional signals. The learning scheme trains neural networks using two factors that gate Hebbian plasticity: (1) an attentional feedback signal from the response-selection stage to earlier processing levels; and (2) a globally available neuromodulator that encodes the reward prediction error. We demonstrate how the neural network learns to direct attention to one of two coloured stimuli that are arranged in a rank-order. Like monkeys trained on this task, the network develops units that are tuned to the rank-order of the colours and it generalizes this newly learned rule to previously unseen colour combinations. These results provide new insight into how individuals can learn to control attention as a function of reward contingency.
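The two gating factors named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `gated_hebbian_update`, the purely multiplicative form of the update, the learning rate `lr`, and the array shapes are all assumptions made here for illustration only.

```python
import numpy as np


def gated_hebbian_update(weights, pre, feedback, rpe, lr=0.1):
    """Return updated weights for one layer (illustrative sketch only).

    weights  : (n_post, n_pre) array of synaptic weights
    pre      : (n_pre,) presynaptic activity
    feedback : (n_post,) attentional feedback reaching each postsynaptic unit
    rpe      : scalar reward prediction error (the global neuromodulator)
    lr       : learning rate (hypothetical parameter)
    """
    # Factor 1: pair presynaptic activity with the feedback each
    # postsynaptic unit receives from the response-selection stage.
    tagged = np.outer(feedback, pre)
    # Factor 2: the globally broadcast prediction error scales every
    # tagged synapse by the same amount, and sets the sign of the change.
    return weights + lr * rpe * tagged


# Usage: only synapses onto the unit that received feedback change.
w = np.zeros((2, 3))
pre = np.array([1.0, 0.0, 1.0])
feedback = np.array([1.0, 0.0])
w_new = gated_hebbian_update(w, pre, feedback, rpe=0.5)
```

In this sketch a zero prediction error leaves the weights unchanged, and units that receive no feedback are never updated, which is one simple way to read "two factors that gate Hebbian plasticity".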

Original language	English
Pages (from-to)	179-205
Journal	Visual Cognition
Volume	23
Issue number	1-2
DOIs	https://doi.org/10.1080/13506285.2015.1010462
Publication status	Published - 2015

Cite this

@article{3daf3f8db8e8483fbc3c5e61ec7721f5,

title = "A learning rule that explains how rewards teach attention",

abstract = "Many theories propose that top-down attentional signals control processing in sensory cortices by modulating neural activity. But who controls the controller? Here we investigate how a biologically plausible neural reinforcement learning scheme can create higher order representations and top-down attentional signals. The learning scheme trains neural networks using two factors that gate Hebbian plasticity: (1) an attentional feedback signal from the response-selection stage to earlier processing levels; and (2) a globally available neuromodulator that encodes the reward prediction error. We demonstrate how the neural network learns to direct attention to one of two coloured stimuli that are arranged in a rank-order. Like monkeys trained on this task, the network develops units that are tuned to the rank-order of the colours and it generalizes this newly learned rule to previously unseen colour combinations. These results provide new insight into how individuals can learn to control attention as a function of reward contingency.",

author = "Rombouts, {Jaldert O.} and Bohte, {Sander M.} and Martinez-Trujillo, Julio and Roelfsema, {Pieter R.}",

year = "2015",

doi = "10.1080/13506285.2015.1010462",

language = "English",

volume = "23",

pages = "179--205",

journal = "Visual Cognition",

issn = "1350-6285",

publisher = "Psychology Press Ltd",

number = "1-2",

}

TY - JOUR

T1 - A learning rule that explains how rewards teach attention

AU - Rombouts, Jaldert O.

AU - Bohte, Sander M.

AU - Martinez-Trujillo, Julio

AU - Roelfsema, Pieter R.

PY - 2015

Y1 - 2015

AB - Many theories propose that top-down attentional signals control processing in sensory cortices by modulating neural activity. But who controls the controller? Here we investigate how a biologically plausible neural reinforcement learning scheme can create higher order representations and top-down attentional signals. The learning scheme trains neural networks using two factors that gate Hebbian plasticity: (1) an attentional feedback signal from the response-selection stage to earlier processing levels; and (2) a globally available neuromodulator that encodes the reward prediction error. We demonstrate how the neural network learns to direct attention to one of two coloured stimuli that are arranged in a rank-order. Like monkeys trained on this task, the network develops units that are tuned to the rank-order of the colours and it generalizes this newly learned rule to previously unseen colour combinations. These results provide new insight into how individuals can learn to control attention as a function of reward contingency.

U2 - https://doi.org/10.1080/13506285.2015.1010462

DO - 10.1080/13506285.2015.1010462

M3 - Article

SN - 1350-6285

VL - 23

SP - 179

EP - 205

JO - Visual Cognition

JF - Visual Cognition

IS - 1-2

ER -