Attention-gated brain propagation: How the brain can implement reward-based error backpropagation

Isabella Pozzi; Sander M. Bohté; Pieter R. Roelfsema

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

13 Citations (Scopus)

Abstract

Much recent work has focused on biologically plausible variants of supervised learning algorithms. However, there is no teacher in the motor cortex that instructs the motor neurons, and learning in the brain depends on reward and punishment. We demonstrate a biologically plausible reinforcement learning scheme for deep networks with an arbitrary number of layers. The network chooses an action by selecting a unit in the output layer and uses feedback connections to assign credit to the units in successively lower layers that are responsible for this action. After the choice, the network receives reinforcement and there is no teacher correcting the errors. We show how the new learning scheme – Attention-Gated Brain Propagation (BrainProp) – is mathematically equivalent to error backpropagation, for one output unit at a time. We demonstrate successful learning of deep fully connected, convolutional and locally connected networks on classical and hard image-classification benchmarks: MNIST, CIFAR10, CIFAR100 and Tiny ImageNet. BrainProp achieves an accuracy that is equivalent to that of standard error-backpropagation, and better than state-of-the-art biologically inspired learning schemes. Additionally, the trial-and-error nature of learning is associated with limited additional training time, so that BrainProp is a factor of 1 to 3.5 slower. Our results thereby provide new insights into how deep learning may be implemented in the brain.
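
As a rough illustration of the "one output unit at a time" idea described in the abstract, the NumPy sketch below computes weight updates for a small two-layer network in which only the selected output unit sends feedback to the hidden layer. It is not the authors' implementation. The tanh hidden layer, the 0/1 reward, the squared-error objective on the selected unit, and the names `forward` and `selected_unit_updates` are all assumptions made for this example.

```python
import numpy as np

def forward(x, W1, W2):
    """Two-layer network: tanh hidden layer, linear output values."""
    h = np.tanh(W1 @ x)
    q = W2 @ h
    return h, q

def selected_unit_updates(x, label, W1, W2, action):
    """Update directions driven only by the selected output unit.

    Assumed objective (not from the source): 0.5 * (reward - q[action])**2,
    with reward 1 for a correct choice and 0 otherwise.
    """
    h, q = forward(x, W1, W2)
    reward = 1.0 if action == label else 0.0
    delta = reward - q[action]              # reward prediction error

    # Output layer: only the row of the selected unit changes.
    dW2 = np.zeros_like(W2)
    dW2[action] = delta * h

    # Hidden layer: feedback arrives only from the selected output unit.
    feedback = W2[action] * (1.0 - h ** 2)  # tanh derivative
    dW1 = delta * np.outer(feedback, x)
    return dW1, dW2, delta
```

Under these assumptions, `dW1` and `dW2` are the negative gradients of the assumed objective, which is what error backpropagation would compute if the error were injected at that single output unit. A learning step would then add a small multiple of each to the corresponding weight matrix.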

Original language	English
Title of host publication	34th Conference on Neural Information Processing Systems, NeurIPS 2020
Publisher	Neural Information Processing Systems Foundation
Volume	2020-December
Publication status	Published - 2020
Event	34th Conference on Neural Information Processing Systems, NeurIPS 2020 - Virtual, Online Duration: 6 Dec 2020 → 12 Dec 2020

Publication series

Name	Advances in Neural Information Processing Systems

Conference

Conference	34th Conference on Neural Information Processing Systems, NeurIPS 2020
City	Virtual, Online
Period	6/12/2020 → 12/12/2020

Cite this

@inproceedings{0dcfa536a0bd40b8b8f5fd0986b21d73,

title = "Attention-gated brain propagation: How the brain can implement reward-based error backpropagation",

abstract = "Much recent work has focused on biologically plausible variants of supervised learning algorithms. However, there is no teacher in the motor cortex that instructs the motor neurons and learning in the brain depends on reward and punishment. We demonstrate a biologically plausible reinforcement learning scheme for deep networks with an arbitrary number of layers. The network chooses an action by selecting a unit in the output layer and uses feedback connections to assign credit to the units in successively lower layers that are responsible for this action. After the choice, the network receives reinforcement and there is no teacher correcting the errors. We show how the new learning scheme – Attention-Gated Brain Propagation (BrainProp) – is mathematically equivalent to error backpropagation, for one output unit at a time. We demonstrate successful learning of deep fully connected, convolutional and locally connected networks on classical and hard image-classification benchmarks; MNIST, CIFAR10, CIFAR100 and Tiny ImageNet. BrainProp achieves an accuracy that is equivalent to that of standard error-backpropagation, and better than state-of-the-art biologically inspired learning schemes. Additionally, the trial-and-error nature of learning is associated with limited additional training time so that BrainProp is a factor of 1-3.5 times slower. Our results thereby provide new insights into how deep learning may be implemented in the brain.",

author = "Pozzi, Isabella and Boht{\'e}, {Sander M.} and Roelfsema, {Pieter R.}",

year = "2020",

language = "English",

volume = "2020-December",

series = "Advances in Neural Information Processing Systems",

publisher = "Neural Information Processing Systems Foundation",

booktitle = "34th Conference on Neural Information Processing Systems, NeurIPS 2020",

note = "34th Conference on Neural Information Processing Systems, NeurIPS 2020 ; Conference date: 06-12-2020 Through 12-12-2020",

}

Pozzi, I, Bohté, SM & Roelfsema, PR 2020, Attention-gated brain propagation: How the brain can implement reward-based error backpropagation. in 34th Conference on Neural Information Processing Systems, NeurIPS 2020. vol. 2020-December, Advances in Neural Information Processing Systems, Neural Information Processing Systems Foundation, 34th Conference on Neural Information Processing Systems, NeurIPS 2020, Virtual, Online, 6/12/2020.

Attention-gated brain propagation: How the brain can implement reward-based error backpropagation. / Pozzi, Isabella; Bohté, Sander M.; Roelfsema, Pieter R.
34th Conference on Neural Information Processing Systems, NeurIPS 2020. Vol. 2020-December Neural Information Processing Systems Foundation, 2020. (Advances in Neural Information Processing Systems).

TY - GEN

T1 - Attention-gated brain propagation: How the brain can implement reward-based error backpropagation

AU - Pozzi, Isabella

AU - Bohté, Sander M.

AU - Roelfsema, Pieter R.

PY - 2020

Y1 - 2020

AB - Much recent work has focused on biologically plausible variants of supervised learning algorithms. However, there is no teacher in the motor cortex that instructs the motor neurons and learning in the brain depends on reward and punishment. We demonstrate a biologically plausible reinforcement learning scheme for deep networks with an arbitrary number of layers. The network chooses an action by selecting a unit in the output layer and uses feedback connections to assign credit to the units in successively lower layers that are responsible for this action. After the choice, the network receives reinforcement and there is no teacher correcting the errors. We show how the new learning scheme – Attention-Gated Brain Propagation (BrainProp) – is mathematically equivalent to error backpropagation, for one output unit at a time. We demonstrate successful learning of deep fully connected, convolutional and locally connected networks on classical and hard image-classification benchmarks; MNIST, CIFAR10, CIFAR100 and Tiny ImageNet. BrainProp achieves an accuracy that is equivalent to that of standard error-backpropagation, and better than state-of-the-art biologically inspired learning schemes. Additionally, the trial-and-error nature of learning is associated with limited additional training time so that BrainProp is a factor of 1-3.5 times slower. Our results thereby provide new insights into how deep learning may be implemented in the brain.

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85107721059&origin=inward

M3 - Conference contribution

VL - 2020-December

T3 - Advances in Neural Information Processing Systems

BT - 34th Conference on Neural Information Processing Systems, NeurIPS 2020

PB - Neural Information Processing Systems Foundation

T2 - 34th Conference on Neural Information Processing Systems, NeurIPS 2020

Y2 - 6 December 2020 through 12 December 2020

ER -
