Change search
ReferencesLink to record
Permanent link

Direct link
Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. Stockholm Univ, Sweden; Karolinska Inst, Sweden.
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. Karolinska Inst, Sweden.
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. Karolinska Inst, Sweden; Univ Edinburgh, Scotland.
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. Karolinska Inst, Sweden.
Show others and affiliations
2016 (English)In: Frontiers in Neural Circuits, ISSN 1662-5110, E-ISSN 1662-5110, Vol. 10, 53Article in journal (Refereed) PublishedText
Abstract [en]

The brain enables animals to behaviorally adapt in order to survive in a complex and dynamic environment, but how reward-oriented behaviors are achieved and computed by its underlying neural circuitry is an open question. To address this concern, we have developed a spiking model of the basal ganglia (BG) that learns to dis-inhibit the action leading to a reward despite ongoing changes in the reward schedule. The architecture of the network features the two pathways commonly described in BG, the direct (denoted D1) and the indirect (denoted D2) pathway, as well as a loop involving striatum and the dopaminergic system. The activity of these dopaminergic neurons conveys the reward prediction error (RPE), which determines the magnitude of synaptic plasticity within the different pathways. All plastic connections implement a versatile four-factor learning rule derived from Bayesian inference that depends upon pre- and post-synaptic activity, receptor type, and dopamine level. Synaptic weight updates occur in the D1 or D2 pathways depending on the sign of the RPE, and an efference copy informs upstream nuclei about the action selected. We demonstrate successful performance of the system in a multiple-choice learning task with a transiently changing reward schedule. We simulate lesioning of the various pathways and show that a condition without the D2 pathway fares worse than one without D1. Additionally, we simulate the degeneration observed in Parkinson's disease (PD) by decreasing the number of dopaminergic neurons during learning. The results suggest that the D1 pathway impairment in PD might have been overlooked. Furthermore, an analysis of the alterations in the synaptic weights shows that using the absolute reward value instead of the RPE leads to a larger change in D1.

Place, publisher, year, edition, pages
FRONTIERS MEDIA SA , 2016. Vol. 10, 53
Keyword [en]
basal ganglia, action selection, reinforcement learning, synaptic plasticity, dopamine, reward, prediction error, Parkinson's disease
National Category
Neurosciences
Identifiers
URN: urn:nbn:se:kth:diva-190552DOI: 10.3389/fncir.2016.00053ISI: 000380103000001PubMedID: 27493625OAI: oai:DiVA.org:kth-190552DiVA: diva2:952588
Note

QC 20160815

Available from: 2016-08-15 Created: 2016-08-12 Last updated: 2016-08-15Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textPubMed

Search in DiVA

By author/editor
Berthet, PierreLindahl, MikaelTully, Philip J.Hellgren-Kotaleski, JeanetteLansner, Anders
By organisation
Computational Biology, CB
In the same journal
Frontiers in Neural Circuits
Neurosciences

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 22 hits
ReferencesLink to record
Permanent link

Direct link