Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB.
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. ORCID iD: 0000-0002-0550-0739
KTH, School of Computer Science and Communication (CSC), Computational Biology, CB. ORCID iD: 0000-0002-2358-7815
2012 (English). In: Frontiers in Behavioral Neuroscience, ISSN 1662-5153, E-ISSN 1662-5153, Vol. 6, p. 65-. Article in journal (Refereed). Published
Abstract [en]

Several studies have shown a strong involvement of the basal ganglia (BG) in action selection and dopamine-dependent learning. The dopaminergic signal to the striatum, the input stage of the BG, has commonly been described as coding a reward prediction error (RPE), i.e. the difference between the predicted and actual reward. The RPE has been hypothesized to be critical in modulating synaptic plasticity at cortico-striatal synapses in the direct and indirect pathways. We developed an abstract computational model of the BG, with a dual pathway structure functionally corresponding to the direct and indirect pathways, and compared its behaviour to biological data as well as to other reinforcement learning models. The computations in our model are inspired by Bayesian inference, and the synaptic plasticity changes depend on a three-factor Hebbian-Bayesian learning rule based on co-activation of pre- and post-synaptic units and on the value of the RPE. The model builds on a modified Actor-Critic architecture and implements the direct (Go) and the indirect (NoGo) pathway, as well as the reward prediction (RP) system, acting in a complementary fashion. We investigated the performance of the model system when different configurations of the Go, NoGo and RP systems were used, e.g. using only the Go, NoGo, or RP system, or combinations of those. Learning performance was investigated in several types of learning paradigms, such as learning-relearning, successive learning, stochastic learning, reversal learning and a two-choice task. The RPE and the activity of the model during learning were similar to monkey electrophysiological and behavioural data. Our results, however, show that there is no unique best way to configure this BG model to handle all of the tested learning paradigms well. We thus suggest that an agent might dynamically configure its action selection mode, possibly depending on task characteristics and also on how much time is available.
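To make the ideas in the abstract concrete, the following is a minimal sketch of an RPE-gated, three-factor Hebbian update driving separate Go and NoGo weights. It is not the model from the article: the Hebbian-Bayesian (BCPNN) rule is replaced by a plain product rule, and every name and parameter (`simulate`, `three_factor_update`, the learning rate of 0.1, the initial prediction of 0.5, the epsilon-greedy selection, and the choice to route positive RPEs to Go and negative RPEs to NoGo) is an assumption made for illustration only.

```python
import random


def reward_prediction_error(actual, predicted):
    """RPE as described in the abstract: actual reward minus predicted reward."""
    return actual - predicted


def three_factor_update(weight, pre, post, rpe, rate=0.1):
    """Generic three-factor step: pre/post co-activation gated by the RPE.

    Assumption: a plain product rule, not the article's Hebbian-Bayesian rule.
    """
    return weight + rate * rpe * pre * post


def simulate(n_trials=300, n_actions=3, rewarded_action=1, epsilon=0.1, seed=0):
    """Single-state task in which only `rewarded_action` yields a reward of 1."""
    rng = random.Random(seed)
    go = [0.0] * n_actions         # hypothetical direct-pathway weights
    nogo = [0.0] * n_actions       # hypothetical indirect-pathway weights
    predicted = [0.5] * n_actions  # hypothetical RP system, neutral start

    for _ in range(n_trials):
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)  # occasional exploration
        else:
            net = [g - n for g, n in zip(go, nogo)]
            action = net.index(max(net))       # Go excites, NoGo suppresses

        reward = 1.0 if action == rewarded_action else 0.0
        rpe = reward_prediction_error(reward, predicted[action])

        pre, post = 1.0, 1.0  # one active state unit and the chosen action unit
        if rpe >= 0:
            # Assumption: better-than-expected outcomes strengthen Go.
            go[action] = three_factor_update(go[action], pre, post, rpe)
        else:
            # Assumption: worse-than-expected outcomes strengthen NoGo.
            nogo[action] = three_factor_update(nogo[action], pre, post, -rpe)

        predicted[action] += 0.1 * rpe  # delta-rule update of the prediction

    return go, nogo, predicted


if __name__ == "__main__":
    go, nogo, predicted = simulate()
    print("Go:", go)
    print("NoGo:", nogo)
    print("Predicted reward:", predicted)
```

In this toy setting the RPE for the rewarded action shrinks as its prediction approaches the actual reward, so learning slows down on its own. Dropping the `go` or `nogo` updates gives a crude analogue of the Go-only and NoGo-only configurations mentioned in the abstract, though it says nothing about the results reported there.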

Place, publisher, year, edition, pages
2012. Vol. 6, p. 65-
Keywords [en]
Basal ganglia, Bayesian inference, BCPNN, Behaviour selection, Direct-indirect pathway, Dopamine, Hebbian-Bayesian plasticity, Reinforcement learning, article, basal ganglion, Bayes theorem, brain function, controlled study, dopaminergic transmission, learning, mathematical model, mental task, motor control, nerve cell plasticity, probability, reinforcement, statistical analysis
National category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:kth:diva-105249
DOI: 10.3389/fnbeh.2012.00065
ISI: 000310727200001
Scopus ID: 2-s2.0-84866713557
OAI: oai:DiVA.org:kth-105249
DiVA, id: diva2:570603
Funder
EU, European Research Council, 237955 201716; Vetenskapsrådet (Swedish Research Council); Swedish e‐Science Research Center
Note

QC 20121120

Available from: 2012-11-20. Created: 2012-11-19. Last updated: 2018-01-12. Bibliographically approved.

Open Access in DiVA

No full text in DiVA

Authors
Berthet, Pierre; Hällgren Kotaleski, Jeanette; Lansner, Anders