Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Reinforcement Learning Based on a Bayesian Confidence Propagating Neural Network
KTH, Superseded Departments, Numerical Analysis and Computer Science, NADA. KTH, School of Computer Science and Communication (CSC), Computational Biology, CB.
KTH, Superseded Departments, Numerical Analysis and Computer Science, NADA.
KTH, Superseded Departments, Numerical Analysis and Computer Science, NADA.ORCID iD: 0000-0002-2358-7815
2003 (English)Conference paper, Published paper (Refereed)
Abstract [en]

We present a system capable of reinforcement learning (RL) based on the Bayesian confidence propagating neural network (BCPNN). The system is called BCPNNRL and its architecture is somewhat motivated by parallels to biology. We analyze the systems properties and we benchmark it against a simple Monte Carlo (MC) based RL algorithm, pursuit RL methods, and the Associative Reward Penalty (AR-P) algorithm. The system is used to solve the n-armed bandit problem, pattern association, and path finding in a maze.

Place, publisher, year, edition, pages
2003.
National Category
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-8797OAI: oai:DiVA.org:kth-8797DiVA: diva2:14231
Conference
2003, April 10-11, SAIS-SSLS Joint Workshop, Center for Applied Autonomous Sensor Systems, Örebro, Sweden
Available from: 2005-11-23 Created: 2005-11-23 Last updated: 2011-12-20Bibliographically approved
In thesis
1. Towards a framework for reinforcement learning with artificial neural networks
Open this publication in new window or tab >>Towards a framework for reinforcement learning with artificial neural networks
2004 (English)Licentiate thesis, comprehensive summary (Other scientific)
Place, publisher, year, edition, pages
Stockholm: Numerisk analys och datalogi, 2004
Series
Trita-NA, ISSN 0348-2952 ; 0437
Keyword
aritficial neural networks, reinforcement learning, BCPNN, modularity, genetic algorithms
National Category
Computer Science
Identifiers
urn:nbn:se:kth:diva-500 (URN)91-7283-928-7 (ISBN)
Presentation
2004-12-10, E32, KTH, Lindstedsvägen 3, Stockholm, 13:00
Available from: 2005-11-23 Created: 2005-11-23 Last updated: 2012-03-21

Open Access in DiVA

No full text

Other links

www.nada.kth.se

Search in DiVA

By author/editor
Johansson, ChristopherRaicevic, PeterLansner, Anders
By organisation
Numerical Analysis and Computer Science, NADAComputational Biology, CB
Computer Science

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 62 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf