kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
SCORE: Skill-Conditioned Online Reinforcement Learning
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Software and Computer systems, SCS. King.com Ltd., Sweden.ORCID iD: 0000-0002-0638-7352
King.com Ltd., Sweden.
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Software and Computer systems, SCS.ORCID iD: 0000-0002-2748-8929
2024 (English)Conference paper, Published paper (Refereed)
Abstract [en]

Solving complex long-horizon tasks through Reinforcement Learning (RL) from scratch presents challenges related to efficient exploration. Two common approaches to reduce complexity and enhance exploration efficiency are (i) integrating learning-from-demonstration techniques with online RL, where the prior knowledge acquired from demonstrations is used to guide exploration, refine representations, or tailor reward functions, and (ii) using representation learning to facilitate state abstraction. In this study, we present Skill-Conditioned Online REinforcement Learning (SCORE), a novel approach that leverages these two strategies and utilizes skills acquired from an unstructured demonstrations dataset in a policy gradient RL algorithm. This integration enriches the algorithm with informative input representations, improving downstream task learning and exploration efficiency. We evaluate our method on long-horizon robotic and navigation tasks and game environments, demonstrating enhancements in online RL performance compared to the baselines. Furthermore, we show our approach’s generalization capabilities and analyze its effectiveness through an ablation study.

Place, publisher, year, edition, pages
Association for the Advancement of Artificial Intelligence (AAAI) , 2024. p. 189-198
National Category
Computer Sciences Robotics and automation Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-358212DOI: 10.1609/aiide.v20i1.31879Scopus ID: 2-s2.0-85213057195OAI: oai:DiVA.org:kth-358212DiVA, id: diva2:1924846
Conference
20th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, AIIDE 2024, Lexington, United States of America, November 18-22, 2024
Note

Part of ISBN 978-1-57735-895-4

QC 20250116

Available from: 2025-01-07 Created: 2025-01-07 Last updated: 2025-02-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Karimi, SaraPayberah, Amir H.

Search in DiVA

By author/editor
Karimi, SaraPayberah, Amir H.
By organisation
Software and Computer systems, SCS
Computer SciencesRobotics and automationComputer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 40 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf