kth.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (7 of 7) Show all publications
Marta, D., Holk, S., Vasco, M., Lundell, J., Homberger, T., Busch, F. L., . . . Leite, I. (2025). FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions. In: : . Paper presented at IEEE International Conference on Robotics and Automation (ICRA), Atlanta, USA, 19-23 May 2025. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions
Show others...
2025 (English)Conference paper, Published paper (Refereed)
Abstract [en]

Preference-based reinforcement learning (PbRL) is a suitable approach for style adaptation of pre-trained robotic behavior: adapting the robot's policy to follow human user preferences while still being able to perform the original task. However, collecting preferences for the adaptation process in robotics is often challenging and time-consuming. In this work we explore the adaptation of pre-trained robots in the low-preference-data regime. We show that, in this regime, recent adaptation approaches suffer from catastrophic reward forgetting (CRF), where the updated reward model overfits to the new preferences, leading the agent to become unable to perform the original task. To mitigate CRF, we propose to enhance the original reward model with a small number of parameters (low-rank matrices) responsible for modeling the preference adaptation. Our evaluation shows that our method can efficiently and effectively adjust robotic behavior to human preferences across simulation benchmark tasks and multiple real-world robotic tasks. We provide videos of our results and source code at https://sites.google.com/view/preflora/

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-360980 (URN)
Conference
IEEE International Conference on Robotics and Automation (ICRA), Atlanta, USA, 19-23 May 2025
Note

QC 20250618

Available from: 2025-03-07 Created: 2025-03-07 Last updated: 2025-06-18Bibliographically approved
Taleb, F., Vasco, M., Ribeiro, A. H., Björkman, M. & Kragic Jensfelt, D. (2024). Can Transformers Smell Like Humans?. In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024: . Paper presented at 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, Dec 9 2024 - Dec 15 2024. Neural Information Processing Systems Foundation
Open this publication in new window or tab >>Can Transformers Smell Like Humans?
Show others...
2024 (English)In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Neural Information Processing Systems Foundation , 2024Conference paper, Published paper (Refereed)
Abstract [en]

The human brain encodes stimuli from the environment into representations that form a sensory perception of the world. Despite recent advances in understanding visual and auditory perception, olfactory perception remains an under-explored topic in the machine learning community due to the lack of large-scale datasets annotated with labels of human olfactory perception. In this work, we ask the question of whether pre-trained transformer models of chemical structures encode representations that are aligned with human olfactory perception, i.e., can transformers smell like humans? We demonstrate that representations encoded from transformers pre-trained on general chemical structures are highly aligned with human olfactory perception. We use multiple datasets and different types of perceptual representations to show that the representations encoded by transformer models are able to predict: (i) labels associated with odorants provided by experts; (ii) continuous ratings provided by human participants with respect to pre-defined descriptors; and (iii) similarity ratings between odorants provided by human participants. Finally, we evaluate the extent to which this alignment is associated with physicochemical features of odorants known to be relevant for olfactory decoding.

Place, publisher, year, edition, pages
Neural Information Processing Systems Foundation, 2024
National Category
Neurosciences Computer Sciences
Identifiers
urn:nbn:se:kth:diva-361995 (URN)2-s2.0-105000466521 (Scopus ID)
Conference
38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, Dec 9 2024 - Dec 15 2024
Note

QC 20250408

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-04-08Bibliographically approved
Santos, P. P., Carvalho, D. S., Vasco, M., Sardinha, A., Santos, P. A., Paiva, A. & Melo, F. S. (2024). Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning. In: AAMAS 2024 - Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems: . Paper presented at 23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2024, Auckland, New Zealand, May 6 2024 - May 10 2024 (pp. 2453-2455). International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Open this publication in new window or tab >>Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
Show others...
2024 (English)In: AAMAS 2024 - Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS) , 2024, p. 2453-2455Conference paper, Published paper (Refereed)
Abstract [en]

We introduce hybrid execution in multi-agent reinforcement learning (MARL), a new paradigm in which agents aim to successfully complete cooperative tasks with arbitrary communication levels at execution time by taking advantage of information-sharing among the agents. Under hybrid execution, the communication level can range from a setting in which no communication is allowed between agents (fully decentralized), to a setting featuring full communication (fully centralized), but the agents do not know beforehand which communication level they will encounter at execution time. To formalize our setting, we define a new class of multi-agent partially observable Markov decision processes (POMDPs) that we name hybrid-POMDPs, which explicitly model a communication process between the agents.

Place, publisher, year, edition, pages
International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), 2024
Keywords
Machine Learning, Multi-Agent Reinforcement Learning, Multi-Agent Systems, Reinforcement Learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-348768 (URN)2-s2.0-85196359245 (Scopus ID)
Conference
23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2024, Auckland, New Zealand, May 6 2024 - May 10 2024
Note

QC 20240701

Available from: 2024-06-27 Created: 2024-06-27 Last updated: 2024-07-01Bibliographically approved
Taleb, F., Vasco, M., Rajabi, N., Björkman, M. & Kragic, D. (2024). Challenging Deep Learning Methods for EEG Signal Denoising under Data Corruption. In: 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Proceedings: . Paper presented at 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024, Orlando, United States of America, Jul 15 2024 - Jul 19 2024. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Challenging Deep Learning Methods for EEG Signal Denoising under Data Corruption
Show others...
2024 (English)In: 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Proceedings, Institute of Electrical and Electronics Engineers (IEEE) , 2024Conference paper, Published paper (Refereed)
Abstract [en]

Capturing informative electroencephalogram (EEG) signals is a challenging task due to the presence of noise (e.g., due to human movement). In extreme cases, data recordings from specific electrodes (channels) can become corrupted and entirely devoid of information. Motivated by recent work on deep-learning-based approaches for EEG signal denoising, we present the first benchmark study on the performance of EEG signal denoising methods in the presence of corrupted channels. We design our study considering a wide variety of datasets, models, and evaluation tasks. Our results highlight the need for assessing the performance of EEG deep-learning models across a broad suite of datasets, as provided by our benchmark.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
data corruption, deep learning, EEG, signal denoising, signal noise
National Category
Signal Processing Computer Sciences
Identifiers
urn:nbn:se:kth:diva-358866 (URN)10.1109/EMBC53108.2024.10782132 (DOI)40039138 (PubMedID)2-s2.0-85214969123 (Scopus ID)
Conference
46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024, Orlando, United States of America, Jul 15 2024 - Jul 19 2024
Note

Part of ISBN 9798350371499]

QC 20250128

Available from: 2025-01-23 Created: 2025-01-23 Last updated: 2025-05-27Bibliographically approved
Esteves, B., Vasco, M. & Melo, F. S. (2024). NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks. In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024: . Paper presented at 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, Dec 9 2024 - Dec 15 2024. Neural information processing systems foundation, 37
Open this publication in new window or tab >>NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks
2024 (English)In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Neural information processing systems foundation , 2024, Vol. 37Conference paper, Published paper (Refereed)
Abstract [en]

We contribute NeuralSolver, a novel recurrent solver that can efficiently and consistently extrapolate, i.e., learn algorithms from smaller problems (in terms of observation size) and execute those algorithms in large problems. Contrary to previous recurrent solvers, NeuralSolver can be naturally applied in both same-size problems, where the input and output sizes are the same, and in different-size problems, where the size of the input and output differ. To allow for this versatility, we design NeuralSolver with three main components: a recurrent module, that iteratively processes input information at different scales, a processing module, responsible for aggregating the previously processed information, and a curriculum-based training scheme, that improves the extrapolation performance of the method. To evaluate our method we introduce a set of novel different-size tasks and we show that NeuralSolver consistently outperforms the prior state-of-the-art recurrent solvers in extrapolating to larger problems, considering smaller training problems and requiring less parameters than other approaches. Code available at https://github.com/esteveste/NeuralSolver.

Place, publisher, year, edition, pages
Neural information processing systems foundation, 2024
National Category
Computational Mathematics Computer Sciences
Identifiers
urn:nbn:se:kth:diva-361954 (URN)2-s2.0-105000478336 (Scopus ID)
Conference
38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, Dec 9 2024 - Dec 15 2024
Note

QC 20250409

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-04-09Bibliographically approved
Reichlin, A., Tegner, G., Vasco, M., Yin, H., Björkman, M. & Kragic, D. (2024). Reducing Variance in Meta-Learning via Laplace Approximation for Regression Tasks. Transactions on Machine Learning Research, 2024
Open this publication in new window or tab >>Reducing Variance in Meta-Learning via Laplace Approximation for Regression Tasks
Show others...
2024 (English)In: Transactions on Machine Learning Research, E-ISSN 2835-8856, Vol. 2024Article in journal (Refereed) Published
Abstract [en]

Given a finite set of sample points, meta-learning algorithms aim to learn an optimal adaptation strategy for new, unseen tasks. Often, this data can be ambiguous as it might belong to different tasks concurrently. This is particularly the case in meta-regression tasks. In such cases, the estimated adaptation strategy is subject to high variance due to the limited amount of support data for each task, which often leads to sub-optimal generalization performance. In this work, we address the problem of variance reduction in gradient-based meta-learning and formalize the class of problems prone to this, a condition we refer to as task overlap. Specifically, we propose a novel approach that reduces the variance of the gradient estimate by weighing each support point individually by the variance of its posterior over the parameters. To estimate the posterior, we utilize the Laplace approximation, which allows us to express the variance in terms of the curvature of the loss landscape of our meta-learner. Experimental results demonstrate the effectiveness of the proposed method and highlight the importance of variance reduction in meta-learning.

Place, publisher, year, edition, pages
Transactions on Machine Learning Research, 2024
National Category
Robotics and automation Control Engineering
Identifiers
urn:nbn:se:kth:diva-361197 (URN)2-s2.0-85219566964 (Scopus ID)
Note

QC 20250312

Available from: 2025-03-12 Created: 2025-03-12 Last updated: 2025-03-12Bibliographically approved
Zhang, Y., Vasco, M., Björkman, M. & Kragic, D. (2024). Will You Participate? Exploring the Potential of Robotics Competitions on Human-Centric Topics. In: Human-Computer Interaction - Thematic Area, HCI 2024, Held as Part of the 26th HCI International Conference, HCII 2024, Proceedings: . Paper presented at Human Computer Interaction thematic area of the 26th International Conference on Human-Computer Interaction, HCII 2024, Washington, United States of America, Jun 29 2024 - Jul 4 2024 (pp. 240-255). Springer Nature
Open this publication in new window or tab >>Will You Participate? Exploring the Potential of Robotics Competitions on Human-Centric Topics
2024 (English)In: Human-Computer Interaction - Thematic Area, HCI 2024, Held as Part of the 26th HCI International Conference, HCII 2024, Proceedings, Springer Nature , 2024, p. 240-255Conference paper, Published paper (Refereed)
Abstract [en]

This paper presents findings from an exploratory needfinding study investigating the research current status and potential participation of the competitions on the robotics community towards four human-centric topics: safety, privacy, explainability, and federated learning. We conducted a survey with 34 participants across three distinguished European robotics consortia, nearly 60% of whom possessed over five years of research experience in robotics. Our qualitative and quantitative analysis revealed that current mainstream robotic researchers prioritize safety and explainability, expressing a greater willingness to invest in further research in these areas. Conversely, our results indicate that privacy and federated learning garner less attention and are perceived to have lower potential. Additionally, the study suggests a lack of enthusiasm within the robotics community for participating in competitions related to these topics. Based on these findings, we recommend targeting other communities, such as the machine learning community, for future competitions related to these four human-centric topics.

Place, publisher, year, edition, pages
Springer Nature, 2024
Keywords
explainability, federated learning, needfinding, privacy, Robotics, safety, survey
National Category
Computer Sciences Robotics and automation
Identifiers
urn:nbn:se:kth:diva-350540 (URN)10.1007/978-3-031-60412-6_18 (DOI)001283865000018 ()2-s2.0-85195824006 (Scopus ID)
Conference
Human Computer Interaction thematic area of the 26th International Conference on Human-Computer Interaction, HCII 2024, Washington, United States of America, Jun 29 2024 - Jul 4 2024
Note

Part of ISBN 9783031604119

QC 20240716

Available from: 2024-07-16 Created: 2024-07-16 Last updated: 2025-02-05Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-5761-4105

Search in DiVA

Show all publications