kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (10 of 14) Show all publications
Santos, P. P., Carvalho, D. S., Vasco, M., Sardinha, A., Santos, P. A., Paiva, A. & Melo, F. S. (2025). Centralized training with hybrid execution in multi-agent reinforcement learning via predictive observation imputation. Artificial Intelligence, 348, Article ID 104404.
Open this publication in new window or tab >>Centralized training with hybrid execution in multi-agent reinforcement learning via predictive observation imputation
Show others...
2025 (English)In: Artificial Intelligence, ISSN 0004-3702, E-ISSN 1872-7921, Vol. 348, article id 104404Article in journal (Refereed) Published
Abstract [en]

We study hybrid execution in multi-agent reinforcement learning (MARL), a paradigm where agents aim to complete cooperative tasks with arbitrary communication levels at execution time by taking advantage of information-sharing among the agents. Under hybrid execution, the communication level can range from a setting in which no communication is allowed between agents (fully decentralized), to a setting featuring full communication (fully centralized), but the agents do not know beforehand which communication level they will encounter at execution time. We contribute MARO, an approach that makes use of an auto-regressive predictive model, trained in a centralized manner, to estimate missing agents' observations at execution time. We evaluate MARO on standard scenarios and extensions of previous benchmarks tailored to emphasize the impact of partial observability in MARL. Experimental results show that our method consistently outperforms relevant baselines, allowing agents to act with faulty communication while successfully exploiting shared information.

Place, publisher, year, edition, pages
Elsevier BV, 2025
Keywords
Machine learning, Multi-agent reinforcement learning, Multi-agent systems, Reinforcement learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-370406 (URN)10.1016/j.artint.2025.104404 (DOI)001573396900001 ()2-s2.0-105015723683 (Scopus ID)
Note

QC 20250926

Available from: 2025-09-26 Created: 2025-09-26 Last updated: 2025-12-05Bibliographically approved
Rajabi, N., Ribeiro, A. H., Vasco, M. & Kragic Jensfelt, D. (2025). Deep Learning Amplified Early Stopping Bias: Overestimating Performance on Small Datasets. In: 2025 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp): . Paper presented at 2025 International Conference on Acoustics Speech and Signal Processing-ICASSP-Annual, APR 06-11, 2025, Hyderabad, INDIA. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Deep Learning Amplified Early Stopping Bias: Overestimating Performance on Small Datasets
2025 (English)In: 2025 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp), Institute of Electrical and Electronics Engineers (IEEE) , 2025Conference paper, Published paper (Refereed)
Abstract [en]

Cross-validation is commonly used to estimate machine learning model performance on new samples. However, using it for both hyperparameter selection and error estimation can lead to overestimating model performance, especially with extensive hyperparameter searches that overly tailor models to validation data. We demonstrate that deep learning further amplifies this bias, with even minor model adjustments causing significant overestimation. Our extensive experiments on simulated and real data focus on the bias from early stopping during cross-validation. We find that overestimation intensifies with network depth and is especially severe in small datasets, which are common in physiological signal processing applications. Selecting the early stopping point during cross-validation can result in ROC-AUC estimates exceeding 90% on random data, and this effect persists across various sample sizes, architectures, and network sizes1.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Series
International Conference on Acoustics Speech and Signal Processing ICASSP, ISSN 1520-6149
Keywords
Cross-validation, deep learning, early stopping, error-estimation bias, small datasets.
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:kth:diva-378813 (URN)10.1109/ICASSP49660.2025.10890439 (DOI)001611519700380 ()2-s2.0-105009593127 (Scopus ID)979-8-3503-6875-8 (ISBN)979-8-3503-6874-1 (ISBN)
Conference
2025 International Conference on Acoustics Speech and Signal Processing-ICASSP-Annual, APR 06-11, 2025, Hyderabad, INDIA
Note

QC 20260401

Available from: 2026-04-01 Created: 2026-04-01 Last updated: 2026-04-16Bibliographically approved
Rajabi, N., Zanettin, I., Ribeiro, A. H., Vasco, M., Björkman, M., Lundström, J. N. & Kragic Jensfelt, D. (2025). Exploring the feasibility of olfactory brain–computer interfaces. Scientific Reports, 15(1)
Open this publication in new window or tab >>Exploring the feasibility of olfactory brain–computer interfaces
Show others...
2025 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 15, no 1Article in journal (Refereed) Published
Abstract [en]

In this study, we explore the feasibility of single-trial predictions of odor registration in the brain using olfactory bio-signals. We focus on two main aspects: input data modality and the processing model. For the first time, we assess the predictability of odor registration from novel electrobulbogram (EBG) recordings, both in sensor and source space, and compare these with commonly used electroencephalogram (EEG) signals. Despite having fewer data channels, EBG shows comparable performance to EEG. We also examine whether breathing patterns contain relevant information for this task. By comparing a logistic regression classifier, which requires hand-crafted features, with an end-to-end convolutional deep neural network, we find that end-to-end approaches can be as effective as classic methods. However, due to the high dimensionality of the data, the current dataset is insufficient for either classifier to robustly differentiate odor and non-odor trials. Finally, we identify key challenges in olfactory BCIs and suggest future directions for improving odor detection systems.

Place, publisher, year, edition, pages
United Kingdom: Springer Nature, 2025
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-379335 (URN)10.1038/s41598-025-01488-z (DOI)001496076300032 ()40419502 (PubMedID)2-s2.0-105006408347 (Scopus ID)
Note

QC 20260416

Available from: 2026-04-16 Created: 2026-04-16 Last updated: 2026-04-16Bibliographically approved
Betran, S. B., Longhini, A., Vasco, M., Zhang, Y. & Kragic Jensfelt, D. (2025). FLAME: A Federated Learning Benchmark for Robotic Manipulation. In: IROS 2025 - 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems, Conference Proceedings: . Paper presented at 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2025, Hangzhou, China, Oct 19 2025 - Oct 25 2025 (pp. 2494-2500). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>FLAME: A Federated Learning Benchmark for Robotic Manipulation
Show others...
2025 (English)In: IROS 2025 - 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems, Conference Proceedings, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 2494-2500Conference paper, Published paper (Refereed)
Abstract [en]

Recent progress in robotic manipulation has been fueled by large-scale datasets collected across diverse environments. Training robotic manipulation policies on these datasets is traditionally performed in a centralized manner, raising concerns regarding scalability, adaptability, and data privacy. While federated learning enables decentralized, privacy-preserving training, its application to robotic manipulation remains largely unexplored. We introduce FLAME (Federated Learning Across Manipulation Environments), the first benchmark designed for federated learning in robotic manipulation. FLAME consists of: (i) a set of large-scale datasets of over 160,000 expert demonstrations of multiple manipulation tasks, collected across a wide range of simulated environments; (ii) a training and evaluation framework for robotic policy learning in a federated setting. We evaluate standard federated learning algorithms in FLAME, showing their potential for distributed policy learning and highlighting key challenges. Our benchmark establishes a foundation for scalable, adaptive, and privacy-aware robotic learning. The code is publicly available at https://github.com/KTH-RPL/ELSA-Robotics-Challenge.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
National Category
Computer Sciences Robotics and automation Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-377806 (URN)10.1109/IROS60139.2025.11245937 (DOI)2-s2.0-105029951023 (Scopus ID)
Conference
2025 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2025, Hangzhou, China, Oct 19 2025 - Oct 25 2025
Note

Part of ISBN 9798331543938

QC 20260312

Available from: 2026-03-12 Created: 2026-03-12 Last updated: 2026-03-12Bibliographically approved
Marta, D., Holk, S., Vasco, M., Lundell, J., Homberger, T., Busch, F. L., . . . Leite, I. (2025). FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions. In: IEEE International Conference on Robotics and Automation: . Paper presented at IEEE International Conference on Robotics and Automation, ICRA 2025, Atlanta, USA, 19-23 May 2025 (pp. 4789-4796). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions
Show others...
2025 (English)In: IEEE International Conference on Robotics and Automation, Institute of Electrical and Electronics Engineers (IEEE), 2025, p. 4789-4796Conference paper, Published paper (Refereed)
Abstract [en]

Preference-based reinforcement learning (PbRL) is a suitable approach for style adaptation of pre-trained robotic behavior: adapting the robot's policy to follow human user preferences while still being able to perform the original task. However, collecting preferences for the adaptation process in robotics is often challenging and time-consuming. In this work we explore the adaptation of pre-trained robots in the low-preference-data regime. We show that, in this regime, recent adaptation approaches suffer from catastrophic reward forgetting (CRF), where the updated reward model overfits to the new preferences, leading the agent to become unable to perform the original task. To mitigate CRF, we propose to enhance the original reward model with a small number of parameters (low-rank matrices) responsible for modeling the preference adaptation. Our evaluation shows that our method can efficiently and effectively adjust robotic behavior to human preferences across simulation benchmark tasks and multiple real-world robotic tasks. We provide videos of our results and source code at https://sites.google.com/view/preflora/

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-360980 (URN)10.1109/ICRA55743.2025.11127633 (DOI)2-s2.0-105016684037 (Scopus ID)
Conference
IEEE International Conference on Robotics and Automation, ICRA 2025, Atlanta, USA, 19-23 May 2025
Note

QC 20250618

Part of ISBN 979-833154139-2

Available from: 2025-03-07 Created: 2025-03-07 Last updated: 2025-10-14Bibliographically approved
Rajabi, N., Ribeiro, A. H., Vasco, M., Taleb, F., Björkman, M. & Kragic Jensfelt, D. (2025). Human-Aligned Image Models Improve Visual Decoding from the Brain. In: Proceedings of the 42nd International Conference on Machine Learning: . Paper presented at International Conference on Machine Learning, 13-19 July 2025, Vancouver, Canada. MLResearchPress
Open this publication in new window or tab >>Human-Aligned Image Models Improve Visual Decoding from the Brain
Show others...
2025 (English)In: Proceedings of the 42nd International Conference on Machine Learning, MLResearchPress , 2025Conference paper, Published paper (Refereed)
Abstract [en]

Decoding visual images from brain activity has significant potential for advancing brain-computer interaction and enhancing the understanding of human perception. Recent approaches align the representation spaces of images and brain activity to enable visual decoding. In this paper, we introduce the use of human-aligned image encoders to map brain signals to images. We hypothesize that these models more effectively capture perceptual attributes associated with the rapid visual stimuli presentations commonly used in visual brain data recording experiments. Our empirical results support this hypothesis, demonstrating that this simple modification improves image retrieval accuracy by up to 21% compared to state-of-the-art methods. Comprehensive experiments confirm consistent performance improvements across diverse EEG architectures, image encoders, alignment methods, participants, and brain imaging modalities.

Place, publisher, year, edition, pages
MLResearchPress, 2025
Series
Proceedings of Machine Learning Research, ISSN 2640-3498
Keywords
Computer Science: Computer Vision and Pattern Recognition, Computer Science: Machine Learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-379306 (URN)
Conference
International Conference on Machine Learning, 13-19 July 2025, Vancouver, Canada
Note

QC 20260416

Available from: 2026-04-16 Created: 2026-04-16 Last updated: 2026-04-16Bibliographically approved
Reichlin, A., Vasco, M. & Kragic Jensfelt, D. (2025). Walking on the Fiber: A Simple Geometric Approximation for Bayesian Neural Networks. Transactions on Machine Learning Research, 2025-July
Open this publication in new window or tab >>Walking on the Fiber: A Simple Geometric Approximation for Bayesian Neural Networks
2025 (English)In: Transactions on Machine Learning Research, E-ISSN 2835-8856, Vol. 2025-JulyArticle in journal (Refereed) Published
Abstract [en]

Bayesian Neural Networks provide a principled framework for uncertainty quantification by modeling the posterior distribution of network parameters. However, exact posterior inference is computationally intractable, and widely used approximations like the Laplace method struggle with scalability and posterior accuracy in modern deep networks. In this work, we revisit sampling techniques for posterior exploration, proposing a simple variation tailored to efficiently sample from the posterior in over-parameterized networks by leveraging the low-dimensional structure of loss minima. Building on this, we introduce a model that learns a deformation of the parameter space, enabling rapid posterior sampling without requiring iterative methods. Empirical results demonstrate that our approach achieves competitive posterior approximations with improved scalability compared to recent refinement techniques. These contributions provide a practical alternative for Bayesian inference in deep learning.

Place, publisher, year, edition, pages
Transactions on Machine Learning Research, 2025
National Category
Computer Sciences Control Engineering Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:kth:diva-369023 (URN)2-s2.0-105011329903 (Scopus ID)
Note

QC 20250908

Available from: 2025-09-08 Created: 2025-09-08 Last updated: 2026-02-16Bibliographically approved
Taleb, F., Vasco, M., Ribeiro, A. H., Björkman, M. & Kragic Jensfelt, D. (2024). Can Transformers Smell Like Humans?. In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024: . Paper presented at 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, Dec 9 2024 - Dec 15 2024. Neural Information Processing Systems Foundation
Open this publication in new window or tab >>Can Transformers Smell Like Humans?
Show others...
2024 (English)In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Neural Information Processing Systems Foundation , 2024Conference paper, Published paper (Refereed)
Abstract [en]

The human brain encodes stimuli from the environment into representations that form a sensory perception of the world. Despite recent advances in understanding visual and auditory perception, olfactory perception remains an under-explored topic in the machine learning community due to the lack of large-scale datasets annotated with labels of human olfactory perception. In this work, we ask the question of whether pre-trained transformer models of chemical structures encode representations that are aligned with human olfactory perception, i.e., can transformers smell like humans? We demonstrate that representations encoded from transformers pre-trained on general chemical structures are highly aligned with human olfactory perception. We use multiple datasets and different types of perceptual representations to show that the representations encoded by transformer models are able to predict: (i) labels associated with odorants provided by experts; (ii) continuous ratings provided by human participants with respect to pre-defined descriptors; and (iii) similarity ratings between odorants provided by human participants. Finally, we evaluate the extent to which this alignment is associated with physicochemical features of odorants known to be relevant for olfactory decoding.

Place, publisher, year, edition, pages
Neural Information Processing Systems Foundation, 2024
National Category
Neurosciences Computer Sciences
Identifiers
urn:nbn:se:kth:diva-361995 (URN)2-s2.0-105000466521 (Scopus ID)
Conference
38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, Dec 9 2024 - Dec 15 2024
Note

QC 20250408

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-04-08Bibliographically approved
Santos, P. P., Carvalho, D. S., Vasco, M., Sardinha, A., Santos, P. A., Paiva, A. & Melo, F. S. (2024). Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning. In: AAMAS 2024 - Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems: . Paper presented at 23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2024, Auckland, New Zealand, May 6 2024 - May 10 2024 (pp. 2453-2455). International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Open this publication in new window or tab >>Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
Show others...
2024 (English)In: AAMAS 2024 - Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS) , 2024, p. 2453-2455Conference paper, Published paper (Refereed)
Abstract [en]

We introduce hybrid execution in multi-agent reinforcement learning (MARL), a new paradigm in which agents aim to successfully complete cooperative tasks with arbitrary communication levels at execution time by taking advantage of information-sharing among the agents. Under hybrid execution, the communication level can range from a setting in which no communication is allowed between agents (fully decentralized), to a setting featuring full communication (fully centralized), but the agents do not know beforehand which communication level they will encounter at execution time. To formalize our setting, we define a new class of multi-agent partially observable Markov decision processes (POMDPs) that we name hybrid-POMDPs, which explicitly model a communication process between the agents.

Place, publisher, year, edition, pages
International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), 2024
Keywords
Machine Learning, Multi-Agent Reinforcement Learning, Multi-Agent Systems, Reinforcement Learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-348768 (URN)2-s2.0-85196359245 (Scopus ID)
Conference
23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2024, Auckland, New Zealand, May 6 2024 - May 10 2024
Note

QC 20240701

Available from: 2024-06-27 Created: 2024-06-27 Last updated: 2024-07-01Bibliographically approved
Taleb, F., Vasco, M., Rajabi, N., Björkman, M. & Kragic, D. (2024). Challenging Deep Learning Methods for EEG Signal Denoising under Data Corruption. In: 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Proceedings: . Paper presented at 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024, Orlando, United States of America, Jul 15 2024 - Jul 19 2024. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Challenging Deep Learning Methods for EEG Signal Denoising under Data Corruption
Show others...
2024 (English)In: 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Proceedings, Institute of Electrical and Electronics Engineers (IEEE) , 2024Conference paper, Published paper (Refereed)
Abstract [en]

Capturing informative electroencephalogram (EEG) signals is a challenging task due to the presence of noise (e.g., due to human movement). In extreme cases, data recordings from specific electrodes (channels) can become corrupted and entirely devoid of information. Motivated by recent work on deep-learning-based approaches for EEG signal denoising, we present the first benchmark study on the performance of EEG signal denoising methods in the presence of corrupted channels. We design our study considering a wide variety of datasets, models, and evaluation tasks. Our results highlight the need for assessing the performance of EEG deep-learning models across a broad suite of datasets, as provided by our benchmark.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
data corruption, deep learning, EEG, signal denoising, signal noise
National Category
Signal Processing Computer Sciences
Identifiers
urn:nbn:se:kth:diva-358866 (URN)10.1109/EMBC53108.2024.10782132 (DOI)40039138 (PubMedID)2-s2.0-85214969123 (Scopus ID)
Conference
46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024, Orlando, United States of America, Jul 15 2024 - Jul 19 2024
Note

Part of ISBN 9798350371499]

QC 20250128

Available from: 2025-01-23 Created: 2025-01-23 Last updated: 2025-05-27Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-5761-4105

Search in DiVA

Show all publications