kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (10 of 36) Show all publications
Janati, Y., Moulines, E., Olsson, J. & Oliviero-Durmus, A. (2025). Bridging diffusion posterior sampling and Monte Carlo methods: a survey. Philosophical Transactions. Series A: Mathematical, physical, and engineering science, 383(2299), Article ID 20240331.
Open this publication in new window or tab >>Bridging diffusion posterior sampling and Monte Carlo methods: a survey
2025 (English)In: Philosophical Transactions. Series A: Mathematical, physical, and engineering science, ISSN 1364-503X, E-ISSN 1471-2962, Vol. 383, no 2299, article id 20240331Article, review/survey (Refereed) Published
Abstract [en]

Diffusion models enable the synthesis of highly accurate samples from complex distributions and have become foundational in generative modelling. Recently, they have demonstrated significant potential for solving Bayesian inverse problems by serving as priors. This review offers a comprehensive overview of current methods that leverage pre-trained diffusion models alongside Monte Carlo methods to address Bayesian inverse problems without requiring addi- tional training. We show that these methods primarily employ a twisting mechanism for the intermediate distributions within the diffusion process, guiding the simulations towards the posterior distribution. We describe how various Monte Carlo methods are then used to aid in sampling from these twisted distributions. This article is part of the theme issue ‘Generative modelling meets Bayesian inference: a new paradigm for inverse problems’..

Place, publisher, year, edition, pages
The Royal Society, 2025
Keywords
Bayesian inverse problems, diffusion models, Monte Carlo methods
National Category
Probability Theory and Statistics Computational Mathematics
Identifiers
urn:nbn:se:kth:diva-368687 (URN)10.1098/rsta.2024.0331 (DOI)001511003000006 ()40534298 (PubMedID)2-s2.0-105009002163 (Scopus ID)
Note

QC 20250821

Available from: 2025-08-21 Created: 2025-08-21 Last updated: 2025-09-08Bibliographically approved
Mastrototaro, A., Müller, M. & Olsson, J. (2025). Recursive Learning of Asymptotic Variational Objectives. In: Proceedings of the 28th International Conference on Artificial Intelligence and Statistics, AISTATS 2025: . Paper presented at 28th International Conference on Artificial Intelligence and Statistics, AISTATS 2025, Mai Khao, Thailand, May 3 2025 - May 5 2025 (pp. 1432-1440). ML Research Press
Open this publication in new window or tab >>Recursive Learning of Asymptotic Variational Objectives
2025 (English)In: Proceedings of the 28th International Conference on Artificial Intelligence and Statistics, AISTATS 2025, ML Research Press , 2025, p. 1432-1440Conference paper, Published paper (Refereed)
Abstract [en]

General state-space models (SSMs) are widely used in statistical machine learning and are among the most classical generative models for sequential time-series data. SSMs, comprising latent Markovian states, can be subjected to variational inference (VI), but standard VI methods like the importance-weighted autoencoder (IWAE) lack functionality for streaming data. To enable online VI in SSMs when the observations are received in real time, we propose maximising an IWAE-type variational lower bound on the asymptotic contrast function, rather than the standard IWAE ELBO, using stochastic approximation. Unlike the recursive maximum likelihood method, which directly maximises the asymptotic contrast, our approach, called online sequential IWAE (OSIWAE), allows for online learning of both model parameters and a Markovian recognition model for inferring latent states. By approximating filter state posteriors and their derivatives using sequential Monte Carlo (SMC) methods, we create a particle-based framework for online VI in SSMs. This approach is more theoretically well-founded than recently proposed online variational SMC methods. We provide rigorous theoretical results on the learning objective and a numerical study demonstrating the method's efficiency in learning model parameters and particle proposal kernels.

Place, publisher, year, edition, pages
ML Research Press, 2025
National Category
Probability Theory and Statistics Control Engineering Computer Sciences Computer Systems
Identifiers
urn:nbn:se:kth:diva-370309 (URN)2-s2.0-105014328961 (Scopus ID)
Conference
28th International Conference on Artificial Intelligence and Statistics, AISTATS 2025, Mai Khao, Thailand, May 3 2025 - May 5 2025
Note

QC 20250925

Not duplicate with DiVA 1911260

Available from: 2025-09-25 Created: 2025-09-25 Last updated: 2025-09-25Bibliographically approved
Mastrototaro, A., Müller, M. & Olsson, J. (2025). Recursive Learning of Asymptotic Variational Objectives. In: Li, Y Mandt, S Agrawal, S Khan, E (Ed.), INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS: . Paper presented at 28th International Conference on Artificial Intelligence and Statistics-AISTATS-Annual, MAY 03-05, 2025, THAILAND. JMLR, 258
Open this publication in new window or tab >>Recursive Learning of Asymptotic Variational Objectives
2025 (English)In: INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS / [ed] Li, Y Mandt, S Agrawal, S Khan, E, JMLR , 2025, Vol. 258Conference paper, Published paper (Refereed)
Abstract [en]

General state-space models (SSMs) are widely used in statistical machine learning and are among the most classical generative models for sequential time-series data. SSMs, comprising latent Markovian states, can be subjected to variational inference (VI), but standard VI methods like the importance-weighted autoencoder (IWAE) lack functionality for streaming data. To enable online VI in SSMs when the observations are received in real time, we propose maximising an IWAE-type variational lower bound on the asymptotic contrast function, rather than the standard IWAE ELBO, using stochastic approximation. Unlike the recursive maximum likelihood method, which directly maximises the asymptotic contrast, our approach, called online sequential IWAE (OS-IWAE), allows for online learning of both model parameters and a Markovian recognition model for inferring latent states. By approximating filter state posteriors and their derivatives using sequential Monte Carlo (SMC) methods, we create a particle-based framework for online VI in SSMs. This approach is more theoretically well-founded than recently proposed online variational SMC methods. We provide rigorous theoretical results on the learning objective and a numerical study demonstrating the method's efficiency in learning model parameters and particle proposal kernels.

Place, publisher, year, edition, pages
JMLR, 2025
Series
Proceedings of Machine Learning Research, ISSN 2640-3498
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:kth:diva-378637 (URN)001593416700160 ()
Conference
28th International Conference on Artificial Intelligence and Statistics-AISTATS-Annual, MAY 03-05, 2025, THAILAND
Note

QC 20260327

Available from: 2026-03-27 Created: 2026-03-27 Last updated: 2026-03-27Bibliographically approved
Janati, Y., Moufad, B., Durmus, A., Moulines, E. & Olsson, J. (2024). Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors. In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024: . Paper presented at 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, December 9-15, 2024. Neural information processing systems foundation
Open this publication in new window or tab >>Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors
Show others...
2024 (English)In: Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024, Neural information processing systems foundation , 2024Conference paper, Published paper (Refereed)
Abstract [en]

Recent advancements in solving Bayesian inverse problems have spotlighted denoising diffusion models (DDMs) as effective priors. Although these have great potential, DDM priors yield complex posterior distributions that are challenging to sample. Existing approaches to posterior sampling in this context address this problem either by retraining model-specific components, leading to stiff and cumbersome methods, or by introducing approximations with uncontrolled errors that affect the accuracy of the produced samples. We present an innovative framework, divide-and-conquer posterior sampling, which leverages the inherent structure of DDMs to construct a sequence of intermediate posteriors that guide the produced samples to the target posterior. Our method significantly reduces the approximation error associated with current techniques without the need for retraining. We demonstrate the versatility and effectiveness of our approach for a wide range of Bayesian inverse problems. The code is available at https://github.com/Badr-MOUFAD/dcps.

Place, publisher, year, edition, pages
Neural information processing systems foundation, 2024
National Category
Computational Mathematics Probability Theory and Statistics
Identifiers
urn:nbn:se:kth:diva-361994 (URN)2-s2.0-105000508457 (Scopus ID)
Conference
38th Conference on Neural Information Processing Systems, NeurIPS 2024, Vancouver, Canada, December 9-15, 2024
Note

Part of ISBN 9798331314385

QC 20250404

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-04-04Bibliographically approved
Mastrototaro, A., Olsson, J. & Alenlöv, J. (2024). Fast and Numerically Stable Particle-Based Online Additive Smoothing: The AdaSmooth Algorithm. Journal of the American Statistical Association, 119(545), 356-367
Open this publication in new window or tab >>Fast and Numerically Stable Particle-Based Online Additive Smoothing: The AdaSmooth Algorithm
2024 (English)In: Journal of the American Statistical Association, ISSN 0162-1459, E-ISSN 1537-274X, Vol. 119, no 545, p. 356-367Article in journal (Refereed) Published
Abstract [en]

We present a novel sequential Monte Carlo approach to online smoothing of additive functionals in a very general class of path-space models. Hitherto, the solutions proposed in the literature suffer from either long-term numerical instability due to particle-path degeneracy or, in the case that degeneracy is remedied by particle approximation of the so-called backward kernel, high computational demands. In order to balance optimally computational speed against numerical stability, we propose to furnish a (fast) naive particle smoother, propagating recursively a sample of particles and associated smoothing statistics, with an adaptive backward-sampling-based updating rule which allows the number of (costly) backward samples to be kept at a minimum. This yields a new, function-specific additive smoothing algorithm, AdaSmooth, which is computationally fast, numerically stable and easy to implement. The algorithm is provided with rigorous theoretical results guaranteeing its consistency, asymptotic normality and long-term stability as well as numerical results demonstrating empirically the clear superiority of AdaSmooth to existing algorithms. Supplementary materials for this article are available online. 

Place, publisher, year, edition, pages
Informa UK Limited, 2024
Keywords
Adaptive sequential Monte Carlo methods, Central limit theorem, Effective sample size, Particle smoothing, Particle-path degeneracy, State-space models
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:kth:diva-328340 (URN)10.1080/01621459.2022.2118602 (DOI)000865636000001 ()2-s2.0-85139832848 (Scopus ID)
Note

QC 20251222

Available from: 2023-06-07 Created: 2023-06-07 Last updated: 2025-12-22Bibliographically approved
Mastrototaro, A. & Olsson, J. (2024). Online Variational Sequential Monte Carlo. In: Proceedings of Machine Learning Research: . Paper presented at 41st International Conference on Machine Learning, ICML 2024, Vienna, Austria, Jul 21 2024 - Jul 27 2024 (pp. 35039-35062). ML Research Press, 235
Open this publication in new window or tab >>Online Variational Sequential Monte Carlo
2024 (English)In: Proceedings of Machine Learning Research, ML Research Press , 2024, Vol. 235, p. 35039-35062Conference paper, Published paper (Refereed)
Abstract [en]

Being the most classical generative model for serial data, state-space models (SSM) are fundamental in AI and statistical machine learning. In SSM, any form of parameter learning or latent state inference typically involves the computation of complex latent-state posteriors. In this work, we build upon the variational sequential Monte Carlo (VSMC) method, which provides computationally efficient and accurate model parameter estimation and Bayesian latent-state inference by combining particle methods and variational inference. While standard VSMC operates in the offline mode, by re-processing repeatedly a given batch of data, we distribute the approximation of the gradient of the VSMC surrogate ELBO in time using stochastic approximation, allowing for online learning in the presence of streams of data. This results in an algorithm, online VSMC, that is capable of performing efficiently, entirely on-the-fly, both parameter estimation and particle proposal adaptation. In addition, we provide rigorous theoretical results describing the algorithm's convergence properties as the number of data tends to infinity as well as numerical illustrations of its excellent convergence properties and usefulness also in batch-processing settings.

Place, publisher, year, edition, pages
ML Research Press, 2024
Series
Proceedings of Machine Learning Research, ISSN 2640-3498
National Category
Computer Sciences Control Engineering
Identifiers
urn:nbn:se:kth:diva-353953 (URN)2-s2.0-85203797973 (Scopus ID)
Conference
41st International Conference on Machine Learning, ICML 2024, Vienna, Austria, Jul 21 2024 - Jul 27 2024
Note

QC 20241114

Available from: 2024-09-25 Created: 2024-09-25 Last updated: 2024-11-14Bibliographically approved
Cardoso, G., Moulines, E. & Olsson, J. (2024). Particle-Based, Rapid Incremental Smoother Meets Particle Gibbs. Statistica sinica, 34, 1115-1144
Open this publication in new window or tab >>Particle-Based, Rapid Incremental Smoother Meets Particle Gibbs
2024 (English)In: Statistica sinica, ISSN 1017-0405, E-ISSN 1996-8507, Vol. 34, p. 1115-1144Article in journal (Refereed) Published
Abstract [en]

The particle-based rapid incremental smoother (PARIS) is a sequential Monte Carlo technique that allows for efficient online approximations of expectations of additive functionals under Feynman–Kac path distributions. Under weak assumptions, the algorithm has linear computational complexity and limited memory requirements. It also comes with a number of nonasymptotic bounds and convergence results. However, being based on self-normalized importance sampling, the PARIS estimator is biased. This bias is inversely proportional to the number of particles, but has been found to grow linearly with the time horizon, under appropriate mixing conditions. In this work, we propose the Parisian particle Gibbs (PPG) sampler, which has essentially the same complexity as that of the PARIS, but significantly reduces the bias for a given computational complexity at the cost of a modest increase in the variance. This method is a wrapper, in the sense that it uses the PARIS algorithm in the inner loop of the particle Gibbs algorithm to form a bias-reduced version of the targeted quantities. We substantiate the PPG algorithm with theoretical results, including new bounds on the bias and variance, as well as deviation inequalities. We illustrate our theoretical results using numerical experiments that support our claims.

Place, publisher, year, edition, pages
Institute of Statistical Science, 2024
Keywords
Bias reduction, particle filters, particle Gibbs, sequential Monte Carlo, smoothing of additive functionals, state space smoothing
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:kth:diva-367022 (URN)10.5705/ss.202020.0215 (DOI)2-s2.0-85193504121 (Scopus ID)
Note

QC 20250714

Available from: 2025-07-14 Created: 2025-07-14 Last updated: 2025-07-14Bibliographically approved
Cardoso, G., Moulines, E. & Olsson, J. (2024). Particle-based, Rapid Incremental Smoother Meets Particle Gibbs. Statistica sinica, 34(SI), 1115-1144
Open this publication in new window or tab >>Particle-based, Rapid Incremental Smoother Meets Particle Gibbs
2024 (English)In: Statistica sinica, ISSN 1017-0405, E-ISSN 1996-8507, Vol. 34, no SI, p. 1115-1144Article in journal (Refereed) Published
Abstract [en]

The particle-based, rapid incremental smoother (PARIS) is a sequential Monte Carlo technique allowing for efficient online approximation of expectations of additive functionals under Feynman–Kac path distributions. Under weak assumptions, the algorithm has linear computational complexity and limited memory requirements. It also comes with a number of non-asymptotic bounds and convergence results. However, being based on self-normalised importance sampling, the PARIS estimator is biased; its bias is inversely proportional to the number of particles but has been found to grow linearly with the time horizon under appropriate mixing conditions. In this work, we propose the Parisian particle Gibbs (PPG) sampler, whose complexity is essentially the same as that of the PARIS and which significantly reduces the bias for a given computational complexity at the price of a modest increase in the variance. This method is a wrapper in the sense that it uses the PARIS algorithm in the inner loop of particle Gibbs to form a bias-reduced version of the targeted quantities. We substantiate the PPG algorithm with theoretical results, including new bounds on bias and variance as well as deviation inequalities. We illustrate our theoretical results with numerical experiments supporting our claims.

Place, publisher, year, edition, pages
Statistica Sinica (Institute of Statistical Science), 2024
Keywords
Bias reduction, particle filters, particle Gibbs, sequential Monte Carlo, smoothing of additive functionals, state space smoothing
National Category
Mathematics
Identifiers
urn:nbn:se:kth:diva-346828 (URN)10.5705/ss.202022.0215 (DOI)001224106500007 ()2-s2.0-85192688896 (Scopus ID)
Note

QC 20260119

Available from: 2024-05-24 Created: 2024-05-24 Last updated: 2026-01-19Bibliographically approved
Zhang, T., Bokrantz, R. & Olsson, J. (2023). A similarity-based Bayesian mixture-of-experts model. Statistics and computing, 33(4), Article ID 83.
Open this publication in new window or tab >>A similarity-based Bayesian mixture-of-experts model
2023 (English)In: Statistics and computing, ISSN 0960-3174, E-ISSN 1573-1375, Vol. 33, no 4, article id 83Article in journal (Refereed) Published
Abstract [en]

We present a new nonparametric mixture-of-experts model for multivariate regression problems, inspired by the probabilistic k-nearest neighbors algorithm. Using a conditionally specified model, predictions for out-of-sample inputs are based on similarities to each observed data point, yielding predictive distributions represented by Gaussian mixtures. Posterior inference is performed on the parameters of the mixture components as well as the distance metric using a mean-field variational Bayes algorithm accompanied with a stochastic gradient-based optimization procedure. The proposed method is especially advantageous in settings where inputs are of relatively high dimension in comparison to the data size, where input-output relationships are complex, and where predictive distributions may be skewed or multimodal. Computational studies on five datasets, of which two are synthetically generated, illustrate clear advantages of our mixture-of-experts method for high-dimensional inputs, outperforming competitor models both in terms of validation metrics and visual inspection.

Place, publisher, year, edition, pages
Springer Nature, 2023
Keywords
Mixture-of-experts, Nonparametric Bayesian regression, k-nearest neighbors, Pseudolikelihood, Variational inference, Reparameterization trick
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:kth:diva-329452 (URN)10.1007/s11222-023-10238-y (DOI)000998699100001 ()2-s2.0-85160424509 (Scopus ID)
Note

QC 20230621

Available from: 2023-06-21 Created: 2023-06-21 Last updated: 2023-06-21Bibliographically approved
Mastrototaro, A. & Olsson, J. (2023). Adaptive online variance estimation in particle filters: the ALVar estimator. Statistics and computing, 33(4), Article ID 77.
Open this publication in new window or tab >>Adaptive online variance estimation in particle filters: the ALVar estimator
2023 (English)In: Statistics and computing, ISSN 0960-3174, E-ISSN 1573-1375, Vol. 33, no 4, article id 77Article in journal (Refereed) Published
Abstract [en]

We present a new approach-the ALVar estimator-to estimation of asymptotic variance in sequential Monte Carlo methods, or, particle filters. The method, which adjusts adaptively the lag of the estimator proposed in Olsson and Douc (Bernoulli 25(2):1504-1535) applies to very general distribution flows and particle filters, including auxiliary particle filters with adaptive resampling. The algorithm operates entirely online, in the sense that it is able to monitor the variance of the particle filter in real time and with, on the average, constant computational complexity and memory requirements per iteration. Crucially, it does not require the calibration of any algorithmic parameter. Estimating the variance only on the basis of the genealogy of the propagated particle cloud, without additional simulations, the routine requires only minor code additions to the underlying particle algorithm. Finally, we prove that the ALVar estimator is consistent for the true asymptotic variance as the number of particles tends to infinity and illustrate numerically its superiority to existing approaches.

Place, publisher, year, edition, pages
Springer Nature, 2023
Keywords
Central limit theorem, Particle filter, Sequential Monte Carlo, Variance estimation
National Category
Control Engineering
Identifiers
urn:nbn:se:kth:diva-328300 (URN)10.1007/s11222-023-10243-1 (DOI)000983688800002 ()2-s2.0-85158950018 (Scopus ID)
Note

QC 20230607

Available from: 2023-06-07 Created: 2023-06-07 Last updated: 2024-11-13Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-0772-846X

Search in DiVA

Show all publications