kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (10 of 149) Show all publications
Hu, Y., Jiang, Y., Li, Z., Gao, F., Boström, H. & Wang, N. (2025). CADQ: Attribute-Consistent Face Cartoonization with Cross-modal Aligned and Deformable Quantization. In: PROCEEDINGS OF THE 33RD ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM 2025): . Paper presented at 2025 Conference on Multimedia-MM, OCT 27-31, 2025, Dublin, IRELAND (pp. 10681-10689). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>CADQ: Attribute-Consistent Face Cartoonization with Cross-modal Aligned and Deformable Quantization
Show others...
2025 (English)In: PROCEEDINGS OF THE 33RD ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM 2025), Association for Computing Machinery (ACM) , 2025, p. 10681-10689Conference paper, Published paper (Refereed)
Abstract [en]

Face cartoonization remains a challenging task due to significant geometric deformations between facial photos and cartoons, as well as the absence of paired training data for supervised learning. Existing methods struggle to generate high-quality cartoonized avatars with attribute consistency. To address this challenge, this paper proposes an unsupervised facial cartoonization method based on cross-domain aligned and deformable vector quantization (CADQ). Firstly, we construct textual descriptions with facial attributes for both photo datasets and cartoon collections. Attribute consistency during transformation is enforced through individually contrastive learning between image-text cross-modal features and globally distribution alignment across photo-cartoon domains. Secondly, a deformable Transformer with dual attention is introduced during the transformation process, which queries corresponding cartoon code-book entries based on image features to simulate cross-domain geometric deformations. Experimental results demonstrate that the proposed method can convert facial photos into high-quality cartoons with attribute consistency, outperforming existing state-of-the-art approaches. Furthermore, the method can be effectively extended to unsupervised cross-domain generation of other artistic portrait styles, achieving superior or highly competitive performance. Our code has been released at: https://github.com/IIP-Lab- XDU/CADQ.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2025
Keywords
Vector Quantization, Generative Adversarial Networks, Unsupervised Image-to-Image Translation, Contrastive Learning, Cross-modal Learning
National Category
Materials Chemistry
Identifiers
urn:nbn:se:kth:diva-378826 (URN)10.1145/3746027.3755840 (DOI)001671135600330 ()2-s2.0-105024072462 (Scopus ID)979-8-4007-2035-2 (ISBN)
Conference
2025 Conference on Multimedia-MM, OCT 27-31, 2025, Dublin, IRELAND
Note

QC 20260401

Available from: 2026-04-01 Created: 2026-04-01 Last updated: 2026-04-01Bibliographically approved
Horchidan, S.-F., Boström, H., Zeiher, F. & Carbone, P. (2025). ConANN: Conformal Approximate Nearest Neighbor Search. Proceedings of the VLDB Endowment, 19(1), 29-42
Open this publication in new window or tab >>ConANN: Conformal Approximate Nearest Neighbor Search
2025 (English)In: Proceedings of the VLDB Endowment, E-ISSN 2150-8097, Vol. 19, no 1, p. 29-42Article in journal (Refereed) Published
Abstract [en]

Approximate Nearest Neighbor (ANN) search is widely used in applications such as recommendation systems, search engines, and natural language processing. Indexing techniques like the Inverted File (IVF) offer efficiency at the cost of accuracy, yet lack formal mechanisms to quantify or control approximation error. Existing approaches that attempt to provide such guarantees typically rely on restrictive assumptions about underlying data distributions, which limits their generalizability. We introduce ConANN, the first framework to provide formal, distribution-free error guarantees for IVF-based ANN search by leveraging recent advances in Conformal Risk Control. Empirical evaluation across five standard benchmarks demonstrates that ConANN: (1) tightly controls approximation error, achieving a worst-case False Negative Rate deviation within 0.03 percentage points of the target; (2) provides formal guarantees without requiring expansion of the search space, and in some cases even reduces the number of probed clusters; (3) dynamically adapts the cluster probes required per query; and (4) incurs negligible overheads when compared to existing state-of-the-art baselines. ConANN is integrated into the FAISS vector search library, facilitating adoption in real-world ANN systems.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2025
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-378635 (URN)10.14778/3772181.3772184 (DOI)001679981300003 ()2-s2.0-105033220871 (Scopus ID)
Note

QC 20260327

Available from: 2026-03-27 Created: 2026-03-27 Last updated: 2026-04-14Bibliographically approved
Dong, G., Bresson, R. & Boström, H. (2025). Detecting Attacks with Conformal Test Martingales. In: Proceedings of the 14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025: . Paper presented at 14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, London, United Kingdom of Great Britain, Sep 10 2025 - Sep 12 2025 (pp. 758-761). ML Research Press, 266
Open this publication in new window or tab >>Detecting Attacks with Conformal Test Martingales
2025 (English)In: Proceedings of the 14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, ML Research Press , 2025, Vol. 266, p. 758-761Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
ML Research Press, 2025
Keywords
adversarial examples, conformal test martingales, whitebox attacks
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-370312 (URN)2-s2.0-105013957430 (Scopus ID)
Conference
14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, London, United Kingdom of Great Britain, Sep 10 2025 - Sep 12 2025
Note

QC 20250925

Available from: 2025-09-25 Created: 2025-09-25 Last updated: 2025-09-25Bibliographically approved
Dong, G., Bresson, R. & Boström, H. (2025). Detecting Attacks with Conformal Test Martingales. In: Luo, Z Nguyen, KA Papadopoulos, H Lofstrom, T Carlsson, L Bostrom, H (Ed.), Fourteenth Symposium On Conformal And Probabilistic Prediction With Applications, COPA 2025: . Paper presented at 14th Symposium on Conformal and Probabilistic Prediction with Applications-COPA, SEP 10-12, 2025, ENGLAND (pp. 758-761). JMLR-JOURNAL MACHINE LEARNING RESEARCH, 266
Open this publication in new window or tab >>Detecting Attacks with Conformal Test Martingales
2025 (English)In: Fourteenth Symposium On Conformal And Probabilistic Prediction With Applications, COPA 2025 / [ed] Luo, Z Nguyen, KA Papadopoulos, H Lofstrom, T Carlsson, L Bostrom, H, JMLR-JOURNAL MACHINE LEARNING RESEARCH , 2025, Vol. 266, p. 758-761Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2025
Series
Proceedings of Machine Learning Research, ISSN 2640-3498
Keywords
conformal test martingales, adversarial examples, whitebox attacks
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-376360 (URN)001595063100041 ()
Conference
14th Symposium on Conformal and Probabilistic Prediction with Applications-COPA, SEP 10-12, 2025, ENGLAND
Note

QC 20260209

Available from: 2026-02-09 Created: 2026-02-09 Last updated: 2026-02-09Bibliographically approved
Dong, G., Boström, H., Bresson, R. & Alkhatib, A. (2025). Explaining Deep Neural Networks with Example and Pixel Attribution. In: Discovery Science - 28th International Conference, DS 2025, Proceedings: . Paper presented at 28th International Conference on Discovery Science, DS 2025, Ljubljana, Slovenia, September 23-25, 2025 (pp. 286-300). Springer Nature
Open this publication in new window or tab >>Explaining Deep Neural Networks with Example and Pixel Attribution
2025 (English)In: Discovery Science - 28th International Conference, DS 2025, Proceedings, Springer Nature , 2025, p. 286-300Conference paper, Published paper (Refereed)
Abstract [en]

Most techniques for explainable machine learning focus on a single modality for the explanations, e.g., using either feature or example attribution. A novel approach, called Hybrid Attribution Network (HAN), is proposed for providing multimodal explanations for image classification. The technique first extracts embeddings from a deep neural network (DNN), which are subsequently used by a KNN classifier to form predictions; example attributions can then be derived from the latter. Based on the example attributions, pixel attributions are further generated to provide complementary feature-level explanations. Results from an empirical investigation show that HAN may provide highly concentrated example attributions, i.e., the predictions can be explained with few training examples, without compromising predictive performance relative to the original deep neural network. Moreover, the pixel attributions are shown to enhance the interpretability of the predictions, by highlighting key pixels in the example attributions. An important finding from the empirical investigation is that the choice of layer to use for the embeddings may have a large impact on both the predictive performance and the generated explanations.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Deep Neural Networks, Example attribution, Explainable AI, Pixel attribution
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-372788 (URN)10.1007/978-3-032-05461-6_19 (DOI)2-s2.0-105020009776 (Scopus ID)
Conference
28th International Conference on Discovery Science, DS 2025, Ljubljana, Slovenia, September 23-25, 2025
Note

Part of ISBN 9783032054609

QC 20251119

Available from: 2025-11-19 Created: 2025-11-19 Last updated: 2025-11-19Bibliographically approved
Kuschel, M., Alkhatib, A., Hasija, T. & Boström, H. (2025). Explaining Representations in Correlation-based Deep Multiview Representation Learning. In: Icassp 2025-2025 Ieee International Conference On Acoustics, Speech And Signal Processing, Icassp: . Paper presented at 2025 International Conference on Acoustics Speech and Signal Processing-ICASSP-Annual, APR 06-11, 2025, Hyderabad, INDIA. Institute of Electrical and Electronics Engineers (IEEE), Article ID 2208.
Open this publication in new window or tab >>Explaining Representations in Correlation-based Deep Multiview Representation Learning
2025 (English)In: Icassp 2025-2025 Ieee International Conference On Acoustics, Speech And Signal Processing, Icassp, Institute of Electrical and Electronics Engineers (IEEE) , 2025, article id 2208Conference paper, Published paper (Refereed)
Abstract [en]

Multiview representation learning techniques based on deep correlation maximization have become increasingly popular for learning meaningful and compact representations from multiview data. Even though their performance is state-of-the-art in many interpretability-critical fields, their blackbox behavior poses a problem and restricts their usability. To overcome this restriction, we propose XDCCA (eXplanations for Deep Canonical Correlation Analysis), an explanation strategy using characteristic rules in combination with SHAP that exploits the inherent structure of latent spaces created by correlation maximization techniques. We demonstrate how XDCCA allows for interpreting learned representations and their correlation using real medical time series and synthetic image data.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Series
International Conference on Acoustics Speech and Signal Processing ICASSP, ISSN 1520-6149
Keywords
Explainable AI, Multiview Representation Learning, Deep Canonical Correlation Analysis, CEGA, SHAP
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-378396 (URN)10.1109/ICASSP49660.2025.10888309 (DOI)001548470300732 ()2-s2.0-105003881835 (Scopus ID)979-8-3503-6875-8 (ISBN)979-8-3503-6874-1 (ISBN)
Conference
2025 International Conference on Acoustics Speech and Signal Processing-ICASSP-Annual, APR 06-11, 2025, Hyderabad, INDIA
Note

QC 20260327

Available from: 2026-03-27 Created: 2026-03-27 Last updated: 2026-04-14Bibliographically approved
Li, Z. & Boström, H. (2025). FlowGuard: Guarding Flow Matching via Conformal Sampling. In: Proceedings of the 14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025: . Paper presented at 14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, London, United Kingdom of Great Britain and Northern Ireland, Sep 10 2025 - Sep 12 2025 (pp. 775-777). ML Research Press, 266
Open this publication in new window or tab >>FlowGuard: Guarding Flow Matching via Conformal Sampling
2025 (English)In: Proceedings of the 14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, ML Research Press , 2025, Vol. 266, p. 775-777Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
ML Research Press, 2025
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-370311 (URN)001595063100046 ()2-s2.0-105013969243 (Scopus ID)
Conference
14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, London, United Kingdom of Great Britain and Northern Ireland, Sep 10 2025 - Sep 12 2025
Note

QC 20250925

Available from: 2025-09-25 Created: 2025-09-25 Last updated: 2026-02-03Bibliographically approved
Dong, G., Boström, H., Vazirgiannis, M. & Bresson, R. (2025). Obtaining Example-Based Explanations from Deep Neural Networks. In: Advances in Intelligent Data Analysis XXIII - 23rd International Symposium on Intelligent Data Analysis, IDA 2025, Proceedings: . Paper presented at 23rd International Symposium on Intelligent Data Analysis, IDA 2025, Konstanz, Germany, May 7 2025 - May 9 2025 (pp. 432-443). Springer Nature
Open this publication in new window or tab >>Obtaining Example-Based Explanations from Deep Neural Networks
2025 (English)In: Advances in Intelligent Data Analysis XXIII - 23rd International Symposium on Intelligent Data Analysis, IDA 2025, Proceedings, Springer Nature , 2025, p. 432-443Conference paper, Published paper (Refereed)
Abstract [en]

Most techniques for explainable machine learning focus on feature attribution, i.e., values are assigned to the features such that their sum equals the prediction. Example attribution is another form of explanation that assigns weights to the training examples, such that their scalar product with the labels equals the prediction. The latter may provide valuable complementary information to feature attribution, in particular in cases where the features are not easily interpretable. Current example-based explanation techniques have targeted a few model types only, such as k-nearest neighbors and random forests. In this work, a technique for obtaining example-based explanations from deep neural networks (EBE-DNN) is proposed. The basic idea is to use the deep neural network to obtain an embedding, which is employed by a k-nearest neighbor classifier to form a prediction; the example attribution can hence straightforwardly be derived from the latter. Results from an empirical investigation show that EBE-DNN can provide highly concentrated example attributions, i.e., the predictions can be explained with few training examples, without reducing accuracy compared to the original deep neural network. Another important finding from the empirical investigation is that the choice of layer to use for the embeddings may have a large impact on the resulting accuracy.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Deep neural networks, Example-based explanations, Explainable AI
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-363991 (URN)10.1007/978-3-031-91398-3_32 (DOI)001544948000032 ()2-s2.0-105005271603 (Scopus ID)
Conference
23rd International Symposium on Intelligent Data Analysis, IDA 2025, Konstanz, Germany, May 7 2025 - May 9 2025
Note

Part of ISBN 9783031913976

QC 20250605

Available from: 2025-06-02 Created: 2025-06-02 Last updated: 2025-12-08Bibliographically approved
Alkhatib, A., Bresson, R., Boström, H. & Vazirgiannis, M. (2025). Prediction via Shapley Value Regression. In: Proceedings of Machine Learning Research - International Conference on Machine Learning, ICML 2025: . Paper presented at 42nd International Conference on Machine Learning, ICML 2025, Vancouver, Canada, July 13-19, 2025. ML Research Press
Open this publication in new window or tab >>Prediction via Shapley Value Regression
2025 (English)In: Proceedings of Machine Learning Research - International Conference on Machine Learning, ICML 2025, ML Research Press , 2025Conference paper, Published paper (Refereed)
Abstract [en]

Shapley values have several desirable, theoretically well-supported, properties for explaining black-box model predictions. Traditionally, Shapley values are computed post-hoc, leading to additional computational cost at inference time. To overcome this, a novel method, called ViaSHAP, is proposed, that learns a function to compute Shapley values, from which the predictions can be derived directly by summation. Two approaches to implement the proposed method are explored; one based on the universal approximation theorem and the other on the Kolmogorov-Arnold representation theorem. Results from a large-scale empirical investigation are presented, showing that ViaSHAP using Kolmogorov-Arnold Networks performs on par with state-of-the-art algorithms for tabular data. It is also shown that the explanations of ViaSHAP are significantly more accurate than the popular approximator FastSHAP on both tabular data and images.

Place, publisher, year, edition, pages
ML Research Press, 2025
National Category
Computer Sciences Control Engineering
Identifiers
urn:nbn:se:kth:diva-373344 (URN)2-s2.0-105021828219 (Scopus ID)
Conference
42nd International Conference on Machine Learning, ICML 2025, Vancouver, Canada, July 13-19, 2025
Note

QC 20251202

Available from: 2025-12-02 Created: 2025-12-02 Last updated: 2025-12-02Bibliographically approved
Papadopoulos, H., An Nguyen, K., Luo, Z., Löfström, T., Carlsson, L. & Boström, H. (2025). Preface. In: Proceedings of Machine Learning Research: . Paper presented at 14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, London, 10 September 2025 - 12 September 2025 (pp. 1-5). ML Research Press, 266
Open this publication in new window or tab >>Preface
Show others...
2025 (English)In: Proceedings of Machine Learning Research, ML Research Press , 2025, Vol. 266, p. 1-5Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
ML Research Press, 2025
National Category
Other Engineering and Technologies
Identifiers
urn:nbn:se:kth:diva-370201 (URN)2-s2.0-105013962679 (Scopus ID)
Conference
14th Symposium on Conformal and Probabilistic Prediction with Applications, COPA 2025, London, 10 September 2025 - 12 September 2025
Note

QC 20251021

Available from: 2025-10-21 Created: 2025-10-21 Last updated: 2025-10-21Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-8382-0300

Search in DiVA

Show all publications