kth.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Structured Representations for Explainable Deep Learning
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system, Robotik, perception och lärande, RPL.ORCID-id: 0000-0001-8152-767x
2023 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Deep learning has revolutionized scientific research and is being used to take decisions in increasingly complex scenarios. With growing power comes a growing demand for transparency and interpretability. The field of Explainable AI aims to provide explanations for the predictions of AI systems. The state of the art of AI explainability, however, is far from satisfactory. For example, in Computer Vision, the most prominent post-hoc explanation methods produce pixel-wise heatmaps over the input domain, which are meant to visualize the importance of individual pixels of an image or video. We argue that such dense attribution maps are poorly interpretable to non-expert users because of the domain in which explanations are formed - we may recognize shapes in a heatmap but they are just blobs of pixels. In fact, the input domain is closer to the raw data of digital cameras than to the interpretable structures that humans use to communicate, e.g. objects or concepts. In this thesis, we propose to move beyond dense feature attributions by adopting structured internal representations as a more interpretable explanation domain. Conceptually, our approach splits a Deep Learning model in two: the perception step that takes as input dense representations and the reasoning step that learns to perform the task at hand. At the interface between the two are structured representations that correspond to well-defined objects, entities, and concepts. These representations serve as the interpretable domain for explaining the predictions of the model, allowing us to move towards more meaningful and informative explanations. The proposed approach introduces several challenges, such as how to obtain structured representations, how to use them for downstream tasks, and how to evaluate the resulting explanations. The works included in this thesis address these questions, validating the approach and providing concrete contributions to the field. For the perception step, we investigate how to obtain structured representations from dense representations, whether by manually designing them using domain knowledge or by learning them from data without supervision. For the reasoning step, we investigate how to use structured representations for downstream tasks, from Biology to Computer Vision, and how to evaluate the learned representations. For the explanation step, we investigate how to explain the predictions of models that operate in a structured domain, and how to evaluate the resulting explanations. Overall, we hope that this work inspires further research in Explainable AI and helps bridge the gap between high-performing Deep Learning models and the need for transparency and interpretability in real-world applications.

Abstract [sv]

Deep Learning har revolutionerat den vetenskapliga forskningen och används för att fatta beslut i allt mer komplexa scenarier. Med växande makt kommer ett växande krav på transparens och tolkningsbarhet. Området Explainable AI syftar till att ge förklaringar till AI-systems förutsägelser. Prestandan hos existerande lösningar för AI-förklarbarhet är dock långt ifrån tillfredsställande.Till exempel, inom datorseendeområdet, producerar de mest framträdande post-hoc-förklaringsmetoderna pixelvisa värmekartor, som är avsedda att visualisera hur viktiga enskilda pixlar är i en bild eller video. Vi hävdar att sådana metoder är svårtolkade på grund av den domän där förklaringar bildas - vi kanske känner igen former i en värmekarta men de är bara pixlar. Faktum är att indatadomänen ligger närmare digitalkamerors rådata än de strukturer som människor använder för att kommunicera, t.ex. objekt eller koncept.I den här avhandlingen föreslår vi att vi går bortom täta egenskapsattributioner genom att använda strukturerade interna representationer som en mer tolkningsbar förklaringsdomän. Begreppsmässigt delar vårt tillvägagångssätt en Deep Learning-modell i två: perception-steget som tar täta representationer som indata och reasoning-steget som lär sig att utföra uppgiften. I gränssnittet mellan de två finns strukturerade representationer som motsvarar väldefinierade objekt, entiteter och begrepp. Dessa representationer fungerar som den tolkbara domänen för att förklara modellens förutsägelser, vilket gör att vi kan gå mot mer meningsfulla och informativa förklaringar.Det föreslagna tillvägagångssättet introducerar flera utmaningar, såsom hur man skapar strukturerade representationer, hur man använder dem för senare uppgifter och hur man utvärderar de resulterande förklaringarna. Forskningen som ingår i denna avhandling tar upp dessa frågor, validerar tillvägagångssättet och ger konkreta bidrag till området. För steget perception undersöker vi hur man får strukturerade representationer från täta representationer, antingen genom att manuellt designa dem med hjälp av domänkunskap eller genom att lära dem från data utan övervakning. För steget reasoning undersöker vi hur man använder strukturerade representationer för senare uppgifter, från biologi till datorseende, och hur man utvärderar de inlärda representationerna. För steget explanation undersöker vi hur man förklarar förutsägelserna för modeller som fungerar i en strukturerad domän, och hur man utvärderar de resulterande förklaringarna. Sammantaget hoppas vi att detta arbete inspirerar till ytterligare forskning inom Explainable AI och hjälper till att överbrygga klyftan mellan högpresterande Deep Learning-modeller och behovet av transparens och tolkningsbarhet i verkliga applikationer.

Ort, förlag, år, upplaga, sidor
Stockholm: KTH Royal Institute of Technology, 2023. , s. xi, 103
Serie
TRITA-EECS-AVL ; 2023:49
Nyckelord [en]
Explainable AI, Deep Learning, Self-supervised Learning, Transformers, Graph Networks, Computer Vision
Nyckelord [sv]
Explainable AI, Deep Learning, Self-supervised Learning, Transformers, Graph Networks, Computer Vision
Nationell ämneskategori
Datorgrafik och datorseende
Forskningsämne
Datalogi
Identifikatorer
URN: urn:nbn:se:kth:diva-326958ISBN: 978-91-8040-606-2 (tryckt)OAI: oai:DiVA.org:kth-326958DiVA, id: diva2:1757331
Disputation
2023-06-12, F3 https://kth-se.zoom.us/j/66725845533, Lindstedtsvägen 26, Stockholm, 14:00 (Engelska)
Opponent
Handledare
Forskningsfinansiär
Vetenskapsrådet, 2017-04609
Anmärkning

QC 20230516

Tillgänglig från: 2023-05-16 Skapad: 2023-05-16 Senast uppdaterad: 2025-02-07Bibliografiskt granskad
Delarbeten
1.
Posten kunde inte hittas. Det kan bero på att posten inte längre är tillgänglig eller att du har råkat ange ett felaktigt id i adressfältet.
2. GraphQA: Protein Model Quality Assessment using Graph Convolutional Networks
Öppna denna publikation i ny flik eller fönster >>GraphQA: Protein Model Quality Assessment using Graph Convolutional Networks
2020 (Engelska)Ingår i: Bioinformatics, ISSN 1367-4803, E-ISSN 1367-4811, Vol. 37, nr 3, s. 360-366Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Motivation

Proteins are ubiquitous molecules whose function in biological processes is determined by their 3D structure. Experimental identification of a protein’s structure can be time-consuming, prohibitively expensive, and not always possible. Alternatively, protein folding can be modeled using computational methods, which however are not guaranteed to always produce optimal results.

GraphQA is a graph-based method to estimate the quality of protein models, that possesses favorable properties such as representation learning, explicit modeling of both sequential and 3D structure, geometric invariance, and computational efficiency.

Results

GraphQA performs similarly to state-of-the-art methods despite using a relatively low number of input features. In addition, the graph network structure provides an improvement over the architecture used in ProQ4 operating on the same input features. Finally, the individual contributions of GraphQA components are carefully evaluated.

Availability and implementation

PyTorch implementation, datasets, experiments, and link to an evaluation server are available through this GitHub repository: github.com/baldassarreFe/graphqa

Supplementary information

Supplementary material is available at Bioinformatics online.

Ort, förlag, år, upplaga, sidor
Oxford University Press, 2020
Nyckelord
graph neural networks, protein quality assessment
Nationell ämneskategori
Bioinformatik (beräkningsbiologi)
Forskningsämne
Datalogi
Identifikatorer
urn:nbn:se:kth:diva-284600 (URN)10.1093/bioinformatics/btaa714 (DOI)000667755400010 ()32780838 (PubMedID)2-s2.0-85105697201 (Scopus ID)
Forskningsfinansiär
Vetenskapsrådet, 2017-04609
Anmärkning

QC 20201118

Tillgänglig från: 2020-10-30 Skapad: 2020-10-30 Senast uppdaterad: 2023-05-16Bibliografiskt granskad
3. Explanation-Based Weakly-Supervised Learning of Visual Relations with Graph Networks
Öppna denna publikation i ny flik eller fönster >>Explanation-Based Weakly-Supervised Learning of Visual Relations with Graph Networks
2020 (Engelska)Ingår i: Proceedings, Part XXVIII Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Springer Nature , 2020, s. 612-630Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Visual relationship detection is fundamental for holistic image understanding. However, the localization and classification of (subject, predicate, object) triplets remain challenging tasks, due to the combinatorial explosion of possible relationships, their long-tailed distribution in natural images, and an expensive annotation process. This paper introduces a novel weakly-supervised method for visual relationship detection that relies on minimal image-level predicate labels. A graph neural network is trained to classify predicates in images from a graph representation of detected objects, implicitly encoding an inductive bias for pairwise relations. We then frame relationship detection as the explanation of such a predicate classifier, i.e. we obtain a complete relation by recovering the subject and object of a predicted predicate. We present results comparable to recent fully- and weakly-supervised methods on three diverse and challenging datasets: HICO-DET for human-object interaction, Visual Relationship Detection for generic object-to-object relations, and UnRel for unusual triplets; demonstrating robustness to non-comprehensive annotations and good few-shot generalization.

Ort, förlag, år, upplaga, sidor
Springer Nature, 2020
Serie
Lecture Notes in Computer Science book series ; 12373
Nyckelord
Computer vision, Image coding, Supervised learning, Combinatorial explosion, Graph neural networks, Graph representation, Human-object interaction, Long-tailed distributions, Object to objects, Supervised methods, Weakly supervised learning, Object detection
Nationell ämneskategori
Data- och informationsvetenskap
Identifikatorer
urn:nbn:se:kth:diva-290838 (URN)10.1007/978-3-030-58604-1_37 (DOI)2-s2.0-85097054926 (Scopus ID)
Konferens
Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020
Anmärkning

Part of ISBN 9783030586034

QC 20210323

Tillgänglig från: 2021-03-23 Skapad: 2021-03-23 Senast uppdaterad: 2023-05-16Bibliografiskt granskad
4. Towards Self-Supervised Learning of Global and Object-Centric Representations
Öppna denna publikation i ny flik eller fönster >>Towards Self-Supervised Learning of Global and Object-Centric Representations
2022 (Engelska)Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Self-supervision allows learning meaningful representations of natural images, which usually contain one central object. How well does it transfer to multi-entity scenes? We discuss key aspects of learning structured object-centric representations with self-supervision and validate our insights through several experiments on the CLEVR dataset. Regarding the architecture, we confirm the importance of competition for attention-based object discovery, where each image patch is exclusively attended by one object. For training, we show that contrastive losses equipped with matching can be applied directly in a latent space, avoiding pixel-based reconstruction. However, such an optimization objective is sensitive to false negatives (recurring objects) and false positives (matching errors). Careful consideration is thus required around data augmentation and negative sample selection.

Nationell ämneskategori
Datorgrafik och datorseende
Identifikatorer
urn:nbn:se:kth:diva-326853 (URN)
Konferens
ICLR Workshop on the Elements of Reasoning, Objects, Structure and Causality
Anmärkning

QC 20230516

Tillgänglig från: 2023-05-12 Skapad: 2023-05-12 Senast uppdaterad: 2025-02-07Bibliografiskt granskad
5. Quantitative Metrics for Evaluating Explanations of Video DeepFake Detectors
Öppna denna publikation i ny flik eller fönster >>Quantitative Metrics for Evaluating Explanations of Video DeepFake Detectors
2022 (Engelska)Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

The proliferation of DeepFake technology is a rising challenge in today’s society, owing to more powerful and accessible generation methods. To counter this, the research community has developed detectors of ever-increasing accuracy. However, the ability to explain the decisions of such models to users lags behind performance and is considered an accessory in large-scale benchmarks, despite being a crucial requirement for the correct deployment of automated tools for moderation and censorship. We attribute the issue to the reliance on qualitative comparisons and the lack of established metrics. We describe a simple set of metrics to evaluate the visual quality and informativeness of explanations of video DeepFake classifiers from a human-centric perspective. With these metrics, we compare common approaches to improve explanation quality and discuss their effect on both classification and explanation performance on the recent DFDC and DFD datasets.

Nationell ämneskategori
Datorgrafik och datorseende
Identifikatorer
urn:nbn:se:kth:diva-326957 (URN)
Konferens
33rd British Machine Vision Conference (BMVC)
Anmärkning

QC 20230516

Tillgänglig från: 2023-05-15 Skapad: 2023-05-15 Senast uppdaterad: 2025-02-07Bibliografiskt granskad
6. Variable Rate Allocation for Vector-Quantized Autoencoders
Öppna denna publikation i ny flik eller fönster >>Variable Rate Allocation for Vector-Quantized Autoencoders
2023 (Engelska)Ingår i: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers (IEEE) , 2023Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Vector-quantized autoencoders have recently gained interest in image compression, generation and self-supervised learning. However, as a neural compression method, they lack the possibility to allocate a variable number of bits to each image location, e.g. according to the semantic content or local saliency. In this paper, we address this limitation in a simple yet effective way. We adopt a product quantizer (PQ) that produces a set of discrete codes for each image patch rather than a single index. This PQ-autoencoder is trained end-to-end with a structured dropout that selectively masks a variable number of codes at each location. These mechanisms force the decoder to reconstruct the original image based on partial information and allow us to control the local rate. The resulting model can compress images on a wide range of operating points of the rate-distortion curve and can be paired with any external method for saliency estimation to control the compression rate at a local level. We demonstrate the effectiveness of our approach on the popular Kodak and ImageNet datasets by measuring both distortion and perceptual quality metrics.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2023
Nationell ämneskategori
Datorgrafik och datorseende
Identifikatorer
urn:nbn:se:kth:diva-326854 (URN)10.1109/ICASSP49357.2023.10095451 (DOI)2-s2.0-85168851171 (Scopus ID)
Konferens
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Anmärkning

QC 20230516

Tillgänglig från: 2023-05-12 Skapad: 2023-05-12 Senast uppdaterad: 2025-02-07Bibliografiskt granskad

Open Access i DiVA

kappa(30762 kB)1581 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 30762 kBChecksumma SHA-512
2d095bdfa3729c3f1c0adcdcd394a8e5d1e6967db35f49d59fbe2714b0162c929407c06112d22bac4b65e1f9609bf6c2387f62641d957d8cb4525658459af116
Typ fulltextMimetyp application/pdf

Person

Baldassarre, Federico

Sök vidare i DiVA

Av författaren/redaktören
Baldassarre, Federico
Av organisationen
Robotik, perception och lärande, RPL
Datorgrafik och datorseende

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 1581 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

isbn
urn-nbn

Altmetricpoäng

isbn
urn-nbn
Totalt: 1576 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf