kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Evaluating the Robustness of ML Models to Out-of-Distribution Data Through Similarity Analysis
Mälardalen University, Västerås, Sweden; Saab AB, Linköping, Sweden.
Mälardalen University, Västerås, Sweden.ORCID iD: 0000-0002-0933-6059
Mälardalen University, Västerås, Sweden.ORCID iD: 0000-0001-6289-1521
KTH, School of Electrical Engineering and Computer Science (EECS), Electrical Engineering, Electronics and Embedded systems. Saab AB, Linköping, Sweden.ORCID iD: 0000-0001-9863-9985
2023 (English)In: New Trends in Database and Information Systems: ADBIS 2023 Short Papers, Doctoral Consortium and Workshops, Springer Nature , 2023, p. 348-359Conference paper, Published paper (Refereed)
Abstract [en]

In Machine Learning systems, several factors impact the performance of a trained model. The most important ones include model architecture, the amount of training time, the dataset size and diversity. We present a method for analyzing datasets from a use-case scenario perspective, detecting and quantifying out-of-distribution (OOD) data on dataset level. Our main contribution is the novel use of similarity metrics for the evaluation of the robustness of a model by introducing relative Fréchet Inception Distance (FID) and relative Kernel Inception Distance (KID) measures. These relative measures are relative to a baseline in-distribution dataset and are used to estimate how the model will perform on OOD data (i.e. estimate the model accuracy drop). We find a correlation between our proposed relative FID/relative KID measure and the drop in Average Precision (AP) accuracy on unseen data.

Place, publisher, year, edition, pages
Springer Nature , 2023. p. 348-359
Keywords [en]
accuracy estimation, datasets, neural networks, similarity metrics
National Category
Computer Sciences Computer graphics and computer vision Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:kth:diva-337818DOI: 10.1007/978-3-031-42941-5_30ISI: 001351054200030Scopus ID: 2-s2.0-85171979824OAI: oai:DiVA.org:kth-337818DiVA, id: diva2:1803567
Conference
Proceedings of the 27th European Conference on Advances in Databases and Information Systems, ADBIS 2023, Barcelona, Spain, Sep 4 2023 - Sep 7 2023
Note

Part of ISBN 9783031429408

QC 20231009

Available from: 2023-10-09 Created: 2023-10-09 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Daneshtalab, MasoudSöderquist, Ingemar

Search in DiVA

By author/editor
Forsberg, HåkanDaneshtalab, MasoudSöderquist, Ingemar
By organisation
Electronics and Embedded systems
Computer SciencesComputer graphics and computer visionBioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 42 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf