kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (3 of 3) Show all publications
Mehrpanah, A., Englesson, E. & Azizpour, H. (2025). On Spectral Properties of Gradient-Based Explanation Methods. In: Computer Vision – ECCV 2024 - 18th European Conference, Proceedings: . Paper presented at 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, Sep 29 2024 - Oct 4 2024 (pp. 282-299). Springer Nature
Open this publication in new window or tab >>On Spectral Properties of Gradient-Based Explanation Methods
2025 (English)In: Computer Vision – ECCV 2024 - 18th European Conference, Proceedings, Springer Nature , 2025, p. 282-299Conference paper, Published paper (Refereed)
Abstract [en]

Understanding the behavior of deep networks is crucial to increase our confidence in their results. Despite an extensive body of work for explaining their predictions, researchers have faced reliability issues, which can be attributed to insufficient formalism. In our research, we adopt novel probabilistic and spectral perspectives to formally analyze explanation methods. Our study reveals a pervasive spectral bias stemming from the use of gradient, and sheds light on some common design choices that have been discovered experimentally, in particular, the use of squared gradient and input perturbation. We further characterize how the choice of perturbation hyperparameters in explanation methods, such as SmoothGrad, can lead to inconsistent explanations and introduce two remedies based on our proposed formalism: (i) a mechanism to determine a standard perturbation scale, and (ii) an aggregation method which we call SpectralLens. Finally, we substantiate our theoretical results through quantitative evaluations.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Deep Neural Networks, Explainability, Gradient-based Explanation Methods, Probabilistic Machine Learning, Probabilistic Pixel Attribution Techniques, Spectral Analysis
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-357698 (URN)10.1007/978-3-031-73021-4_17 (DOI)001416940200017 ()2-s2.0-85210488897 (Scopus ID)
Conference
18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, Sep 29 2024 - Oct 4 2024
Note

Part of ISBN 978-303173020-7

QC 20241213

Available from: 2024-12-12 Created: 2024-12-12 Last updated: 2025-03-17Bibliographically approved
Rahimi, M., Mehrpanah, A., Mouchani, P., Rahimi, E. & Salaudeen, S. A. (2024). Optimizing Methane Uptake on N/O Functionalized Graphene via DFT, Machine Learning, and Uniform Manifold Approximation and Projection (UMAP) Techniques. Industrial & Engineering Chemistry Research, 63(44), 18940-18956
Open this publication in new window or tab >>Optimizing Methane Uptake on N/O Functionalized Graphene via DFT, Machine Learning, and Uniform Manifold Approximation and Projection (UMAP) Techniques
Show others...
2024 (English)In: Industrial & Engineering Chemistry Research, ISSN 0888-5885, E-ISSN 1520-5045, Vol. 63, no 44, p. 18940-18956Article in journal (Refereed) Published
Abstract [en]

Carbon materials possess active sites and functionalities on the surface that can attract prominent interest as solid adsorbents for diverse gas adsorption. This study aimed to predict the optimized methane uptake, adsorption energy (E ad), and adsorbent rediscovery through multitechniques of neural, regression, classifier ML-DFT, and Uniform Manifold Approximation and Projection (UMAP). Nitrogen and oxygen (N/O) functionalities and graphene, graphene oxide (GO), and N-doped GO were applied to the methane storage medium. Multi-ML algorithms were employed for the adsorption energy of CH4 uptake on (i) N/O functionalities such as pyridinic (N-py), carboxyl (O-II), oxidized (N-x), hydroxyl (O-h), Nitroso (N-ni), and Amine (primary, secondary, and tertiary). (ii) The graphene surfaces are decorated with N/O heteroatoms to construct graphene oxide (GO) and N-doped GO. The DFT calculations were applied by PW91 and the Dmol3 package. N/O-functionalities in the distance of similar to 2.0 to 3.1 & Aring; groups obtained E ad of approximately -2.0 to -4 eV. Further, ML models accomplished the forthcoming rediscovery of CH4 physisorption by using the multiadsorptive features of optimized adsorbents with an R 2 of 0.99. ML-derived sensitivity analysis approach was applied to specifications such as deformation adsorption energy, N/O functionality type, and optimized structure. CH4 adsorption specifications indicate sensitivity levels of -0.03 to 0.02 eV. The synergetic DFT/ML approaches distinguished the modeled and rediscovered phases of CH4 adsorption on N/O functional groups and graphene structures. UMAP is employed as a new adsorbent screening approach to play a complementary role in the ML modeling process.

Place, publisher, year, edition, pages
American Chemical Society (ACS), 2024
National Category
Chemical Sciences
Identifiers
urn:nbn:se:kth:diva-356503 (URN)10.1021/acs.iecr.4c02626 (DOI)001343842600001 ()2-s2.0-85208095047 (Scopus ID)
Note

QC 20241119

Available from: 2024-11-19 Created: 2024-11-19 Last updated: 2024-11-19Bibliographically approved
Englesson, E., Mehrpanah, A. & Azizpour, H. (2023). Logistic-Normal Likelihoods for Heteroscedastic Label Noise. Transactions on Machine Learning Research, 2023(8)
Open this publication in new window or tab >>Logistic-Normal Likelihoods for Heteroscedastic Label Noise
2023 (English)In: Transactions on Machine Learning Research, E-ISSN 2835-8856, Vol. 2023, no 8Article in journal (Refereed) Published
Abstract [en]

A natural way of estimating heteroscedastic label noise in regression is to model the observed (potentially noisy) target as a sample from a normal distribution, whose parameters can be learned by minimizing the negative log-likelihood. This formulation has desirable loss attenuation properties, as it reduces the contribution of high-error examples. Intuitively, this behavior can improve robustness against label noise by reducing overfitting. We propose an extension of this simple and probabilistic approach to classification that has the same desirable loss attenuation properties. Furthermore, we discuss and address some practical challenges of this extension. We evaluate the effectiveness of the method by measuring its robustness against label noise in classification. We perform enlightening experiments exploring the inner workings of the method, including sensitivity to hyperparameters, ablation studies, and other insightful analyses.

Place, publisher, year, edition, pages
Transactions on Machine Learning Research (TMLR), 2023
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-346451 (URN)2-s2.0-86000109470 (Scopus ID)
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

QC 20250325

Available from: 2024-05-15 Created: 2024-05-15 Last updated: 2025-03-25Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-6193-7126

Search in DiVA

Show all publications