kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Girdzijauskas, SarunasORCID iD iconorcid.org/0000-0003-4516-7317
Alternative names
Publications (10 of 101) Show all publications
Cornell, F., Jin, Y., Karlgren, J. & Girdzijauskas, S. (2025). Are We Wasting Time? A Fast, Accurate Performance Evaluation Framework for Knowledge Graph Link Predictors. In: Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025: . Paper presented at 41st IEEE International Conference on Data Engineering, ICDE 2025, Hong Kong, China, May 19-23, 2025 (pp. 1650-1663). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Are We Wasting Time? A Fast, Accurate Performance Evaluation Framework for Knowledge Graph Link Predictors
2025 (English)In: Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 1650-1663Conference paper, Published paper (Refereed)
Abstract [en]

The standard evaluation protocol for measuring the quality of Knowledge Graph Completion methods - the task of inferring new links to be added to a graph - typically involves a step which ranks every entity of a Knowledge Graph to assess their fit as a head or tail of a candidate link to be added. In Knowledge Graphs on a larger scale, this task rapidly becomes prohibitively heavy. Previous approaches mitigate this problem by using random sampling of entities to assess the quality of links predicted or suggested by a method. However, we show that this approach has serious limitations since the ranking metrics produced do not properly reflect true outcomes. In this paper, we present a thorough analysis of these effects along with the following findings. First, we empirically find and theoretically motivate why sampling uniformly at random vastly overestimates the ranking performance of a method. We show that this can be attributed to the effect of easy versus hard negatives. Second, we propose a framework that uses relational recommenders to guide the selection of candidates for evaluation. We provide both theoretical and empirical justification of our methodology, and find that simple and fast methods work extremely well, matching advanced neural approaches. Even when a large portion of the true candidates for a property are missed, the estimation of the ranking metrics on a downstream model barely deteriorates. With our proposed framework, we can reduce the time and computation needed similar to random sampling strategies while vastly improving the estimation; on ogbl-wikikg2, we show that accurate estimations of the full ranking can be obtained in 20 seconds instead of 30 minutes. We conclude that considerable computational effort can be saved by effective preprocessing and sampling methods and still reliably predict performance accurately of the true performance for the entire ranking procedure. We make our code available to the community1. 1Accessible at https://github.com/Filco306/are-we-wasting-time.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
evaluation, Knowledge Graph, link prediction, sampling
National Category
Probability Theory and Statistics Computer Sciences
Identifiers
urn:nbn:se:kth:diva-370773 (URN)10.1109/ICDE65448.2025.00127 (DOI)2-s2.0-105015357159 (Scopus ID)
Conference
41st IEEE International Conference on Data Engineering, ICDE 2025, Hong Kong, China, May 19-23, 2025
Note

Part of ISBN 9798331536039

QC 20251001

Available from: 2025-10-01 Created: 2025-10-01 Last updated: 2025-10-01Bibliographically approved
Samy, A. E., Giaretta, L. & Girdzijauskas, S. (2025). HopNet: Addressing Over-Squashing with Learnable Rewiring in GNNs.
Open this publication in new window or tab >>HopNet: Addressing Over-Squashing with Learnable Rewiring in GNNs
2025 (English)Manuscript (preprint) (Other academic)
Abstract [en]

Graph Neural Networks (GNNs) have emerged as powerful tools for extracting insights from graph-structured data. However, GNNs often encounter challenges when aggregating information across distant and constrained connections, leading to performance degradation due to a phenomenon known as over-squashing. Previous solutions, such as static rewiring, address this issue by increasing graph density but compromise the inherent inductive bias of GNNs with respect to node distances. In this study, we introduce HopNet, a novel model designed to facilitate effective long-range information propagation through learnable rewiring. HopNet employs attention mechanisms to dynamically create targeted shortcuts, enabling efficient communication between distant nodes while maintaining a balance between local and global interactions. Extensive experiments across diverse tasks, including node classification, link prediction, graph regression, and graph classification, on established real-world benchmark datasets demonstrate HopNet’s ability to overcome GNN limitations, consistently achieving superior performance over state-of-the-art methods.

Keywords
Graph Neural Networks, Bottlenecks, Over-Squashing, Learnable Rewiring.
National Category
Artificial Intelligence
Identifiers
urn:nbn:se:kth:diva-370615 (URN)
Note

Submitted to ACM WebConference 2026

QC 20250929

Available from: 2025-09-29 Created: 2025-09-29 Last updated: 2025-10-21Bibliographically approved
Komini, V. & Girdzijauskas, S. (2025). Integrating Logit Space Embeddings for Reliable Out-of-Distribution Detection. In: Machine Learning, Optimization, and Data Science - 10th International Conference, LOD 2024, Revised Selected Papers: . Paper presented at 10th International Conference on Machine Learning, Optimization, and Data Science, LOD 2024, Castiglione della Pescaia, Italy, September 22-25, 2024 (pp. 255-269). Springer Nature
Open this publication in new window or tab >>Integrating Logit Space Embeddings for Reliable Out-of-Distribution Detection
2025 (English)In: Machine Learning, Optimization, and Data Science - 10th International Conference, LOD 2024, Revised Selected Papers, Springer Nature , 2025, p. 255-269Conference paper, Published paper (Refereed)
Abstract [en]

Deep learning (DL) models have significantly transformed machine learning (ML), particularly with their prowess in classification tasks. However, these models struggle to differentiate between in-distribution (ID) and out-of-distribution (OOD) data at the testing phase. This challenge has curtailed their deployment in sensitive fields like biotechnology, where misidentifying OOD data, such as unclear or unknown bacterial genomic sequences, as known ID classes could lead to dire consequences. To address this, we propose an approach to make DL models OOD-sensitive by exploiting the configuration of the logit space embeddings, into the model’s decision-making process. Leveraging the effect observed in recent studies that there is minimal overlap between the embeddings of ID and OOD data, we use a density estimator to model the ID logit distribution based on the training data. This allows us to reliably flag data that do not match the ID distribution as OOD. Our methodology is designed to be independent of the specific data or model architecture and can seamlessly augment existing trained models without the need to expose them to OOD data. Testing our method on widely recognized image datasets, we achieve leading-edge results, including a substantial 10% enhancement in the area under the receiver operating characteristic curve (AUCROC) on the Google genome dataset.

Place, publisher, year, edition, pages
Springer Nature, 2025
National Category
Computer Sciences Bioinformatics (Computational Biology) Software Engineering
Identifiers
urn:nbn:se:kth:diva-361973 (URN)10.1007/978-3-031-82484-5_19 (DOI)001530956900019 ()2-s2.0-105000982628 (Scopus ID)
Conference
10th International Conference on Machine Learning, Optimization, and Data Science, LOD 2024, Castiglione della Pescaia, Italy, September 22-25, 2024
Note

Part of ISBN 9783031824838

QC 20250404

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-12-08Bibliographically approved
Samy, A. E., Kefato, Z. T. & Girdzijauskas, Š. (2025). Leap: Inductive Link Prediction via Learnable Topology Augmentation. In: Machine Learning, Optimization, and Data Science - 10th International Conference, LOD 2024, Revised Selected Papers: . Paper presented at 10th International Conference on Machine Learning, Optimization, and Data Science, LOD 2024, Castiglione della Pescaia, Italy, Sep 22 2024 - Sep 25 2024 (pp. 448-463). Springer Nature
Open this publication in new window or tab >>Leap: Inductive Link Prediction via Learnable Topology Augmentation
2025 (English)In: Machine Learning, Optimization, and Data Science - 10th International Conference, LOD 2024, Revised Selected Papers, Springer Nature , 2025, p. 448-463Conference paper, Published paper (Refereed)
Abstract [en]

Link prediction is a crucial task in many downstream applications of graph machine learning. To this end, Graph Neural Network (GNN) is a widely used technique for link prediction, mainly in transductive settings, where the goal is to predict missing links between existing nodes. However, many real-life applications require an inductive setting that accommodates for new nodes, coming into an existing graph. Thus, recently inductive link prediction has attracted considerable attention, and a multi-layer perceptron (MLP) is the popular choice of most studies to learn node representations. However, these approaches have limited expressivity and do not fully capture the graph’s structural signal. Therefore, in this work we propose LEAP, an inductive link prediction method based on LEArnable toPology augmentation. Unlike previous methods, LEAP models the inductive bias from both the structure and node features, and hence is more expressive. To the best of our knowledge, this is the first attempt to provide structural contexts for new nodes via learnable augmentation in inductive settings. Extensive experiments on seven real-world homogeneous and heterogeneous graphs demonstrates that LEAP significantly surpasses SOTA methods. The improvements are up to 22% and 17% in terms of AUC and average precision, respectively. The code and datasets are available on GitHub (1https://github.com/AhmedESamy/LEAP/).

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Graph Neural Networks, Heterogeneous graphs, Inductive link prediction, Learnable augmentation
National Category
Computer Sciences Applied Mechanics
Identifiers
urn:nbn:se:kth:diva-361992 (URN)10.1007/978-3-031-82481-4_31 (DOI)001530948600031 ()2-s2.0-105000770530 (Scopus ID)
Conference
10th International Conference on Machine Learning, Optimization, and Data Science, LOD 2024, Castiglione della Pescaia, Italy, Sep 22 2024 - Sep 25 2024
Note

Part of ISBN 9783031824807

QC 20250403

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-12-05Bibliographically approved
Listo Zec, E., Hagander, T., Ihre-Thomason, E. & Girdzijauskas, S. (2025). On the effects of similarity metrics in decentralized deep learning under distributional shift. Transactions on Machine Learning Research, 2025-January, 1-23
Open this publication in new window or tab >>On the effects of similarity metrics in decentralized deep learning under distributional shift
2025 (English)In: Transactions on Machine Learning Research, E-ISSN 2835-8856, Vol. 2025-January, p. 1-23Article in journal (Refereed) Published
Abstract [en]

Decentralized Learning (DL) enables privacy-preserving collaboration among organizations or users to enhance the performance of local deep learning models. However, model aggregation becomes challenging when client data is heterogeneous, and identifying compatible collaborators without direct data exchange remains a pressing issue. In this paper, we investigate the effectiveness of various similarity metrics in DL for identifying peers for model merging, conducting an empirical analysis across multiple datasets with distribution shifts. Our research provides insights into the performance of these metrics, examining their role in facilitating effective collaboration. By exploring the strengths and limitations of these metrics, we contribute to the development of robust DL methods.

Place, publisher, year, edition, pages
Transactions on Machine Learning Research, 2025
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-361192 (URN)2-s2.0-85219582623 (Scopus ID)
Note

QC 20250313

Available from: 2025-03-12 Created: 2025-03-12 Last updated: 2025-03-13Bibliographically approved
Jin, Y., Maatouk, A., Girdzijauskas, S., Xu, S., Tassiulas, L. & Ying, R. (2025). SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate. In: 2025 IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025: . Paper presented at 2nd IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025, Barcelona, Spain, May 26 2025 - May 29 2025. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate
Show others...
2025 (English)In: 2025 IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025Conference paper, Published paper (Refereed)
Abstract [en]

Wireless ray-tracing (RT) is emerging as a key tool for three-dimensional (3D) wireless channel modeling, driven by advances in graphical rendering. Current approaches struggle to accurately model beyond 5G (B5G) network signaling, which often operates at higher frequencies and is more susceptible to environmental conditions and changes. Existing online learning solutions require real-time interaction with radio environment during training, which is both costly and incompatible with GPU-based processing. In response, we propose a novel approach that redefines ray trajectory generation as a sequential decision-making problem, solved with the proposed Scene-Aware Neural Decision Wireless Channel Raytracing Hierarchy (SANDWICH) approach. The SANDWICH approach leverages a decision transformer to jointly learn the optical, physical, and signal properties within each designated environment in a fully differentiable approach, which can be trained entirely on GPUs. SANDWICH offers superior performance compared to existing online learning methods, and outperforms the baseline by 4e<sup>-2</sup> rad in RT accuracy. Furthermore, channel gain estimation w.r.t predicted trajectory only fades 0.5 dB away from using ground truth wireless RT result for channel gain estimation.<sup>2</sup>

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Channel Generation, Channel Modeling, RF Sensing, Wireless Raytracing
National Category
Computer Sciences Communication Systems
Identifiers
urn:nbn:se:kth:diva-371723 (URN)10.1109/ICMLCN64995.2025.11139897 (DOI)2-s2.0-105016789661 (Scopus ID)
Conference
2nd IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025, Barcelona, Spain, May 26 2025 - May 29 2025
Note

Part of ISBN 979-8-3315-2042-7

QC 20251022

Available from: 2025-10-22 Created: 2025-10-22 Last updated: 2025-11-10Bibliographically approved
Jin, Y., Maatouk, A., Girdzijauskas, S., Xu, S., Tassiulas, L. & Ying, R. (2025). SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate. In: 2025 IEEE International Conference On Machine Learning For Communication And Networking, Icmlcn: . Paper presented at 2025 International Conference on Machine Learning for Communication and Networking-ICMLCN-Annual, MAY 26-29, 2025, Barcelona, SPAIN. IEEE
Open this publication in new window or tab >>SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate
Show others...
2025 (English)In: 2025 IEEE International Conference On Machine Learning For Communication And Networking, Icmlcn, IEEE , 2025Conference paper, Published paper (Refereed)
Abstract [en]

Wireless ray-tracing (RT) is emerging as a key tool for three-dimensional (3D) wireless channel modeling, driven by advances in graphical rendering. Current approaches struggle to accurately model beyond 5G (B5G) network signaling, which often operates at higher frequencies and is more susceptible to environmental conditions and changes. Existing online learning solutions require real-time interaction with radio environment during training, which is both costly and incompatible with GPU-based processing. In response, we propose a novel approach that redefines ray trajectory generation as a sequential decision-making problem, solved with the proposed Scene-Aware Neural Decision Wireless Channel Raytracing Hierarchy (SANDWICH) approach. The SANDWICH approach leverages a decision transformer to jointly learn the optical, physical, and signal properties within each designated environment in a fully differentiable approach, which can be trained entirely on GPUs. SANDWICH offers superior performance compared to existing online learning methods, and outperforms the baseline by 4e-2 rad in RT accuracy. Furthermore, channel gain estimation w.r.t predicted trajectory only fades 0.5 dB away from using ground truth wireless RT result for channel gain estimation.2 Index Terms-Wireless Raytracing, RF Sensing, Channel Modeling, Channel Generation

Place, publisher, year, edition, pages
IEEE, 2025
Keywords
Wireless Raytracing, RF Sensing, Channel Modeling, Channel Generation
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:kth:diva-375653 (URN)10.1109/ICMLCN64995.2025.11139897 (DOI)001576278800008 ()2-s2.0-105016789661 (Scopus ID)979-8-3315-2043-4 (ISBN)979-8-3315-2042-7 (ISBN)
Conference
2025 International Conference on Machine Learning for Communication and Networking-ICMLCN-Annual, MAY 26-29, 2025, Barcelona, SPAIN
Note

QC 20260116

Available from: 2026-01-16 Created: 2026-01-16 Last updated: 2026-01-16Bibliographically approved
Kralj, I., Giaretta, L., Ježić, G., Žarko, I. P. & Girdzijauskas, Š. (2025). Semi-decentralized Training of Spatio-Temporal Graph Neural Networks for Traffic Prediction. In: Proceedings - 2025 IEEE International Conference on Edge Computing and Communications, EDGE 2025: . Paper presented at 2025 IEEE International Conference on Edge Computing and Communications, EDGE 2025, Helsinki, Finland, July 7-12, 2025 (pp. 147-155). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Semi-decentralized Training of Spatio-Temporal Graph Neural Networks for Traffic Prediction
Show others...
2025 (English)In: Proceedings - 2025 IEEE International Conference on Edge Computing and Communications, EDGE 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 147-155Conference paper, Published paper (Refereed)
Abstract [en]

In smart mobility, large networks of geographically distributed sensors produce vast amounts of high-frequency spatio-temporal data that must be processed in real time to avoid major disruptions. Traditional centralized approaches are increasingly unsuitable to this task, as they struggle to scale with expanding sensor networks, and reliability issues in central components can easily affect the whole deployment. To address these challenges, we explore and adapt semi-decentralized training techniques for Spatio-Temporal Graph Neural Networks (ST-GNNs) in the smart mobility domain. We implement a simulation framework where sensors are grouped by proximity into multiple cloudlets, each handling a subgraph of the traffic graph, fetching node features from other cloudlets to train its own local ST-GNN model, and exchanging model updates with other cloudlets to ensure consistency, enhancing scalability and removing reliance on a centralized aggregator. We perform extensive comparative evaluation of four different ST-GNN training setups - centralized, traditional FL, server-free FL, and Gossip Learning - on large-scale traffic datasets, the METR-LA and PeMS-BAY datasets, for short-, mid-, and long-term vehicle speed predictions. Experimental results show that semi-decentralized setups are comparable to centralized approaches in performance metrics, while offering advantages in terms of scalability and fault tolerance. In addition, we highlight often overlooked issues in existing literature for distributed ST-GNNs, such as the variation in model performance across different geographical areas due to region-specific traffic patterns, and the significant communication overhead and computational costs. However, due to the planar nature of graphs, per-cloudlet costs remain consistent as the network grows, unlike the growing costs in a centralized approach.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
semi-decentralized training, ST-GNN, traffic prediction
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-370835 (URN)10.1109/EDGE67623.2025.00025 (DOI)2-s2.0-105015719072 (Scopus ID)
Conference
2025 IEEE International Conference on Edge Computing and Communications, EDGE 2025, Helsinki, Finland, July 7-12, 2025
Note

Part of ISBN 9798331555597

QC 20251003

Available from: 2025-10-03 Created: 2025-10-03 Last updated: 2025-10-03Bibliographically approved
Komini, V., Koriakina, N., Roy, D. & Girdzijauskas, S. (2025). Similarity Learning for Spectral Clustering. In: Discovery Science - 28th International Conference, DS 2025, Proceedings: . Paper presented at 28th International Conference on Discovery Science, DS 2025, Ljubljana, Slovenia, September 23-25, 2025 (pp. 207-221). Springer Nature
Open this publication in new window or tab >>Similarity Learning for Spectral Clustering
2025 (English)In: Discovery Science - 28th International Conference, DS 2025, Proceedings, Springer Nature , 2025, p. 207-221Conference paper, Published paper (Refereed)
Abstract [en]

Spectral clustering is a widely adopted method capable of identifying complicated cluster boundaries. However, traditional spectral clustering requires the definition of a predefined similarity metric for constructing the Laplacian matrix, a requirement that limits flexibility and adaptability. Instead of predefining this metric upfront as a fixed parametric function, we introduce a novel approach that learns the optimal parameters of a similarity function through parameter optimization. This optimizes a similarity function to assign high similarity values to data pairs with shared discriminative features and low values to those without such features. Previous methods that adapt similarity measures typically treat their parameters as hyperparameters or rely on non-convex optimization strategies. However, these approaches are not well-suited for unsupervised scenarios, as they depend heavily on initial conditions and require labeled data for validation, which is unavailable in such settings. In contrast, our method employs convex optimization to learn the parameters of the similarity metrics directly, rather than treating them as hyperparameters. This enables robust and reliable unsupervised learning, making our approach particularly well-suited for spectral clustering. We validate the effectiveness and adaptability of our method on several benchmark datasets, demonstrating superior performance compared to existing techniques.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Similarity Learning, Spectral Clustering
National Category
Computer Sciences Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-372798 (URN)10.1007/978-3-032-05461-6_14 (DOI)2-s2.0-105020024626 (Scopus ID)
Conference
28th International Conference on Discovery Science, DS 2025, Ljubljana, Slovenia, September 23-25, 2025
Note

Part of ISBN 9783032054609

QC 20251118

Available from: 2025-11-18 Created: 2025-11-18 Last updated: 2025-11-18Bibliographically approved
Cornell, F., Jin, Y., Karlgren, J. & Girdzijauskas, S. (2025). Unsupervised Ontology- and Taxonomy Construction Through Hyperbolic Relational Domains and Ranges. In: Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2023, Revised Selected Papers: . Paper presented at Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023, Turin, Italy, September 18-22, 2023 (pp. 339-348). Springer Nature
Open this publication in new window or tab >>Unsupervised Ontology- and Taxonomy Construction Through Hyperbolic Relational Domains and Ranges
2025 (English)In: Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2023, Revised Selected Papers, Springer Nature , 2025, p. 339-348Conference paper, Published paper (Refereed)
Abstract [en]

The sets of possible heads and tails for relations in a Knowledge Graph and a type taxonomy of participating entities are important aspects of a Knowledge Graph and constitute a large portion of a Knowledge Graph’s ontology. Making the ontology explicit helps to ensure consistency of the knowledge represented in the graph and allows for less costly maintenance and update of graph content. However, Knowledge Graphs are often conveniently formed without an explicitly described ontology, using only (head, relation, tail)-tuples. For such graphs without predefined structure, we propose learning an ontology and a hierarchical type taxonomy from the graph itself, taking advantage of the reciprocity of entity types and sets of possible heads (relational domains) and tails (relational ranges). Experiments on real-world datasets validate our approach and demonstrate the promise for leveraging machine learning methodologies to efficiently generate taxonomies and ontologies jointly for Knowledge Graphs (Our implementation can be found at https://tinyurl.com/5n6sw6yj).

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Hyperbolic learning, Knowledge Graphs, Ontology Learning, Taxonomy Learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-359658 (URN)10.1007/978-3-031-74633-8_23 (DOI)001437448200023 ()2-s2.0-85216103400 (Scopus ID)
Conference
Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023, Turin, Italy, September 18-22, 2023
Note

Part of ISBN 9783031746321

QC 20250207

Available from: 2025-02-06 Created: 2025-02-06 Last updated: 2025-12-05Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-4516-7317

Search in DiVA

Show all publications