kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (10 of 12) Show all publications
Zhou, T., Neumann, S., Garimella, K. & Gionis, A. (2024). Modeling the Impact of Timeline Algorithms on Opinion Dynamics Using Low-rank Updates. In: WWW 2024 - Proceedings of the ACM Web Conference: . Paper presented at 33rd ACM Web Conference, WWW 2024, Singapore, Singapore, May 13 2024 - May 17 2024 (pp. 2694-2702). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Modeling the Impact of Timeline Algorithms on Opinion Dynamics Using Low-rank Updates
2024 (English)In: WWW 2024 - Proceedings of the ACM Web Conference, Association for Computing Machinery (ACM) , 2024, p. 2694-2702Conference paper, Published paper (Refereed)
Abstract [en]

Timeline algorithms are key parts of online social networks, but during recent years they have been blamed for increasing polarization and disagreement in our society. Opinion-dynamics models have been used to study a variety of phenomena in online social networks, but an open question remains on how thesemodels can be augmented to take into account the fine-grained impact of user-level timeline algorithms. We make progress on this question by providing a way to model the impact of timeline algorithms on opinion dynamics. Specifically, we show how the popular Friedkin - Johnsen opinion-formation model can be augmented based on aggregate information, extracted from timeline data. We use our model to study the problem of minimizing the polarization and disagreement; we assume that we are allowed to make small changes to the users' timeline compositions by strengthening some topics of discussion and penalizing some others. We present a gradient descent-based algorithm for this problem, and show that under realistic parameter settings, our algorithm computes a (1+\varepsilon)-approximate solution in time∼\tO(m\sqrtn łg(1/\varepsilon)), where m∼is the number of edges in the graph and n∼is the number of vertices. We also present an algorithm that provably computes an \varepsilon-approximation of our model in near-linear time. We evaluate our method on real-world data and show that it effectively reduces the polarization and disagreement in the network. Finally, we release an anonymized graph dataset with ground-truth opinions and more than 27\,000∼nodes (the previously largest publicly available dataset contains less than 550∼nodes).

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2024
Keywords
disagreement, friedkin-johnsen model, opinion dynamics, polarization, social-network analysis
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-347331 (URN)10.1145/3589334.3645714 (DOI)2-s2.0-85194072203 (Scopus ID)
Conference
33rd ACM Web Conference, WWW 2024, Singapore, Singapore, May 13 2024 - May 17 2024
Note

QC 20240613

Part of ISBN 979-840070171-9

Available from: 2024-06-10 Created: 2024-06-10 Last updated: 2025-01-27Bibliographically approved
Neumann, S., Dong, Y. & Peng, P. (2024). Sublinear-Time Opinion Estimation in the Friedkin - Johnsen Model. In: WWW 2024 - Proceedings of the ACM Web Conference: . Paper presented at 33rd ACM Web Conference, WWW 2024, Singapore, May 13- May 17 2024 (pp. 2563-2571). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Sublinear-Time Opinion Estimation in the Friedkin - Johnsen Model
2024 (English)In: WWW 2024 - Proceedings of the ACM Web Conference, Association for Computing Machinery (ACM) , 2024, p. 2563-2571Conference paper, Published paper (Refereed)
Abstract [en]

Online social networks are ubiquitous parts of modern societies and the discussions that take place in these networks impact people's opinions on diverse topics, such as politics or vaccination. One of the most popular models to formally describe this opinion formation process is the Friedkin - Johnsen (FJ) model, which allows to define measures, such as the polarization and the disagreement of a network. Recently, Xu, Bao and Zhang (WebConf'21) showed that all opinions and relevant measures in the FJ model can be approximated in near-linear time. However, their algorithm requires the entire network and the opinions of all nodes as input. Given the sheer size of online social networks and increasing data-access limitations, obtaining the entirety of this data might, however, be unrealistic in practice. In this paper, we show that node opinions and all relevant measures, like polarization and disagreement, can be efficiently approximated in time that is sublinear in the size of the network. Particularly, our algorithms only require query-access to the network and do not have to preprocess the graph. Furthermore, we use a connection between FJ opinion dynamics and personalized PageRank, and show that in d-regular graphs, we can deterministically approximate each node's opinion by only looking at a constant-size neighborhood, independently of the network size. We also experimentally validate that our estimation algorithms perform well in practice.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2024
Keywords
friedkin - johnsen model, opinion formation, social-network analysis, sublinear time algorithms
National Category
Control Engineering
Identifiers
urn:nbn:se:kth:diva-347329 (URN)10.1145/3589334.3645572 (DOI)2-s2.0-85194107567 (Scopus ID)
Conference
33rd ACM Web Conference, WWW 2024, Singapore, May 13- May 17 2024
Note

Part pf ISBN: 979-840070171-9

QC 20240613

Available from: 2024-06-10 Created: 2024-06-10 Last updated: 2024-06-13Bibliographically approved
Tu, S., Neumann, S. & Gionis, A. (2023). Adversaries with Limited Information in the Friedkin-Johnsen Model. In: KDD 2023: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Paper presented at 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, Long Beach, United States of America, Aug 6 2023 - Aug 10 2023 (pp. 2201-2210). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Adversaries with Limited Information in the Friedkin-Johnsen Model
2023 (English)In: KDD 2023: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery (ACM) , 2023, p. 2201-2210Conference paper, Published paper (Refereed)
Abstract [en]

In recent years, online social networks have been the target of adversaries who seek to introduce discord into societies, to undermine democracies and to destabilize communities. Often the goal is not to favor a certain side of a conflict but to increase disagreement and polarization. To get a mathematical understanding of such attacks, researchers use opinion-formation models from sociology, such as the Friedkin - Johnsen model, and formally study how much discord the adversary can produce when altering the opinions for only a small set of users. In this line of work, it is commonly assumed that the adversary has full knowledge about the network topology and the opinions of all users. However, the latter assumption is often unrealistic in practice, where user opinions are not available or simply difficult to estimate accurately. To address this concern, we raise the following question: Can an attacker sow discord in a social network, even when only the network topology is known? We answer this question affirmatively. We present approximation algorithms for detecting a small set of users who are highly influential for the disagreement and polarization in the network. We show that when the adversary radicalizes these users and if the initial disagreement/polarization in the network is not very high, then our method gives a constant-factor approximation on the setting when the user opinions are known. To find the set of influential users, we provide a novel approximation algorithm for a variant of MaxCut in graphs with positive and negative edge weights. We experimentally evaluate our methods, which have access only to the network topology, and we find that they have similar performance as methods that have access to the network topology and all user opinions. We further present an NP-hardness proof, which was left as an open question by Chen and Racz [IEEE Transactions on Network Science and Engineering, 2021].

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2023
Keywords
graph algorithms, max-cut, opinion dynamics, polarization, social network analysis
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-337892 (URN)10.1145/3580305.3599255 (DOI)001118896302023 ()2-s2.0-85171327908 (Scopus ID)
Conference
29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, Long Beach, United States of America, Aug 6 2023 - Aug 10 2023
Note

Part of ISBN 9798400701030

QC 20231010

Available from: 2023-10-10 Created: 2023-10-10 Last updated: 2025-05-14Bibliographically approved
Foerster, K. T., Marette, T., Neumann, S., Plant, C., Sadikaj, Y., Schmid, S. & Velaj, Y. (2023). Analyzing the Communication Clusters in Datacenters. In: ACM Web Conference 2023: Proceedings of the World Wide Web Conference, WWW 2023. Paper presented at 2023 World Wide Web Conference, WWW 2023, Austin, United States of America, Apr 30 2023 - May 4 2023 (pp. 3022-3032). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Analyzing the Communication Clusters in Datacenters
Show others...
2023 (English)In: ACM Web Conference 2023: Proceedings of the World Wide Web Conference, WWW 2023, Association for Computing Machinery (ACM) , 2023, p. 3022-3032Conference paper, Published paper (Refereed)
Abstract [en]

Datacenter networks have become a critical infrastructure of our digital society and over the last years, great efforts have been made to better understand the communication patterns inside datacenters. In particular, existing empirical studies showed that datacenter traffic typically features much temporal and spatial structure, and that at any given time, some communication pairs interact much more frequently than others. This paper generalizes this study to communication groups and analyzes how clustered the datacenter traffic is, and how stable these clusters are over time. To this end, we propose a methodology which revolves around a biclustering approach, allowing us to identify groups of racks and servers which communicate frequently over the network. In particular, we consider communication patterns occurring in three different Facebook datacenters: a Web cluster consisting of web servers serving web traffic, a Database cluster which mainly consists of MySQL servers, and a Hadoop cluster. Interestingly, we find that in all three clusters, small groups of racks and servers can produce a large fraction of the network traffic, and we can determine these groups even when considering short snapshots of network traffic. We also show empirically that these clusters are fairly stable across time. Our insights on the size and stability of communication clusters hence uncover an interesting potential for resource optimizations in datacenter infrastructures.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2023
Keywords
Clustering, Data Center, Network Traffic
National Category
Computer Systems
Identifiers
urn:nbn:se:kth:diva-333332 (URN)10.1145/3543507.3583410 (DOI)2-s2.0-85159326653 (Scopus ID)
Conference
2023 World Wide Web Conference, WWW 2023, Austin, United States of America, Apr 30 2023 - May 4 2023
Note

Part of ISBN 9781450394161

QC 20230801

Available from: 2023-08-01 Created: 2023-08-01 Last updated: 2023-08-01Bibliographically approved
Henzinger, M., Neumann, S., Räcke, H. & Schmid, S. (2023). Dynamic Maintenance of Monotone Dynamic Programs and Applications. In: 40th International Symposium on Theoretical Aspects of Computer Science, STACS 2023: . Paper presented at 40th International Symposium on Theoretical Aspects of Computer Science, STACS 2023, Hamburg, Germany, Mar 7 2023 - Mar 9 2023. Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, Article ID 36.
Open this publication in new window or tab >>Dynamic Maintenance of Monotone Dynamic Programs and Applications
2023 (English)In: 40th International Symposium on Theoretical Aspects of Computer Science, STACS 2023, Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing , 2023, article id 36Conference paper, Published paper (Refereed)
Abstract [en]

Dynamic programming (DP) is one of the fundamental paradigms in algorithm design. However, many DP algorithms have to fill in large DP tables, represented by two-dimensional arrays, which causes at least quadratic running times and space usages. This has led to the development of improved algorithms for special cases when the DPs satisfy additional properties like, e.g., the Monge property or total monotonicity. In this paper, we consider a new condition which assumes (among some other technical assumptions) that the rows of the DP table are monotone. Under this assumption, we introduce a novel data structure for computing (1 + ϵ)-approximate DP solutions in near-linear time and space in the static setting, and with polylogarithmic update times when the DP entries change dynamically. To the best of our knowledge, our new condition is incomparable to previous conditions and is the first which allows to derive dynamic algorithms based on existing DPs. Instead of using two-dimensional arrays to store the DP tables, we store the rows of the DP tables using monotone piecewise constant functions. This allows us to store length-n DP table rows with entries in [0, W] using only polylog(n, W) bits, and to perform operations, such as (min, +)-convolution or rounding, on these functions in polylogarithmic time. We further present several applications of our data structure. For bicriteria versions of k-balanced graph partitioning and simultaneous source location, we obtain the first dynamic algorithms with subpolynomial update times, as well as the first static algorithms using only near-linear time and space. Additionally, we obtain the currently fastest algorithm for fully dynamic knapsack.

Place, publisher, year, edition, pages
Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, 2023
Keywords
data structures, dynamic algorithms, Dynamic programming
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-333386 (URN)10.4230/LIPIcs.STACS.2023.36 (DOI)001532693100036 ()2-s2.0-85149867993 (Scopus ID)
Conference
40th International Symposium on Theoretical Aspects of Computer Science, STACS 2023, Hamburg, Germany, Mar 7 2023 - Mar 9 2023
Note

Part of ISBN 9783959772662

QC 20230801

Available from: 2023-08-01 Created: 2023-08-01 Last updated: 2025-12-08Bibliographically approved
Coupette, C., Neumann, S. & Gionis, A. (2023). Reducing Exposure to Harmful Content via Graph Rewiring. In: KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining: . Paper presented at 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, Long Beach, United States of America, Aug 6 2023 - Aug 10 2023 (pp. 323-334). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Reducing Exposure to Harmful Content via Graph Rewiring
2023 (English)In: KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery (ACM) , 2023, p. 323-334Conference paper, Published paper (Refereed)
Abstract [en]

Most media content consumed today is provided by digital platforms that aggregate input from diverse sources, where access to information is mediated by recommendation algorithms. One principal challenge in this context is dealing with content that is considered harmful. Striking a balance between competing stakeholder interests, rather than block harmful content altogether, one approach is to minimize the exposure to such content that is induced specifically by algorithmic recommendations. Hence, modeling media items and recommendations as a directed graph, we study the problem of reducing the exposure to harmful content via edge rewiring. We formalize this problem using absorbing random walks, and prove that it is NP-hard and NP-hard to approximate to within an additive error, while under realistic assumptions, the greedy method yields a (1-1/e)-approximation. Thus, we introduce Gamine, a fast greedy algorithm that can reduce the exposure to harmful content with or without quality constraints on recommendations. By performing just 100 rewirings on YouTube graphs with several hundred thousand edges, Gamine reduces the initial exposure by 50%, while ensuring that its recommendations are at most 5% less relevant than the original recommendations. Through extensive experiments on synthetic data and real-world data from video recommendation and news feed applications, we confirm the effectiveness, robustness, and efficiency of Gamine in practice.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2023
Keywords
graph rewiring, random walks, recommendation graphs
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-338585 (URN)10.1145/3580305.3599489 (DOI)001118896300028 ()2-s2.0-85165195377 (Scopus ID)
Conference
29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, Long Beach, United States of America, Aug 6 2023 - Aug 10 2023
Note

Part of ISBN 9798400701030

QC 20231107

Available from: 2023-11-07 Created: 2023-11-07 Last updated: 2025-01-27Bibliographically approved
Marette, T., Miettinen, P. & Neumann, S. (2023). Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations. In: Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023, Proceedings, Part I. Paper presented at European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023 (pp. 743-758). Springer Nature, 14169
Open this publication in new window or tab >>Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations
2023 (English)In: Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023, Proceedings, Part I, Springer Nature , 2023, Vol. 14169, p. 743-758Conference paper, Published paper (Refereed)
Abstract [en]

Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping clusters, making visualization more complicated. In this paper, we study the problem of visualizing a given clustering of overlapping clusters in bipartite graphs and the related problem of visualizing Boolean Matrix Factorizations. We conceptualize three different objectives that any good visualization should satisfy: (1) proximity of cluster elements, (2) large consecutive areas of elements from the same cluster, and (3) large uninterrupted areas in the visualization, regardless of the cluster membership. We provide objective functions that capture these goals and algorithms that optimize these objective functions. Interestingly, in experiments on real-world datasets, we find that the best trade-off between these competing goals is achieved by a novel heuristic, which locally aims to place rows and columns with similar cluster membership next to each other.

Place, publisher, year, edition, pages
Springer Nature, 2023
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14169
National Category
Computer Systems
Identifiers
urn:nbn:se:kth:diva-338942 (URN)10.1007/978-3-031-43412-9_44 (DOI)001156137100044 ()2-s2.0-85174449746 (Scopus ID)
Conference
European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023
Note

Part of book ISBN 9783031434112

QC 20231101

Available from: 2023-10-31 Created: 2023-10-31 Last updated: 2024-03-05Bibliographically approved
Tu, S. & Neumann, S. (2022). A Viral Marketing-Based Model For Opinion Dynamics in Online Social Networks. In: Proceedings of the ACM Web Conference 2022 WWW'2022: . Paper presented at 31st ACM World Wide Web Conference, WWW 2022, Virtual/Online, 25-29 April 2022 (pp. 1570-1578). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>A Viral Marketing-Based Model For Opinion Dynamics in Online Social Networks
2022 (English)In: Proceedings of the ACM Web Conference 2022 WWW'2022, Association for Computing Machinery (ACM) , 2022, p. 1570-1578Conference paper, Published paper (Refereed)
Abstract [en]

Online social networks provide a medium for citizens to form opinions on different societal issues, and a forum for public discussion. They also expose users to viral content, such as breaking news articles. In this paper, we study the interplay between these two aspects: opinion formation and information cascades in online social networks. We present a new model that allows us to quantify how users change their opinion as they are exposed to viral content. Our model is a combination of the popular Friedkin-Johnsen model for opinion dynamics and the independent cascade model for information propagation. We present algorithms for simulating our model, and we provide approximation algorithms for optimizing certain network indices, such as the sum of user opinions or the disagreement-controversy index; our approach can be used to obtain insights into how much viral content can increase these indices in online social networks. Finally, we evaluate our model on real-world datasets. We show experimentally that marketing campaigns and polarizing contents have vastly different effects on the network: while the former have only limited effect on the polarization in the network, the latter can increase the polarization up to 59% even when only 0.5% of the users start sharing a polarizing content. We believe that this finding sheds some light into the growing segregation in today's online media.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2022
Keywords
online social networks, opinion dynamics, information spread
National Category
Human Computer Interaction Sociology (Excluding Social Work, Social Anthropology, Demography and Criminology)
Identifiers
urn:nbn:se:kth:diva-320994 (URN)10.1145/3485447.3512203 (DOI)000852713001061 ()2-s2.0-85129796448 (Scopus ID)
Conference
31st ACM World Wide Web Conference, WWW 2022, Virtual/Online, 25-29 April 2022
Note

Part of proceedings: ISBN 978-1-4503-9096-5

QC 20221104

Available from: 2022-11-04 Created: 2022-11-04 Last updated: 2025-05-14Bibliographically approved
Tommasini, R., Roy, S. B., Wang, X., Wang, H., Ji, H., Han, J., . . . He, X. (2022). Accepted Tutorials at The Web Conference 2022. In: WWW 2022 - Companion Proceedings of the Web Conference 2022: . Paper presented at 31st ACM Web Conference, WWW 2022, 25 April 2022 (pp. 391-399). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Accepted Tutorials at The Web Conference 2022
Show others...
2022 (English)In: WWW 2022 - Companion Proceedings of the Web Conference 2022, Association for Computing Machinery (ACM) , 2022, p. 391-399Conference paper, Published paper (Refereed)
Abstract [en]

This paper summarizes the content of the 20 tutorials that have been given at The Web Conference 2022: 85% of these tutorials are lecture style, and 15% of these are hands on. 

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2022
Keywords
the web conference, tutorials, Hand-on, Lecture style, Tutorial, Web conferences
National Category
Computer Systems
Identifiers
urn:nbn:se:kth:diva-327048 (URN)10.1145/3487553.3547182 (DOI)001147592700078 ()2-s2.0-85137482806 (Scopus ID)
Conference
31st ACM Web Conference, WWW 2022, 25 April 2022
Note

QC 20230523

Available from: 2023-05-23 Created: 2023-05-23 Last updated: 2025-12-05Bibliographically approved
Neumann, S. & Peng, P. (2022). Sublinear-Time Clustering Oracle for Signed Graphs. In: Chaudhuri, K Jegelka, S Song, L Szepesvari, C Niu, G Sabato, S (Ed.), Proceedings International Conference on Machine Learning, ICML 2022: . Paper presented at 38th International Conference on Machine Learning (ICML), JUL 17-23, 2022, Baltimore, MD. ML Research Press
Open this publication in new window or tab >>Sublinear-Time Clustering Oracle for Signed Graphs
2022 (English)In: Proceedings International Conference on Machine Learning, ICML 2022 / [ed] Chaudhuri, K Jegelka, S Song, L Szepesvari, C Niu, G Sabato, S, ML Research Press , 2022Conference paper, Published paper (Refereed)
Abstract [en]

Social networks are often modeled using signed graphs, where vertices correspond to users and edges have a sign that indicates whether an interaction between users was positive or negative. The arising signed graphs typically contain a clear community structure in the sense that the graph can be partitioned into a small number of polarized communities, each defining a sparse cut and indivisible into smaller polarized sub-communities. We provide a local clustering oracle for signed graphs with such a clear community structure, that can answer membership queries, i.e., "Given a vertex v, which community does v belong to?", in sublinear time by reading only a small portion of the graph. Formally, when the graph has bounded maximum degree and the number of communities is at most O(log n), then with (O) over tilde(root n poly(1/epsilon)) preprocessing time, our oracle can answer each membership query in (O) over tilde(root n poly(1/epsilon)) time, and it correctly classifies a (1 - epsilon)-fraction of vertices w.r.t. a set of hidden planted ground-truth communities. Our oracle is desirable in applications where the clustering information is needed for only a small number of vertices. Previously, such local clustering oracles were only known for unsigned graphs; our generalization to signed graphs requires a number of new ideas and gives a novel spectral analysis of the behavior of random walks with signs. We evaluate our algorithm for constructing such an oracle and answering membership queries on both synthetic and real-world datasets, validating its performance in practice.

Place, publisher, year, edition, pages
ML Research Press, 2022
Series
Proceedings of Machine Learning Research, ISSN 2640-3498 ; 162
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-326571 (URN)000900064906029 ()2-s2.0-85163106853 (Scopus ID)
Conference
38th International Conference on Machine Learning (ICML), JUL 17-23, 2022, Baltimore, MD
Note

QC 20230508

Available from: 2023-05-08 Created: 2023-05-08 Last updated: 2023-07-13Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-3981-1500

Search in DiVA

Show all publications