kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science.ORCID iD: 0000-0003-4090-7283
University of Eastern Finland, Kuopio, Finland.ORCID iD: 0000-0003-2271-316X
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Theoretical Computer Science, TCS.ORCID iD: 0000-0002-3981-1500
2023 (English)In: Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023, Proceedings, Part I, Springer Nature , 2023, Vol. 14169, p. 743-758Conference paper, Published paper (Refereed)
Abstract [en]

Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping clusters, making visualization more complicated. In this paper, we study the problem of visualizing a given clustering of overlapping clusters in bipartite graphs and the related problem of visualizing Boolean Matrix Factorizations. We conceptualize three different objectives that any good visualization should satisfy: (1) proximity of cluster elements, (2) large consecutive areas of elements from the same cluster, and (3) large uninterrupted areas in the visualization, regardless of the cluster membership. We provide objective functions that capture these goals and algorithms that optimize these objective functions. Interestingly, in experiments on real-world datasets, we find that the best trade-off between these competing goals is achieved by a novel heuristic, which locally aims to place rows and columns with similar cluster membership next to each other.

Place, publisher, year, edition, pages
Springer Nature , 2023. Vol. 14169, p. 743-758
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14169
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-338942DOI: 10.1007/978-3-031-43412-9_44ISI: 001156137100044Scopus ID: 2-s2.0-85174449746OAI: oai:DiVA.org:kth-338942DiVA, id: diva2:1808580
Conference
European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023
Note

Part of book ISBN 9783031434112

QC 20231101

Available from: 2023-10-31 Created: 2023-10-31 Last updated: 2024-03-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Marette, ThibaultNeumann, Stefan

Search in DiVA

By author/editor
Marette, ThibaultMiettinen, PauliNeumann, Stefan
By organisation
Computer ScienceTheoretical Computer Science, TCS
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 73 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf