kth.sePublikationer KTH
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap.ORCID-id: 0000-0003-4090-7283
University of Eastern Finland, Kuopio, Finland.ORCID-id: 0000-0003-2271-316X
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Teoretisk datalogi, TCS.ORCID-id: 0000-0002-3981-1500
2023 (Engelska)Ingår i: Machine Learning and Knowledge Discovery in Databases: Research Track - European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023, Proceedings, Part I, Springer Nature , 2023, Vol. 14169, s. 743-758Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping clusters, making visualization more complicated. In this paper, we study the problem of visualizing a given clustering of overlapping clusters in bipartite graphs and the related problem of visualizing Boolean Matrix Factorizations. We conceptualize three different objectives that any good visualization should satisfy: (1) proximity of cluster elements, (2) large consecutive areas of elements from the same cluster, and (3) large uninterrupted areas in the visualization, regardless of the cluster membership. We provide objective functions that capture these goals and algorithms that optimize these objective functions. Interestingly, in experiments on real-world datasets, we find that the best trade-off between these competing goals is achieved by a novel heuristic, which locally aims to place rows and columns with similar cluster membership next to each other.

Ort, förlag, år, upplaga, sidor
Springer Nature , 2023. Vol. 14169, s. 743-758
Serie
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14169
Nationell ämneskategori
Datorsystem
Identifikatorer
URN: urn:nbn:se:kth:diva-338942DOI: 10.1007/978-3-031-43412-9_44ISI: 001156137100044Scopus ID: 2-s2.0-85174449746OAI: oai:DiVA.org:kth-338942DiVA, id: diva2:1808580
Konferens
European Conference, ECML PKDD 2023, Turin, Italy, September 18-22, 2023
Anmärkning

Part of book ISBN 9783031434112

QC 20231101

Tillgänglig från: 2023-10-31 Skapad: 2023-10-31 Senast uppdaterad: 2024-03-05Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Marette, ThibaultNeumann, Stefan

Sök vidare i DiVA

Av författaren/redaktören
Marette, ThibaultMiettinen, PauliNeumann, Stefan
Av organisationen
DatavetenskapTeoretisk datalogi, TCS
Datorsystem

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 75 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf