kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A cutting plane algorithm for globally solving low-dimensional k-means problems
KTH, School of Engineering Sciences (SCI), Mathematics (Dept.), Numerical Analysis, Optimization and Systems Theory.ORCID iD: 0000-0002-3316-770X
KTH, School of Engineering Sciences (SCI), Mathematics (Dept.), Numerical Analysis, Optimization and Systems Theory.ORCID iD: 0000-0003-0299-5745
KTH, School of Engineering Sciences (SCI), Mathematics (Dept.), Numerical Analysis, Optimization and Systems Theory.ORCID iD: 0000-0001-5158-9255
2025 (English)In: Optimization Letters, ISSN 1862-4472, E-ISSN 1862-4480Article in journal (Refereed) Published
Abstract [en]

Clustering is one of the most fundamental tools in data science and machine learning, and k-means clustering is one of the most common of such methods. There is a variety of approximate algorithms for the k-means problem, but only a few methods compute the globally optimal solution, as it is in general NP-hard. In this paper, we consider the k-means problem for instances with low-dimensional data and formulate it as a structured concave assignment problem. This allows us to exploit the low-dimensional structure and solve the problem to global optimality for very large data sets with several clusters, complementing and outperforming state-of-the-art for this class of problems. The method builds on iteratively solving a small concave problem and a large linear programming or assignment problem. This gives a sequence of feasible solutions along with bounds, which we show converges to a zero optimality gap. The paper combines methods from global optimization to accelerate the procedure, and we provide numerical results on synthetic data and real-world data.

Place, publisher, year, edition, pages
Springer Nature , 2025.
Keywords [en]
clustering, Global optimization, k-means problem, Sum of squares
National Category
Computational Mathematics Control Engineering
Identifiers
URN: urn:nbn:se:kth:diva-369850DOI: 10.1007/s11590-025-02235-zISI: 001555998100001Scopus ID: 2-s2.0-105014033073OAI: oai:DiVA.org:kth-369850DiVA, id: diva2:1998495
Note

QC 20250917

Not duplicate with DiVA 1935628

Available from: 2025-09-17 Created: 2025-09-17 Last updated: 2025-09-17Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Ryner, MartinKronqvist, JanKarlsson, Johan

Search in DiVA

By author/editor
Ryner, MartinKronqvist, JanKarlsson, Johan
By organisation
Numerical Analysis, Optimization and Systems Theory
In the same journal
Optimization Letters
Computational MathematicsControl Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 29 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf