Change search
ReferencesLink to record
Permanent link

Direct link
Minimizing stochastic complexity using local search and GLA with applications to classification of bacteria
KTH, Superseded Departments, Mathematics.ORCID iD: 0000-0003-1489-8512
2000 (English)In: Biosystems (Amsterdam. Print), ISSN 0303-2647, E-ISSN 1872-8324, Vol. 57, no 1, 37-48 p.Article in journal (Refereed) Published
Abstract [en]

In this paper, we compare the performance of two iterative clustering methods when applied to an extensive data set describing strains of the bacterial family Enterobacteriaceae. In both methods, the classification (i.e. the number of classes and the partitioning) is determined by minimizing stochastic complexity. The first method performs the minimization by repeated application of the generalized Lloyd algorithm (GLA). The second method uses an optimization technique known as local search (LS). The method modifies the current solution by making global changes to the class structure and it, then, performs local fine-tuning to find a local optimum. II is observed that if we fix the number of classes, the LS finds a classification with a lower stochastic complexity value than GLA. In addition, the valiance of the solutions is much smaller for the LS due to its more systematic method of searching. Overall, the two algorithms produce similar classifications but they merge cel tain natural classes with microbiological relevance in different ways.

Place, publisher, year, edition, pages
2000. Vol. 57, no 1, 37-48 p.
Keyword [en]
classification, stochastic complexity, GLA, local search, numerical, taxonomy, vector quantizer design, algorithm, enterobacteriaceae, identification
URN: urn:nbn:se:kth:diva-19943ISI: 000088594900004OAI: diva2:338635
QC 20100525Available from: 2010-08-10 Created: 2010-08-10Bibliographically approved

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Koski, Timo
By organisation
In the same journal
Biosystems (Amsterdam. Print)

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 14 hits
ReferencesLink to record
Permanent link

Direct link