Minimizing stochastic complexity using local search and GLA with applications to classification of bacteria
2000 (English)In: Biosystems (Amsterdam. Print), ISSN 0303-2647, E-ISSN 1872-8324, Vol. 57, no 1, 37-48 p.Article in journal (Refereed) Published
In this paper, we compare the performance of two iterative clustering methods when applied to an extensive data set describing strains of the bacterial family Enterobacteriaceae. In both methods, the classification (i.e. the number of classes and the partitioning) is determined by minimizing stochastic complexity. The first method performs the minimization by repeated application of the generalized Lloyd algorithm (GLA). The second method uses an optimization technique known as local search (LS). The method modifies the current solution by making global changes to the class structure and it, then, performs local fine-tuning to find a local optimum. II is observed that if we fix the number of classes, the LS finds a classification with a lower stochastic complexity value than GLA. In addition, the valiance of the solutions is much smaller for the LS due to its more systematic method of searching. Overall, the two algorithms produce similar classifications but they merge cel tain natural classes with microbiological relevance in different ways.
Place, publisher, year, edition, pages
2000. Vol. 57, no 1, 37-48 p.
classification, stochastic complexity, GLA, local search, numerical, taxonomy, vector quantizer design, algorithm, enterobacteriaceae, identification
IdentifiersURN: urn:nbn:se:kth:diva-19943ISI: 000088594900004OAI: oai:DiVA.org:kth-19943DiVA: diva2:338635
QC 201005252010-08-102010-08-10Bibliographically approved