kth.sePublications KTH
Change search
Link to record
Permanent link

Direct link
Publications (4 of 4) Show all publications
Klasson, M., Zhang, C. & Kjellström, H. (2020). Using Variational Multi-view Learning for Classification of Grocery Items. Patterns, 1(8)
Open this publication in new window or tab >>Using Variational Multi-view Learning for Classification of Grocery Items
2020 (English)In: Patterns, ISSN 2666-3899, Vol. 1, no 8Article in journal (Refereed) Published
Abstract [en]

An essential task for computer vision-based assistive technologies is to help visually impaired people to recognize objects in constrained environments, for instance, recognizing food items in grocery stores. In this paper, we introduce a novel dataset with natural images of groceries—fruits, vegetables, and packaged products—where all images have been taken inside grocery stores to resemble a shopping scenario. Additionally, we download iconic images and text descriptions for each item that can be utilized for better representation learning of groceries. We select a multi-view generative model, which can combine the different item information into lower-dimensional representations. The experiments show that utilizing the additional information yields higher accuracies on classifying grocery items than only using the natural images. We observe that iconic images help to construct representations separated by visual differences of the items, while text descriptions enable the model to distinguish between visually similar items by different ingredients.

Place, publisher, year, edition, pages
Elsevier, 2020
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-292181 (URN)
Note

QC 20220426

Available from: 2021-03-25 Created: 2021-03-25 Last updated: 2025-02-07Bibliographically approved
Klasson, M., Zhang, C. & Kjellström, H. (2019). A hierarchical grocery store image dataset with visual and semantic labels. In: Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019: . Paper presented at 19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019, 7 January 2019 through 11 January 2019 (pp. 491-500). Institute of Electrical and Electronics Engineers (IEEE), Article ID 8658240.
Open this publication in new window or tab >>A hierarchical grocery store image dataset with visual and semantic labels
2019 (English)In: Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019, Institute of Electrical and Electronics Engineers (IEEE), 2019, p. 491-500, article id 8658240Conference paper, Published paper (Refereed)
Abstract [en]

Image classification models built into visual support systems and other assistive devices need to provide accurate predictions about their environment. We focus on an application of assistive technology for people with visual impairments, for daily activities such as shopping or cooking. In this paper, we provide a new benchmark dataset for a challenging task in this application – classification of fruits, vegetables, and refrigerated products, e.g. milk packages and juice cartons, in grocery stores. To enable the learning process to utilize multiple sources of structured information, this dataset not only contains a large volume of natural images but also includes the corresponding information of the product from an online shopping website. Such information encompasses the hierarchical structure of the object classes, as well as an iconic image of each type of object. This dataset can be used to train and evaluate image classification models for helping visually impaired people in natural environments. Additionally, we provide benchmark results evaluated on pretrained convolutional neural networks often used for image understanding purposes, and also a multi-view variational autoencoder, which is capable of utilizing the rich product information in the dataset.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2019
Keywords
Benchmarking, Computer vision, Electronic commerce, Image classification, Large dataset, Learning systems, Neural networks, Semantics, Accurate prediction, Assistive technology, Classification models, Convolutional neural network, Hierarchical structures, Natural environments, Structured information, Visually impaired people, Classification (of information)
National Category
Computer graphics and computer vision Computer graphics and computer vision
Identifiers
urn:nbn:se:kth:diva-252223 (URN)10.1109/WACV.2019.00058 (DOI)000469423400051 ()2-s2.0-85063566822 (Scopus ID)
Conference
19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019, 7 January 2019 through 11 January 2019
Note

QC 20190611

Part of ISBN 9781728119755

Available from: 2019-06-11 Created: 2019-06-11 Last updated: 2025-02-07Bibliographically approved
Klasson, M., Kjellström, H. & Zhang, C.Learn the Time to Learn: Replay Scheduling in Continual Learning.
Open this publication in new window or tab >>Learn the Time to Learn: Replay Scheduling in Continual Learning
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Replay-based continual learning have shown to be successful in mitigating catastrophic forgetting despite having limited access to historical data. However, storing historical data is cheap in many real-world applications, yet replaying all seen data would be prohibited due to processing time constraints. In such settings, we propose learning the time to learn for a continual learning system, in which we learn replay schedules over which tasks to replay at different time steps. To demonstrate the importance of learning the time to learn, we use Monte Carlo tree search in an ideal continual learning scenario to find the proper replay schedule. We perform extensive evaluations to show the benefits of replay scheduling in various memory settings and in combination with different replay methods. Moreover, the results indicate that the found schedules are consistent with human learning insights. Our findings opens up for new research directions that can bring current continual learning research closer to real-world needs.

Keywords
Continual Learning; Replay Memory
National Category
Computer graphics and computer vision
Research subject
Computer Science
Identifiers
urn:nbn:se:kth:diva-320005 (URN)
Funder
Promobilia foundation, F-16500
Note

QC 20221018

Available from: 2022-10-12 Created: 2022-10-12 Last updated: 2025-02-07Bibliographically approved
Klasson, M., Kjellström, H. & Zhang, C.Policy Learning for Replay Scheduling in Continual Learning.
Open this publication in new window or tab >>Policy Learning for Replay Scheduling in Continual Learning
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Scheduling over which tasks to select for replay at different times have been demonstrated to be important in continual learning. However, a replay scheduling policy that can be applied in any continual learning scenario is currently missing, which makes replay scheduling infeasible in real-world scenarios. To this end, we propose using reinforcement learning to enable learning policies that can be applied in new continual learning scenarios without additional computational cost. In our experiments, we show that the learned policies can propose replay schedules that efficiently mitigate catastrophic forgetting in environments with previously unseen task orders and datasets. The proposed approach opens up for new research directions in replay-based continual learning that stems well with real-world needs.

Keywords
Continual Learning; Replay Memory; Replay Scheduling; Reinforcement Learning
National Category
Computer graphics and computer vision
Research subject
Computer Science
Identifiers
urn:nbn:se:kth:diva-320006 (URN)
Funder
Promobilia foundation, F-16500
Note

QC 20221018

Available from: 2022-10-12 Created: 2022-10-12 Last updated: 2025-02-07Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-8633-281X

Search in DiVA

Show all publications