kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards sim-to-real industrial parts classification with synthetic dataset
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL. Scania CV AB, Scania Cv Ab.ORCID iD: 0000-0002-4180-3809
Scania CV AB, Scania Cv Ab.
Scania CV AB, Scania Cv Ab.
University of Skövde, University of Skövde.
Show others and affiliations
2023 (English)In: Proceedings: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023, Institute of Electrical and Electronics Engineers (IEEE) , 2023, p. 4454-4463Conference paper, Published paper (Refereed)
Abstract [en]

This paper is about effectively utilizing synthetic data for training deep neural networks for industrial parts classification, in particular, by taking into account the domain gap against real-world images. To this end, we introduce a synthetic dataset that may serve as a preliminary testbed for the Sim-to-Real challenge; it contains 17 objects of six industrial use cases, including isolated and assembled parts. A few subsets of objects exhibit large similarities in shape and albedo for reflecting challenging cases of industrial parts. All the sample images come with and without random backgrounds and post-processing for evaluating the importance of domain randomization. We call it Synthetic Industrial Parts dataset (SIP-17). We study the usefulness of SIP-17 through benchmarking the performance of five state-of-the-art deep network models, supervised and self-supervised, trained only on the synthetic data while testing them on real data. By analyzing the results, we deduce some insights on the feasibility and challenges of using synthetic data for industrial parts classification and for further developing larger-scale synthetic datasets. Our dataset † and code ‡ are publicly available.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2023. p. 4454-4463
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-337847DOI: 10.1109/CVPRW59228.2023.00468Scopus ID: 2-s2.0-85170821045OAI: oai:DiVA.org:kth-337847DiVA, id: diva2:1803769
Conference
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023, Vancouver, Canada, Jun 18 2023 - Jun 22 2023
Note

Part of ISBN 9798350302493

QC 20231010

Available from: 2023-10-10 Created: 2023-10-10 Last updated: 2025-11-12Bibliographically approved
In thesis
1. Towards Automated Parts Recognition in Manufacturing with Synthetic Data
Open this publication in new window or tab >>Towards Automated Parts Recognition in Manufacturing with Synthetic Data
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

This thesis advances the understanding and application of synthetic data for manufacturing parts recognition. Vision-based inspection systems in manufacturing rely heavily on real image data, which are costly to collect, annotate, and adapt across products and environments. To address these challenges, this work presents a systematic investigation of how synthetic data can be effectively generated, evaluated, and applied for robust and scalable performance. The research introduces a series of new industrial benchmark datasets covering multiple manufacturing use cases and factory environments: SIP-17, SIP15-OD, and SIP2A-OD, to enable unified evaluation of sim-to-real transfer in classification and detection tasks. Building on these datasets, a domain randomization pipeline is developed to systematically explore the effects of rendering parameters, material variability, and illumination on model generalization. To further automate data generation, the thesis proposes Synthetic Active Learning (SAL), a closed-loop framework that identifies model weaknesses and adaptively refines synthetic data generation without requiring real samples or manual tuning. Experiments across the benchmark datasets show that the proposed method improves model robustness compared to existing approaches while reducing manual labeling requirements. Collectively, these contributions provide new insights into how synthetic data can be systematically leveraged to build data-efficient, automated, and reliable vision systems for manufacturing, aiming to support future development of intelligent and flexible production systems.

Abstract [sv]

 Denna avhandling bidrar till förståelsen och tillämpningen av syntetisk data för igenkänning av tillverkningskomponenter. Visionsbaserade inspektionssystem inom industrin är starkt beroende av verkliga bilddata, vilka är kostsamma att samla in, annotera och anpassa mellan olika produkter och miljöer. För att hantera dessa utmaningar presenterar arbetet en systematisk undersökning av hur syntetisk data effektivt kan genereras, utvärderas och tillämpas för robust och skalbar prestanda. Forskningen introducerar en serie nya industriella benchmark-datamängder som täcker flera användningsfall och fabriksmiljöer: SIP-17, SIP15-OD ochSIP2A-OD, för att möjliggöra en enhetlig utvärdering av sim-to-real-överföring inom klassificerings- och detektionsuppgifter. Med utgångspunkt i dessa datamängder utvecklas en domain randomizationpipeline som systematiskt undersöker effekterna av renderingsparametrar, materialvariation och belysning på modellens generaliseringsförmåga. För att ytterligare automatisera datagenereringen föreslås Synthetic Active Learning (SAL), ett slutet ramverk som identifierar modellens svagheter och adaptivt förfinar den syntetiska datagenereringen utan att kräva verkliga prover eller manuell justering. Experiment över benchmark-datamängderna visar att de föreslagna metoderna förbättrar modellens robusthet jämfört med befintliga tillvägagångssätt, samtidigt som behovet av manuell annotering minskar. Sammantaget ger dessa bidrag nya insikter i hur syntetisk data systematiskt kan utnyttjas för att bygga dataeffektiva, automatiserade och tillförlitliga visionssystem för tillverkningsindustrin, med målet att stödja utvecklingen av framtida intelligenta och flexibla produktionssystem.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2025. p. xix, 75
Series
TRITA-EECS-AVL ; 2025:105
Keywords
Synthetic Data, Sim-to-real Transfer, Object Detection, Domain Randomization, Ac tive Learning, Dataset Generation, Automation, Computer Vision, Industrial Inspec tion, Syntetisk Data, Sim-to-real-överföring, Objektigenkänning, Domain Randomization, Aktivt Lärande, Datagenerering, Automation, Datorseende, Industriell Inspektion
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-372644 (URN)978-91-8106-466-7 (ISBN)
Public defence
2025-12-09, https://kth-se.zoom.us/j/62913476523, Kollegiesalen, Brinellvägen 8, Stockholm, 13:00 (English)
Opponent
Supervisors
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

QC 20251112

Available from: 2025-11-12 Created: 2025-11-11 Last updated: 2025-11-24Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Zhu, XiaomengBjörkman, MårtenMaki, Atsuto

Search in DiVA

By author/editor
Zhu, XiaomengBjörkman, MårtenMaki, Atsuto
By organisation
Robotics, Perception and Learning, RPL
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 269 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf