kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Systematic review of class imbalance problems in manufacturing
Artificial Engineering, Via del Rione Sirignano, 10, Naples, 80121, Italy.
Catholic University of the Sacred Heart, Largo Agostino Gemelli, 1, Milan, 20123, Italy.
KTH, School of Industrial Engineering and Management (ITM), Production engineering.ORCID iD: 0000-0001-8679-8049
2023 (English)In: Journal of manufacturing systems, ISSN 0278-6125, E-ISSN 1878-6642, Vol. 71, p. 620-644Article, review/survey (Refereed) Published
Abstract [en]

Class imbalance (CI) is a well-known problem in data science. Nowadays, it is affecting the data modeling of many of the real-world processes that are being digitized. The manufacturing industry turns out to be highly affected by this problem, especially in fault inspection, prediction or monitoring processes, and in all those processes where the production efficiency is high and the data samples of anomalous events are rare. In this work, we systematically review all the data manipulation, machine learning or deep learning solutions to the CI problem in the manufacturing domain. We also critically evaluate all the different metrics that researchers can compare in order to estimate the improvements carried by their proposed solutions, and we look at the availability of public source code and data-imbalanced datasets that can be used for benchmarking. Finally, we summarize the most applied solutions to the CI problem in manufacturing and we look at future challenges. While posing a reference for the best practices at the time of this review, we challenge researchers to standardize the use of data science algorithms for CI in the manufacturing domain.

Place, publisher, year, edition, pages
Elsevier BV , 2023. Vol. 71, p. 620-644
Keywords [en]
Class imbalance, Data manipulation, Deep learning, Machine learning, Manufacturing
National Category
Production Engineering, Human Work Science and Ergonomics Robotics and automation
Identifiers
URN: urn:nbn:se:kth:diva-339556DOI: 10.1016/j.jmsy.2023.10.014ISI: 001107074700001Scopus ID: 2-s2.0-85175302405OAI: oai:DiVA.org:kth-339556DiVA, id: diva2:1811910
Note

QC 20231114

Available from: 2023-11-14 Created: 2023-11-14 Last updated: 2025-12-08Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Wang, Lihui

Search in DiVA

By author/editor
Wang, Lihui
By organisation
Production engineering
In the same journal
Journal of manufacturing systems
Production Engineering, Human Work Science and ErgonomicsRobotics and automation

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 102 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf