kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Dynamic multi-tour order picking in an automotive-part warehouse based on attention-aware deep reinforcement learning
Beihang University.
Beihang University.
KTH, School of Industrial Engineering and Management (ITM), Production engineering, Industrial Production Systems.ORCID iD: 0000-0001-8679-8049
KTH, School of Industrial Engineering and Management (ITM), Production engineering, Advanced Maintenance and Production Logistics.ORCID iD: 0000-0003-4180-6003
Show others and affiliations
2025 (English)In: Robotics and Computer-Integrated Manufacturing, ISSN 0736-5845, E-ISSN 1879-2537, Robotics and Computer-Integrated Manufacturing, ISSN 0736-5845, Vol. 94, p. 102959-102959, article id 102959Article in journal (Refereed) Published
Abstract [en]

Dynamic order picking has usually demonstrated significant impacts on production efficiency in warehouse management. In the context of an automotive-part warehouse, this paper addresses a dynamic multi-tour order-picking problem based on a novel attention-aware deep reinforcement learning-based (ADRL) method. The multi-tour represents that one order-picking task must be split into multiple tours due to the cart capacity and the operator’s workload constraints. First, the multi-tour order-picking problem is formulated as a mathematical model, and then reformulated as a Markov decision process. Second, a novel DRL-based method is proposed to solve it effectively. Compared to the existing DRL-based methods, this approach employs multi-head attention to perceive warehouse situations. Additionally, three improvements are proposed to further strengthen the solution quality and generalization, including (1) the extra location representation to align the batch length during training, (2) the dynamic decoding to integrate real-time information of the warehouse environment during inference, and (3) the proximal policy optimization with entropy bonus to facilitate action exploration during training. Finally, comparison experiments based on thousands of order-picking instances from the Swedish warehouse validated that the proposed ADRL could outperform the other twelve DRL-based methods at most by 40.6%, considering the optimization objective. Furthermore, the performance gap between ADRL and seven evolutionary algorithms is controlled within 3%, while ADRL can be hundreds or thousands of times faster than these EAs regarding the solving speed.

Place, publisher, year, edition, pages
2025. Vol. 94, p. 102959-102959, article id 102959
Keywords [en]
Smart manufacturing system; Industry 5.0; Manual order picking; Deep reinforcement learning; Intelligent decision-making
National Category
Production Engineering, Human Work Science and Ergonomics
Research subject
Industrial Engineering and Management
Identifiers
URN: urn:nbn:se:kth:diva-358737DOI: 10.1016/j.rcim.2025.102959ISI: 001401135400001Scopus ID: 2-s2.0-85214875132OAI: oai:DiVA.org:kth-358737DiVA, id: diva2:1929595
Funder
Vinnova, 2022-02413
Note

QC 20250121

Available from: 2025-01-21 Created: 2025-01-21 Last updated: 2025-12-08Bibliographically approved

Open Access in DiVA

fulltext(3352 kB)264 downloads
File information
File name FULLTEXT01.pdfFile size 3352 kBChecksum SHA-512
6f00250b102edf766984a7ca29316130e5aac569caf32f9d97f741ffbdd276c840eb6974215f42de7e427046a9428c7a0f6a0199a5e072be643efd8f24c4c135
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Wang, LihuiRuiz Zúñiga, EnriqueWang, Xi VincentFlores-García, Erik

Search in DiVA

By author/editor
Wang, LihuiRuiz Zúñiga, EnriqueWang, Xi VincentFlores-García, Erik
By organisation
Industrial Production SystemsAdvanced Maintenance and Production LogisticsProduction engineeringXPRES, Excellence in production research
In the same journal
Robotics and Computer-Integrated Manufacturing
Production Engineering, Human Work Science and Ergonomics

Search outside of DiVA

GoogleGoogle Scholar
Total: 264 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 1130 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf