kth.sePublikationer KTH
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Dynamic multi-tour order picking in an automotive-part warehouse based on attention-aware deep reinforcement learning
Beihang University.
Beihang University.
KTH, Skolan för industriell teknik och management (ITM), Produktionsutveckling, Industriella produktionssystem.ORCID-id: 0000-0001-8679-8049
KTH, Skolan för industriell teknik och management (ITM), Produktionsutveckling, Avancerade underhållssystem och produktionslogistik.ORCID-id: 0000-0003-4180-6003
Visa övriga samt affilieringar
2025 (Engelska)Ingår i: Robotics and Computer-Integrated Manufacturing, ISSN 0736-5845, E-ISSN 1879-2537, Robotics and Computer-Integrated Manufacturing, ISSN 0736-5845, Vol. 94, s. 102959-102959, artikel-id 102959Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Dynamic order picking has usually demonstrated significant impacts on production efficiency in warehouse management. In the context of an automotive-part warehouse, this paper addresses a dynamic multi-tour order-picking problem based on a novel attention-aware deep reinforcement learning-based (ADRL) method. The multi-tour represents that one order-picking task must be split into multiple tours due to the cart capacity and the operator’s workload constraints. First, the multi-tour order-picking problem is formulated as a mathematical model, and then reformulated as a Markov decision process. Second, a novel DRL-based method is proposed to solve it effectively. Compared to the existing DRL-based methods, this approach employs multi-head attention to perceive warehouse situations. Additionally, three improvements are proposed to further strengthen the solution quality and generalization, including (1) the extra location representation to align the batch length during training, (2) the dynamic decoding to integrate real-time information of the warehouse environment during inference, and (3) the proximal policy optimization with entropy bonus to facilitate action exploration during training. Finally, comparison experiments based on thousands of order-picking instances from the Swedish warehouse validated that the proposed ADRL could outperform the other twelve DRL-based methods at most by 40.6%, considering the optimization objective. Furthermore, the performance gap between ADRL and seven evolutionary algorithms is controlled within 3%, while ADRL can be hundreds or thousands of times faster than these EAs regarding the solving speed.

Ort, förlag, år, upplaga, sidor
2025. Vol. 94, s. 102959-102959, artikel-id 102959
Nyckelord [en]
Smart manufacturing system; Industry 5.0; Manual order picking; Deep reinforcement learning; Intelligent decision-making
Nationell ämneskategori
Produktionsteknik, arbetsvetenskap och ergonomi
Forskningsämne
Industriell ekonomi och organisation
Identifikatorer
URN: urn:nbn:se:kth:diva-358737DOI: 10.1016/j.rcim.2025.102959ISI: 001401135400001Scopus ID: 2-s2.0-85214875132OAI: oai:DiVA.org:kth-358737DiVA, id: diva2:1929595
Forskningsfinansiär
Vinnova, 2022-02413
Anmärkning

QC 20250121

Tillgänglig från: 2025-01-21 Skapad: 2025-01-21 Senast uppdaterad: 2026-02-25Bibliografiskt granskad

Open Access i DiVA

fulltext(3352 kB)268 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 3352 kBChecksumma SHA-512
6f00250b102edf766984a7ca29316130e5aac569caf32f9d97f741ffbdd276c840eb6974215f42de7e427046a9428c7a0f6a0199a5e072be643efd8f24c4c135
Typ fulltextMimetyp application/pdf

Övriga länkar

Förlagets fulltextScopus

Person

Wang, LihuiRuiz Zúñiga, EnriqueWang, Xi VincentFlores-García, Erik

Sök vidare i DiVA

Av författaren/redaktören
Wang, LihuiRuiz Zúñiga, EnriqueWang, Xi VincentFlores-García, Erik
Av organisationen
Industriella produktionssystemAvancerade underhållssystem och produktionslogistikProduktionsutvecklingXPRES, Excellence in production research
I samma tidskrift
Robotics and Computer-Integrated Manufacturing
Produktionsteknik, arbetsvetenskap och ergonomi

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 268 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 1132 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf