kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Federated deep reinforcement learning for dynamic job scheduling in cloud-edge collaborative manufacturing systems
KTH, School of Industrial Engineering and Management (ITM), Production engineering. School of Automation Science and Electrical Engineering, Beihang University, Beijing, People's Republic of China.
School of Automation Science and Electrical Engineering, Beihang University, Beijing, People's Republic of China; State Key Laboratory of Intelligent Manufacturing System Technology, Beijing, People's Republic of China.
KTH, School of Industrial Engineering and Management (ITM), Production engineering, Industrial Production Systems.ORCID iD: 0000-0001-8679-8049
KTH, School of Industrial Engineering and Management (ITM), Production engineering.ORCID iD: 0000-0001-9694-0483
Show others and affiliations
2024 (English)In: International Journal of Production Research, ISSN 0020-7543, E-ISSN 1366-588X, Vol. 62, no 21, p. 7743-7762Article in journal (Refereed) Published
Abstract [en]

The cloud-edge collaborative manufacturing system (CCMS) connects distributed factories to a cloud centre through cloud-edge collaborative communication, introducing both opportunities and challenges to conventional dynamic job scheduling. Enhancing each factory's scheduling performance by sharing general scheduling knowledge among heterogeneous factories under the consideration of data privacy protection remains challenging. To this end, this paper proposes to solve the dynamic job scheduling in the context of CCMS with a novel federated deep reinforcement learning (FDRL) approach. Within each factory, the scheduling objective involves minimising the makespan and energy consumption, accounting for machine warm-up procedures and real-time dynamics. To handle heterogeneous policy structures, we aggregate their hidden parameters through FDRL, with states, actions, and rewards designed to facilitate the aggregation. The two-phase algorithm, comprising iterative local training and global aggregation, trains the scheduling policies. Constraint items are introduced to the loss functions to smooth local training, and the global aggregation considers production scales and obtained objectives. The proposed approach enhances the solution quality and generalisation of each factory's scheduling policy without exposing original production data. Numerical experiments conducted on sixty scheduling instances validate the superiority of the proposed approach compared to twelve dynamic scheduling methods. Compared to independently trained DRL-based approaches, the proposed FDRL-based approach achieves up to an 8.9% reduction in makespan and a 22.3% decrease in energy consumption through knowledge sharing.

Place, publisher, year, edition, pages
Informa UK Limited , 2024. Vol. 62, no 21, p. 7743-7762
Keywords [en]
cloud manufacturing, cloud-edge collaboration, Dynamic scheduling, federated deep reinforcement learning, innovation and infrastructure, intelligent decision-making, SDG 9: Industry
National Category
Computer Sciences Production Engineering, Human Work Science and Ergonomics Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-367441DOI: 10.1080/00207543.2024.2328116ISI: 001185832600001Scopus ID: 2-s2.0-85188420583OAI: oai:DiVA.org:kth-367441DiVA, id: diva2:1984819
Note

QC 20250718

Available from: 2025-07-18 Created: 2025-07-18 Last updated: 2025-07-18Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Wang, LihuiWang, Xi Vincent

Search in DiVA

By author/editor
Wang, XiaohanWang, LihuiWang, Xi VincentLiu, Yongkui
By organisation
Production engineeringIndustrial Production Systems
In the same journal
International Journal of Production Research
Computer SciencesProduction Engineering, Human Work Science and ErgonomicsComputer Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 112 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf