Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Safe Deep Reinforcement Learning for Renewable Energy Integrated Power System
KTH, Skolan för elektroteknik och datavetenskap (EECS), Elektroteknik, Elkraftteknik.ORCID-id: 0000-0003-0746-0221
KTH, Skolan för elektroteknik och datavetenskap (EECS), Elektroteknik, Elkraftteknik.ORCID-id: 0000-0002-2793-9048
2025 (engelsk)Inngår i: Digitalisation of Local Energy Systems / [ed] Weiqi Hua; Xiao-Ping Zhang; David C.H. Wallom, Springer Nature , 2025, Vol. Part F989, s. 313-336Kapittel i bok, del av antologi (Annet vitenskapelig)
Abstract [en]

Deep reinforcement learning (DRL) is a promising solution for the coordination of power systems with high penetration of inverter-based renewable energy sources (RESs). Yet, when adopting the DRL-based control method, the safe and optimal operation of the system cannot be guaranteed at the same time, as the conventional DRL agent is not designed to solve the hard constraint problem. To address this challenge, a deep neural network (DNN) assisted projection-based DRL method for safe control of distribution grids is proposed in this section. First, a finite iteration projection algorithm is proposed to guarantee hard constraints by converting a non-convex optimization problem into a finite iteration problem. Next, a DNN-assisted projection method is proposed to accelerate the calculation of projection and achieve the practical implementation of hard constraints in the DRL problem. Finally, a DNN Projection embedded twin-delayed deep deterministic policy gradient (DPe-TD3) method is proposed to achieve optimal operation of distribution grids with guaranteed 100% safety of the distribution grid. The safety of the DRL training is guaranteed via the embedded Projection DNN in TD3 with participation in the gradient return process, which could smoothly and effectively project the DRL agent actions into the feasible area, thus guaranteeing the safety of data-driven control and the optimal operation at the same time. The case studies and comparisons are conducted in the IEEE 33 bus system to show the effectiveness of the proposed method.

sted, utgiver, år, opplag, sider
Springer Nature , 2025. Vol. Part F989, s. 313-336
HSV kategori
Identifikatorer
URN: urn:nbn:se:kth:diva-372595DOI: 10.1007/978-3-031-77833-9_11Scopus ID: 2-s2.0-105019185039OAI: oai:DiVA.org:kth-372595DiVA, id: diva2:2013111
Merknad

Part of ISBN 9783031778322, 9783031778339

QC 20251111

Tilgjengelig fra: 2025-11-11 Laget: 2025-11-11 Sist oppdatert: 2025-11-11bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Zhang, MengfanXu, Qianwen

Søk i DiVA

Av forfatter/redaktør
Zhang, MengfanXu, Qianwen
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 12 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf