TD3LVSL: A lane-level variable speed limit approach based on twin delayed deep deterministic policy gradient in a connected automated vehicle environmentShow others and affiliations
2023 (English)In: Transportation Research Part C: Emerging Technologies, ISSN 0968-090X, E-ISSN 1879-2359, Vol. 153, article id 104221Article in journal (Refereed) Published
Abstract [en]
Variable speed limit (VSL) control plays a vital role in the emerging connected automated vehicle highway (CAVH) system, which can alleviate recurrent traffic congestion caused by capacity drop and accidents. However, the effect of the VSL on improving traffic efficiency is still controversial. It is necessary to study how to explore the potential benefits of the VSL by balancing its influence on reducing crash risk and enhancing traffic efficiency. To fill the technological gap above, this paper proposes a reinforcement learning-based lane-level VSL (LVSL) control approach to conduct refined traffic control on the mainlines. Firstly, an actor-critic framework is developed to generate and evaluate the discrete speed limits of each lane in continuous action space. To optimize traffic control performance, a hybrid reward function is developed by synchronously considering traffic safety and traffic efficiency of the bottleneck area. Then, to solve the overestimation bias problem of the actor-critic methods caused by function approximation error, a twin delayed deep deterministic policy gradient (TD3) method is introduced to train the framework of the LVSL method. Finally, a real-world recurrent bottleneck of the State Route 91 highway in California is simulated with consideration of the connected automated vehicles to examine the performance of the TD3-based LVSL (TD3LVSL) controller. The simulation results reveal that the proposed method is capable of reducing crash risk and improving traffic efficiency synchronously. Compared with the LVSL controller based on the deep deterministic policy gradient approach, the TD3LVSL controller shows better performance in terms of traffic safety and efficiency. These findings indicate that the proposed controller could contribute to future traffic control in a CAVH environment.
Place, publisher, year, edition, pages
Elsevier BV , 2023. Vol. 153, article id 104221
Keywords [en]
Connected automated vehicle, Deep reinforcement learning, Lane segment, Twin delayed deep deterministic policy gradient (TD3), Variable speed limit
National Category
Transport Systems and Logistics Control Engineering Vehicle and Aerospace Engineering
Identifiers
URN: urn:nbn:se:kth:diva-334867DOI: 10.1016/j.trc.2023.104221ISI: 001147343200001Scopus ID: 2-s2.0-85163144553OAI: oai:DiVA.org:kth-334867DiVA, id: diva2:1792060
Note
QC 20231123
2023-08-282023-08-282025-02-14Bibliographically approved