[1] KOONCE P. Traffic Signal Timing Manual[R]. United States. Federal Highway Administration, 2008.
[2] COOLS S B, GERSHENSON C, HOOGHE B. Self-organizing Traffic Lights: A Realistic Simulation [M].Springer, London,2013.
[3] VARAIYA P. Advances in Dynamic Network Modeling in Complex Transportation System[M]. Springer: New York, 2013.
[4] VAN Der Pol E, OLIEHOEK F A. Coordinated deep reinforcement learners for traffic light control[C]∥ Proceedings of Learning, Inference and Control of Multi-agent Systems. Barcelona, Spain: MIT Press, 2016, 8: 21-38.
[5] MOUSAVI S S, SCHUKAT M, HOWLEY E. Traffic light control using deep policy-gradient and value-function-based reinforcement learning[J]. IET Intelligent Transport Systems, 2017, 11(7): 417-423.
[6] LI Li, LYU Yisheng, WANG Feiyue. Traffic signal timing via deep reinforcement learning[J]. IEEE/CAA Journal of Automatica Sinica, 2016, 3(3): 247-254.
[7] GANAR Y, KUMAR V, DULERA S, et al. Optimizing autonomous intersection control using single agent reinforcement learning[C]∥Proceedings of the 26th International Conference on Distributed Computing and Networking. Hyderabad India. ACM, 2025: 383-389.
[8] 徐建闽,周湘鹏,首艳芳. 基于深度强化学习的自适应交通信号控制研究[J]. 重庆交通大学学报(自然科学版), 2022, 41(8): 24-29.
XU Jianmin, ZHOU Xiangpeng, SHOU Yanfang. Adaptive traffic signal control based on deep reinforcement learning[J].Journal of Chongqing Jiaotong University(Natural Science), 2022, 41(8): 24-29.
[9] HUANG Liben, QU Xiaohui. Improving traffic signal control operations using proximal policy optimization[J]. IET Intelligent Transport Systems, 2023, 17(3): 592-605.
[10] CAI Changjian, WEI Min. Adaptive urban traffic signal control based on enhanced deep reinforcement learning[J]. Scientific Reports, 2024, 14(1): 14116.
[11] CARTA T, ROMAC C, WOLF T, et al. Grounding large language models in interactive environments with online reinforcement learning[J/OL].(2024-10-17)[2025-01-10].https:∥arxiv.org/abs/2302. 02662.
[12] XU Zhenhua, ZHANG Yujia, XIE Enze, et al. DriveGPT4: Interpretable end-to-end autonomous driving via large language model[J]. IEEE Robotics and Automation Letters, 2024, 9(10): 8186-8193.
[13] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[J/OL]. (2023-08-02) [2025-01-10]. https:∥arxiv.org/abs/1706.03762.
[14] RAFFEL C, SHAZEER N M, ROBERTS A, et al. Exploring the limits of transfer learning with a unified text-to-text transformer[J]. J Mach Learn Res, 2019, 21: 140: 1-140: 67.
[15] Géron A. Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems[M]. United States: O'Reilly Media, Inc, 2022.
[16] CHEN Longchuan, LI Zongru. Bailong: Bilingual transfer learning based on QLoRA and zip-tie embedding[J/OL]. (2024-04-01) [2025-01-10]. https:∥arxiv.org/abs/2404.00862.
[17] QIN H, MA X, ZHENG X, et al. Accurate lora-finetuning quantization of llms via information retention[J/OL]. (2024-05-27) [2025-01-10]. https:∥arxiv.org/abs/2402.05445.
[18] WEI Hua, ZHENG Guanjie, GAYAH V , et al. A survey on traffic signal control methods[J/OL]. (2020-01-16) [2025-01-10]. https:∥arxiv.org/abs/1904.08117. |