[1] SWAPNO S M M R, NOBEL S N, MEENA P, et al. A reinforcement learning approach for reducing traffic congestion using deep Q learning[J]. Scientific Reports, 2024, 14: 30452.
[2] WANG Tao, ZHU Zhipeng, ZHANG Jing, et al. A large-scale traffic signal control algorithm based on multi-layer graph deep reinforcement learning[J]. Transportation Research Part C: Emerging Technologies, 2024, 162: 104582.
[3] CHIOU S W. Co-evolutionary traffic signal control using reinforcement learning for road networks under stochastic capacity[J]. Applied Soft Computing, 2024, 161: 111701.
[4] DEVAILLY F X, LAROCQUE D, CHARLIN L. Model-based graph reinforcement learning for inductive traffic signal control[J]. IEEE Open Journal of Intelligent Transportation Systems, 2024, 5: 238-250.
[5] LI Mi, PAN Xiaolong, LIU Chuhui, et al. Federated deep reinforcement learning-based urban traffic signal optimal control[J]. Scientific Reports, 2025, 15: 11724.
[6] 张萌, 王殿海, 金盛. 结合领域经验的深度强化学习信号控制方法[J]. 浙江大学学报(工学版), 2023, 57(12): 2524-2532.
ZHANG Meng, WANG Dianhai, JIN Sheng. Deep reinforcement learning approach to signal control combined with domain experience[J]. Journal of Zhejiang University (Engineering Science), 2023, 57(12): 2524-2532, 2543.
[7] YUSOP M A M, MANSOR H, GUNAWAN T S, et al. Intelligent traffic lights using Q-learning[C]//2022 IEEE 8th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA). IEEE, 2022: 200-204.
[8] 李振龙, 张靖思, 刘钦, 等. 基于改进Q学习的双周期干线信号协调控制方法[J]. 科学技术与工程, 2021, 21(29): 12744-12750.
LI Zhenlong, ZHANG Jingsi, LIU Qin, et al. Coordinated control method of double-cycling arterial based on improved Q-learning[J]. Science Technology and Engineering, 2021, 21(29): 12744-12750.
[9] 张永梅, 赵家瑞, 吴爱燕. 好奇心驱动的深度强化学习机器人路径规划算法[J]. 科学技术与工程, 2022, 22(25): 11075-11083.
ZHANG Yongmei, ZHAO Jiarui, WU Aiyan. Robot path planning algorithm based on curiosity-driven deep reinforcement learning[J]. Science Technology and Engineering, 2022, 22(25): 11075-11083.
[10] ZHANG Huizhen, FANG Zhenwei, CHEN Youqing, et al. Traffic signal optimization control method based on attention mechanism updated weights double deep Q network[J]. Complex & Intelligent Systems, 2025, 11(5): 217.
[11] ZHENG Yanliu, LUO Juan, GAO Han, et al. Pri-DDQN: Learning adaptive traffic signal control strategy through a hybrid agent[J]. Complex & Intelligent Systems, 2024, 11(1): 47.
[12] 苏杰, 刘光宇, 暨仲明, 等. 改进DDPG算法在外骨骼机械臂轨迹运动中的应用[J]. 传感器与微系统, 2023, 42(2): 149-152, 160.
SU Jie, LIU Guangyu, JI Zhongming, et al. Application of improved DDPG algorithm in trajectory motion of exoskeleton manipulator[J]. Transducer and Microsystem Technologies, 2023, 42(2): 149-152, 160.
[13] ZHANG Weibin, YAN Chen, LI Xiaofeng, et al. Distributed signal control of arterial corridors using multi-agent deep reinforcement learning[J]. IEEE Transactions on Intelligent Transportation Systems, 2023, 24(1): 178-190.
[14] CHEN Xinning, LIU Xuan, LUO Canhui, et al. Robust multi-agent reinforcement learning for noisy environments[J]. Peer-to-Peer Networking and Applications, 2022, 15(2): 1045-1056.
[15] WU Lan, WU Yuanming, QIAO Cong, et al. Multiagent soft actor-critic for traffic light timing[J]. Journal of Transportation Engineering, Part A: Systems, 2023, 149(2): 04022133. |