Traffic Signal Control System Based on Lightweight Large Language Model

doi:10.3969/j.issn.1674-0696.2026.03.08

Journal of Chongqing Jiaotong University(Natural Science) ›› 2026, Vol. 45 ›› Issue (3): 65-72.DOI: 10.3969/j.issn.1674-0696.2026.03.08

• Traffic & Transportation+Artificial Intelligence • Previous Articles

Traffic Signal Control System Based on Lightweight Large Language Model

WANG Haiyong, WANG Menglin, ZHANG Dan

(School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, Gansu, China)

Received:2025-04-17 Revised:2025-11-16 Published:2026-03-24

基于轻量化大语言模型的交通信号控制系统

王海涌，王孟琳，张丹

(兰州交通大学电子与信息工程学院，甘肃兰州 730070)

作者简介:王海涌(1974—)，男，甘肃会宁人，教授，博士，主要从事智能信息处理等方面的研究。E-mail：wanghyong@mail.lzjtu.cn 通信作者：王孟琳(1998—)，男，山东临沂人，硕士研究生，主要从事智慧交通等方面的研究。 E-mail：wangmenglinba@163.com
基金资助:
国家自然科学基金项目（52062028）

Abstract

Abstract: Aiming at the inherent limitation that conventional traffic signal control systems are difficult in adapting to dynamic traffic flow, as well as the problems such as insufficient model generalization capability of existing reinforcement learning methods and high deployment complexity of large language models (LLM), a lightweight large language model-based traffic signal control system (L3M-TCS) was proposed. Firstly, an instruction-based fine-tuning dataset specifically designed for traffic signal control was constructed. Secondly, through fine-tuning and parameter quantization techniques, the LLM was compressed into a lightweight architecture suitable for roadside devices. Finally, a verification system architecture with real-time feedback mechanisms was designed to validate its effectiveness and reliability in real-world traffic environments. Research results demonstrate that: compared to the traditional fixed-timing scheme, L3M-TCS reduces traffic delays by 60.6% and queue lengths by 50.2%. Compared to reinforcement learning methods, L3M-TCS reduces the delay at untrained intersections by 64.7%, while providing natural language-based explanations for decision-making. The proposed model achieves an inference speed of 11.41 tokens/s on roadside device side, with memory footprint compressed to 81.7% of the original model, while maintaining control decision latency within 2 500 ms.

Key words: traffic engineering; intelligent transportation; traffic signal control; large language model; command data set; domain-specific fine-tuning

摘要： 针对传统交通信号控制系统难以适应动态交通流量的固有缺陷，且现有强化学习方法存在的模型泛化能力不足、大语言模型部署复杂度高等问题，提出了基于轻量化大语言模型(LLM)的交通信号控制系统(L3M-TCS)。首先构建面向交通信号控制的指令微调数据集；其次采用微调与参数量化技术，将大语言模型压缩为适用于路侧设备的轻量化架构；最后设计具有实时反馈机制的验证系统架构，验证其在真实交通环境中的有效性和可靠性。研究表明：相较传统固定配时方案L3M-TCS降低了60.6%的交通延误与50.2%的排队长度；相比强化学习方法，L3M-TCS在未训练交叉口的延误降低了64.7%，同时可提供基于自然语言的决策依据解释；模型在路侧设备端的推理速度达到11.41 tokens/s，内存占用压缩至原模型的81.7%，且控制决策延迟在2 500 ms以内。

关键词: 交通工程；智能交通；交通信号控制；大语言模型；指令数据集；领域化微调

CLC Number:

U491.54

WANG Haiyong, WANG Menglin, ZHANG Dan. Traffic Signal Control System Based on Lightweight Large Language Model[J]. Journal of Chongqing Jiaotong University(Natural Science), 2026, 45(3): 65-72.

王海涌，王孟琳，张丹. 基于轻量化大语言模型的交通信号控制系统[J]. 重庆交通大学学报（自然科学版）, 2026, 45(3): 65-72.

References

［1］ KOONCE P. Traffic Signal Timing Manual［R］. United States. Federal Highway Administration, 2008.
［2］ COOLS S B, GERSHENSON C, HOOGHE B. Self-organizing Traffic Lights: A Realistic Simulation ［M］.Springer, London,2013.
［3］ VARAIYA P. Advances in Dynamic Network Modeling in Complex Transportation System［M］. Springer: New York, 2013.
［4］ VAN Der Pol E, OLIEHOEK F A. Coordinated deep reinforcement learners for traffic light control［C］∥ Proceedings of Learning, Inference and Control of Multi-agent Systems. Barcelona, Spain: MIT Press, 2016, 8: 21-38.
［5］ MOUSAVI S S, SCHUKAT M, HOWLEY E. Traffic light control using deep policy-gradient and value-function-based reinforcement learning［J］. IET Intelligent Transport Systems, 2017, 11(7): 417-423.
［6］ LI Li, LYU Yisheng, WANG Feiyue. Traffic signal timing via deep reinforcement learning［J］. IEEE/CAA Journal of Automatica Sinica, 2016, 3(3): 247-254.
［7］ GANAR Y, KUMAR V, DULERA S, et al. Optimizing autonomous intersection control using single agent reinforcement learning［C］∥Proceedings of the 26th International Conference on Distributed Computing and Networking. Hyderabad India. ACM, 2025: 383-389.
［8］徐建闽，周湘鹏，首艳芳. 基于深度强化学习的自适应交通信号控制研究［J］. 重庆交通大学学报（自然科学版）, 2022, 41(8): 24-29.
XU Jianmin, ZHOU Xiangpeng, SHOU Yanfang. Adaptive traffic signal control based on deep reinforcement learning［J］.Journal of Chongqing Jiaotong University(Natural Science), 2022, 41(8): 24-29.
［9］ HUANG Liben, QU Xiaohui. Improving traffic signal control operations using proximal policy optimization［J］. IET Intelligent Transport Systems, 2023, 17(3): 592-605.
［10］ CAI Changjian, WEI Min. Adaptive urban traffic signal control based on enhanced deep reinforcement learning［J］. Scientific Reports, 2024, 14(1): 14116.
［11］ CARTA T, ROMAC C, WOLF T, et al. Grounding large language models in interactive environments with online reinforcement learning［J/OL］.(2024-10-17)［2025-01-10］.https:∥arxiv.org/abs/2302. 02662.
［12］ XU Zhenhua, ZHANG Yujia, XIE Enze, et al. DriveGPT4: Interpretable end-to-end autonomous driving via large language model［J］. IEEE Robotics and Automation Letters, 2024, 9(10): 8186-8193.
［13］ VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need［J/OL］. (2023-08-02) ［2025-01-10］. https:∥arxiv.org/abs/1706.03762.
［14］ RAFFEL C, SHAZEER N M, ROBERTS A, et al. Exploring the limits of transfer learning with a unified text-to-text transformer［J］. J Mach Learn Res, 2019, 21: 140: 1-140: 67.
［15］ Géron A. Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems［M］. United States: O'Reilly Media, Inc, 2022.
［16］ CHEN Longchuan, LI Zongru. Bailong: Bilingual transfer learning based on QLoRA and zip-tie embedding［J/OL］. (2024-04-01) ［2025-01-10］. https:∥arxiv.org/abs/2404.00862.
［17］ QIN H, MA X, ZHENG X, et al. Accurate lora-finetuning quantization of llms via information retention［J/OL］. (2024-05-27) ［2025-01-10］. https:∥arxiv.org/abs/2402.05445.
［18］ WEI Hua, ZHENG Guanjie, GAYAH V , et al. A survey on traffic signal control methods［J/OL］. (2020-01-16) ［2025-01-10］. https:∥arxiv.org/abs/1904.08117.

[1]	SHI Qin, CHEN Hujiang, CHEN Yikai. Speed Guidance Strategy of Intelligent Connected Vehicle Platoon at Signalized Intersection [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(02): 47-53.
[2]	WANG Anlin，SUN Xiaolong，ZHONG Fusheng. Self-organizing Control Method for Urban Traffic Signal Based on the Pass-Priority Rules [J]. Journal of Chongqing Jiaotong University(Natural Science), 2018, 37(02): 96-101.
[3]	XIA Xin-hai，XU Lun-hui. Method of Intersection Agent Coordination Based on Game-learning [J]. Journal of Chongqing Jiaotong University(Natural Science), 2010, 29(2): 269-271.
[4]	ZHAO Xin1, FENG Lei1, LIN Haobo2, 3, CHEN Xi1, XIAO Yuzhou1. Green Wave Control Method of Non-straight Path at Adjacent T-shaped Intersection [J]. Journal of Chongqing Jiaotong University(Natural Science), 2022, 41(09): 18-25.
[5]	XU Jianmin1, LIU Peng1, SHOU Yanfang2, LIN Yongjie1,2, LU Kai1. Coordinated Control of Maximum Bandwidth of Arterial Green Waves Considering the Disturbance of Vehicle Flow Speed Distribution [J]. Journal of Chongqing Jiaotong University(Natural Science), 2024, 43(3): 64-72.

Traffic Signal Control System Based on Lightweight Large Language Model

基于轻量化大语言模型的交通信号控制系统

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 5

Recommended Articles

Metrics