Vehicle and Pedestrian Detection Algorithm Based on Attention Scale Sequence Fusion

doi:10.3969/j.issn.1674-0696.2025.07.10

Journal of Chongqing Jiaotong University(Natural Science) ›› 2025, Vol. 44 ›› Issue (7): 75-82.DOI: 10.3969/j.issn.1674-0696.2025.07.10

• Transportation+Big Data & Artificial Intelligence • Previous Articles

Vehicle and Pedestrian Detection Algorithm Based on Attention Scale Sequence Fusion

LI Jun1, 2, ZOU Jun1, CHEN Cui2, ZHANG Shiyi3

(1. School of Mechatronics and Vehicle Engineering, Chongqing Jiaotong University, Chongqing 400074, China; 2. School of Vehicle and Transportation, Chongqing Vocational and Technical University of Mechatronics, Chongqing 402760, China; 3. School of Navigation and Ship Engineering, Chongqing Jiaotong University, Chongqing 400074, China)

Received:2024-07-09 Revised:2025-03-19 Published:2025-07-31

基于注意力尺度序列融合的车辆行人检测算法

李军1,2，邹军1，陈翠2，张世义3

(1. 重庆交通大学机电与车辆工程学院，重庆 400074；2. 重庆机电职业技术大学车辆与交通学院，重庆 402760； 3. 重庆交通大学航海与船舶工程学院，重庆 400074)

作者简介:李军（1964—），男，重庆人，博士，教授，主要从事计算机视觉、智能网联汽车技术方面的研究。E-mail:cqleejun@163.com 通信作者：邹军（1998—），男，重庆人，硕士研究生，主要从事车辆工程与计算机视觉方面的研究。E-mail:673312348@qq.com
基金资助:
重庆市技术创新与应用发展专项基金项目(CSTB2022TIAD-STX0003)；国家自然科学基金项目（52172381）

Abstract

Abstract: In view of the problems of low detection accuracy and high missed detection rate in vehicle and pedestrian detection at roadside ends, a vehicle and pedestrian detection algorithm YOLOv8-APC based on attention scale sequence fusion was proposed. Firstly, the scale sequence fusion module (SSFF) and the three-feature encoder (TFE) were used in the neck network to enhance the extraction and fusion of multi-scale information, meanwhile, the channel and position attention mechanism (CPAM) was introduced to improve the detection accuracy. Then, the P2 detection layer was added on the basis of the improved network structure to improve the detection ability of small targets and reduce the missed detection rate. Finally, the C2f_GhostDynamicConv (C2f_GDC) module was applied in the backbone network to effectively reduce the complexity of the model. To verify the effectiveness of the proposed algorithm, the validation was conducted on the roadside end dataset Vapddsits in the Chongqing Science Valley Demonstration Zone. The experimental results show that the mAP50 value and recall rate of YOLOv8-APC are 11.1% and 11.9% higher than those of the original model; the parameter quantity and model volume are only 1.85M and 4.1MB respectively, which are 38.3% and 34.9% lower than those of the original model. The proposed algorithm can achieve more accurate detection of distant small targets and occluded targets, which doesn’t occupy too much memory resources, providing a solution for vehicle and pedestrian detection at roadside ends.

Key words: traffia and transportation engineering; YOLOv8; vehicles and pedestrians; feature extraction; attention mechanism; scale sequence fusion

摘要： 针对在路侧端车辆与行人检测中存在检测精度低，漏检率较高等问题，提出了一种注意力尺度序列融合的车辆行人检测算法YOLOv8-APC。首先，在颈部网络中使用尺度序列融合模块SSFF与三特征编码器TFE，以增强对多尺度信息的提取与融合，同时引入通道与位置注意力机制CPAM提高检测精度。然后，在改进后的网络结构基础上增加P2检测层，提高对小目标的检测能力，降低漏检率。最后，在主干网络中应用C2f_GhostDynamicConv (C2f_GDC)模块，有效降低模型的复杂度。为验证算法的有效性，在重庆科学谷示范区路侧端数据集Vapddsits上进行验证，实验结果表明：YOLOv8-APC的mAP50值与召回率较原模型提升了11.1%、11.9%；参数量与模型体积分别仅有1.85 M、4.1 MB，分别较原模型下降了38.3%、34.9%，其对远距离小目标以及遮挡目标能够实现更为准确的检测，且不会占用过多的内存资源，为路侧端车辆行人检测提供了一种解决方案。

关键词: 交通运输工程；YOLOv8；车辆与行人；特征提取；注意力机制；尺度序列融合

CLC Number:

TP391.4

LI Jun1, 2, ZOU Jun1, CHEN Cui2, ZHANG Shiyi3. Vehicle and Pedestrian Detection Algorithm Based on Attention Scale Sequence Fusion[J]. Journal of Chongqing Jiaotong University(Natural Science), 2025, 44(7): 75-82.

李军1,2，邹军1，陈翠2，张世义3. 基于注意力尺度序列融合的车辆行人检测算法[J]. 重庆交通大学学报（自然科学版）, 2025, 44(7): 75-82.

References

［1］ WANG C Y, BOCHKOVSKIY A, LIAO H M. YOLOv7:Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors ［C］∥2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Vancouver, BC, Canada. IEEE, 2023: 7464-7475.
［2］ SHAHIN S, SADEGHIAN R, SAREH S. Faster R-CNN-based decision making in a novel adaptive dual-mode robotic anchoring system ［C］ ∥2021 IEEE International Conference on Robotics and Automation (ICRA). Xi’an, China. IEEE, 2021: 11010-11016.
［3］ CAI Zhaowei, VASCONCELOS N. Cascade R-CNN: Delving into high quality object detection ［C］ ∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT, USA. IEEE, 2018: 6154-6162.
［4］ GUO Lie, ZHAO Yibing, GAO Jiandong. Compression of vehicle and pedestrian detection network based on YOLOv3 model ［J］. IEICE Transactions on Information and Systems, 2023(5): 735-745.
［5］董恒祥, 潘江如, 董芙楠, 等. 基于改进YOLOv5s模型的车辆及行人检测方法［J］. 北华大学学报(自然科学版), 2024, 25(2): 244-254.
DONG Hengxiang, PAN Jiangru, DONG Funan, et al. Vehicle and pedestrian detection method based on improved YOLOv5s model ［J］. Journal of Beihua University (Natural Science), 2024, 25(2): 244-254.
［6］ ZHANG Yu, GUO Zhongyin, WU Jianqing, et al. Real-time vehicle detection based on improved YOLOv5 ［J］. Sustainability, 2022, 14(19): 12274.
［7］ ZHU Xingkui, LYU Shuchang, WANG Xu, et al. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios ［C］∥2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW). Montreal, BC, Canada. IEEE, 2021: 2778-2788.
［8］蔡刘畅, 杨培峰, 张秋仪. 基于YOLOv7的道路监控车辆检测方法［J］. 陕西科技大学学报, 2023, 41(6): 155-161.
CAI Liuchang, YANG Peifeng, ZHANG Qiuyi. Vehicle detection method based on YOLOv7 in traffic monitoring ［J］. Journal of Shaanxi University of Science & Technology, 2023, 41(6): 155-161.
［9］彭红星, 袁畅, 柯威曳, 等. 基于改进YOLOv5的高速公路隧道车辆和人员检测［J］. 科学技术与工程, 2024, 24(6): 2453-2461.
PENG Hongxing, YUAN Chang, KE Weiye, et al. Vehicle and personnel detection in highway tunnels based on improved YOLOv5 ［J］. Science Technology and Engineering, 2024, 24(6): 2453-2461.
［10］邓天民, 刘金凤, 王春霞, 等. 基于内容感知重组特征的车辆行人检测算法［J］. 重庆交通大学学报(自然科学版), 2023, 42(10): 132-141.
DENG Tianmin, LIU Jinfeng, WANG Chunxia, et al. Vehicle and pedestrian detection algorithm based on content-aware reassembly of features ［J］. Journal of Chongqing Jiaotong University (Natural Science), 2023, 42(10): 132-141.
［11］高瑞贞, 李树楠, 李晓辉. 机器人视觉中行人和车辆检测算法的研究［J］. 机械设计与制造, 2023(10): 277-280.
GAO Ruizhen, LI Shunan, LI Xiaohui. Research on pedestrian and vehicle detection algorithms in robot vision ［J］. Machinery Design & Manufacture, 2023(10): 277-280.
［12］王雪秋, 高焕兵, 郏泽萌. 改进YOLOv8的道路缺陷检测算法［J］. 计算机工程与应用, 2024, 60(17): 179-190.
WANG Xueqiu, GAO Huanbing, JIA Zemeng. Improved road defect detection algorithm based on YOLOv8 ［J］. Computer Engineering and Applications, 2024, 60(17): 179-190.
［13］ KANG M, TING C M, TING F F, et al. ASF-YOLO: A novel YOLO model with attentional scale sequence fusion for cell instance segmentation ［J］. Image and Vision Computing, 2024, 147: 105057.
［14］史涛, 刘祖林, 朱文旭, 等. 基于改进YOLOv5s的车辆行人检测［J］. 国外电子测量技术, 2023, 42(12): 195-200.
SHI Tao, LIU Zulin, ZHU Wenxu, et al. Vehicle and pedestrian detection based on improved YOLOv5s ［J］. Foreign Electronic Measurement Technology, 2023, 42(12): 195-200.
［15］李琳, 靳志鑫, 俞晓磊, 等. Haar小波下采样优化YOLOv9的道路车辆和行人检测［J］. 计算机工程与应用, 2024, 60(20): 207-214.
LI Lin, JIN Zhixin, YU Xiaolei, et al. Road vehicle and pedestrian detection based on Haar wavelet down sampling optimized YOLOv9 ［J］. Computer Engineering and Applications, 2024, 60(20): 207-214.

[1]	ZHAO Shuen, LIU Wei. Low-Illumination Road Traffic Sign Recognition Based on Improved VGG Model [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(10): 178-184.
[2]	LI Shengyong1, ZHANG Zhihua1, WANG Shengnan2, WANG Meng2. Defect Recognition Algorithm for Vehicle Metal Materials [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(07): 136-144.
[3]	SU Zhouyu, LAN Quanxiang, YUAN Quan, CAO Jianqiu. Potholes Image Edge Extraction Based on PCNN and Morphology [J]. Journal of Chongqing Jiaotong University(Natural Science), 2016, 35(1): 60-65.
[4]	Cao Jianqiu, Lan Quanxiang, Wang Danmei. Potholes Measurement Method Based on Image Processing Without Reference Material [J]. Journal of Chongqing Jiaotong University(Natural Science), 2015, 34(4): 57-61.
[5]	Zhao Mengmeng，Cao Jianqiu. Image Ｒegistration Algorithm of SIFT Based on Edge and Corner Point [J]. Journal of Chongqing Jiaotong University(Natural Science), 2013, 32(4): 721-724.
[6]	Feng Huanfei，He Youquan，Liu Chong. Adaptive Median Filter Based on Neighborhood Correlation [J]. Journal of Chongqing Jiaotong University(Natural Science), 2013, 32(3): 547-550.
[7]	DENG Tian-min, YU Yong, SHAO Yi-ming. A Novel Method of Vehicle Classification Based on Image Identification [J]. Journal of Chongqing Jiaotong University(Natural Science), 2008, 27(6): 1142-1145.
[8]	DU Ting-na. Morphological operation and structure element analysis about 2-numeric image [J]. Journal of Chongqing Jiaotong University(Natural Science), 2006, 25(1): 162-164.
[9]	WANG Xiaoping1, SHI Xinlan2. Improved MDnet Target Tracking Algorithm in Complex Traffic Scene [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(12): 19-26.
[10]	ZHAO Fengkui, CHENG Haifei, SU Shanshan, ZHANG Yong. Traffic Scene Target Detection Technology Based on Improved CenterNet [J]. Journal of Chongqing Jiaotong University(Natural Science), 2022, 41(12): 11-17.
[11]	DENG Tianmin1，LIU Jinfeng1，WANG Chunxia1，LI Qingying2. Vehicle and Pedestrian Detection Algorithm Based on Content-Aware Reassembly of Features [J]. Journal of Chongqing Jiaotong University(Natural Science), 2023, 42(10): 132-141.
[12]	PENG Jing, GAO Baoqu. Rail Surface Defect Detection Based on Multi-Perception Synergy and Hybrid Sampling Strategy [J]. Journal of Chongqing Jiaotong University(Natural Science), 2025, 44(7): 41-50.

Vehicle and Pedestrian Detection Algorithm Based on Attention Scale Sequence Fusion

基于注意力尺度序列融合的车辆行人检测算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 12

Recommended Articles

Metrics