计算机科学与探索 ›› 2022, Vol. 16 ›› Issue (1): 231-241.DOI: 10.3778/j.issn.1673-9418.2105033

• 图形图像 • 上一篇    下一篇

改进YOLOv5的交通灯实时检测鲁棒算法

钱伍, 王国中, 李国平+()   

  1. 上海工程技术大学 电子电气工程学院,上海 201620
  • 收稿日期:2021-05-11 修回日期:2021-08-13 出版日期:2022-01-01 发布日期:2021-08-25
  • 通讯作者: + E-mail: liguoping@sues.edu.cn
  • 作者简介:钱伍(1995—),男,硕士研究生,主要研究方向为计算机视觉、深度学习。
    王国中(1962—),男,博士,教授,博士生导师,主要研究方向为视频编解码、图像处理、机器学习。
    李国平(1974—),男,博士,高级工程师,硕士生导师,主要研究方向为音视频编码、智能媒体处理、机器学习与识别。
  • 基金资助:
    国家重点研发计划(2019YFB1802700)

Improved YOLOv5 Traffic Light Real-Time Detection Robust Algorithm

QIAN Wu, WANG Guozhong, LI Guoping+()   

  1. School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
  • Received:2021-05-11 Revised:2021-08-13 Online:2022-01-01 Published:2021-08-25
  • About author:QIAN Wu, born in 1995, M.S. candidate. His research interests include computer vision and deep learning.
    WANG Guozhong, born in 1962, Ph.D., professor, Ph.D. supervisor. His research interests include video codec, image processing and machine learning.
    LI Guoping, born in 1974, Ph.D., senior engi-neer, M.S. supervisor. His research interests in-clude audio and video coding, intelligent media processing, machine learning and recognition.
  • Supported by:
    National Key Research and Development Program of China(2019YFB1802700)

摘要:

交通灯检测算法作为自动驾驶任务中的一个重要环节,直接关系到智能汽车的行车安全。因为交通灯尺度小且环境复杂,给算法研究带来了困难。针对交通检测存在的痛点,提出改进YOLOv5的交通灯检测算法。首先使用可见标签比确定模型输入;然后引入ACBlock结构增加主干网络的特征提取能力,设计SoftPool减少主干网络的采样信息损失,使用DSConv卷积核减少模型参数;最后设计了记忆性特征融合网络,高效利用了高级语义信息和底层特征。对模型输入和主干网络的改进,直接提高模型在复杂环境下对特征的提取能力;对特征融合网络的改进,使模型能够充分利用特征信息,增加对目标定位和边界回归的精准度。实验结果表明,改进后的方法在BDD100K数据集上取得了74.3%的AP和111 frame/s的检测速度,比YOLOv5提高11.0个百分点的AP;在Bosch数据集上取得了84.4%的AP和126 frame/s的检测速度,比YOLOv5提高9.3个百分点的AP。鲁棒性测试结果表明,改进后的模型在各种复杂环境中对目标的检测能力都有显著提升,鲁棒性增加,做到了高精度实时检测。

关键词: 交通灯检测, YOLOv5, 记忆性特征融合网络, BDD100K, 实时检测

Abstract:

Traffic light detection algorithm, a critical procedure for realization of automatic driving, is directly related to the driving safety of intelligent vehicles. However, due to the small size of traffic lights and complicated environment, the algorithm research meets plenty of difficulties. This paper puts forward a traffic light detection algorithm based on optimized YOLOv5. Firstly, it uses a visible label ratio to determine the model input. Secondly, the ACBlock structure is introduced to increase the feature extraction ability of the backbone network; the SoftPool is designed to reduce the sample loss of the backbone network and the DSConv convolution kernel is used to reduce the model parameters. Finally, a memory feature fusion network is designed to efficiently utilize high level semantic information and low level features. As a result, the improvement of model input and backbone network directly improves the feature extraction ability of the model in complex environment; the improvement of feature fusion network enables the model to make full use of feature information and increase the accuracy of target positioning and boundary regression. Experimental results show that, the proposed algorithm achieves 74.3% AP and 111 frame/s detection speed on BDD100K, which is 11.0 percentage points higher than the AP of YOLOv5. In Bosch data set, 84.4% AP and 126 frame/s detection speed are obtained, which is 9.3 percentage points higher than the AP of YOLOv5. The robustness test results show that the proposed algorithm has significantly improved the detection ability of tar-gets in a variety of complex environments, and the robustness is increased to achieve high-precision real-time detection.

Key words: traffic light detection, YOLOv5, memory feature fusion network, BDD100K, real-time detection

中图分类号: