Journal of Frontiers of Computer Science and Technology

• Science Researches •     Next Articles

Open-World Occluded Object Detection: Multi-Level Feature Fusion and Enhanced Feature Representation

JIANG Yanji,  CHEN Pengda,  DONG Hao,  LIU Daqian,  FEI Bowen   

  1. 1. College of Software, Liaoning Technical University, Huludao, Liaoning 125105
    2. OpenSafe Lab, Uter (Jiangsu) SafetyTechnology Co., Ltd, Suzhou, Jiangsu 215100
    3. Tsinghua University Suzhou Automobile Research Institute, Suzhou, Jiangsu 215100
    4. School of Automotive Engineering, Suzhou University of Technology, Suzhou, Changshu 215104

开放世界遮挡目标检测:多层次特征融合与增强特征表示

姜彦吉,陈鹏达,董浩,刘大千,费博雯   

  1. 1. 辽宁工程技术大学 软件学院,辽宁 葫芦岛 125105
    2. 优策(江苏)安全科技有限公司OpenSafe实验室,江苏 苏州 215100
    3. 清华大学苏州汽车研究院(相城),江苏 苏州 215100
    4. 苏州工学院 汽车工程学院,江苏 常熟 215104

Abstract: Solving the problem of occluded object detection is a challenging task in the research of open-world autonomous driving object detection. This paper mainly studies the detection of occluded objects in open-world autonomous driving scenarios, where the diversity of unknown occlusion patterns leads to the sparsity and semantic ambiguity of the features presented by occluded objects in images, as well as the uncertainty of object shapes. By leveraging the information differences in feature maps of different dimensions and the relationships between the whole and parts of the object and within parts, this paper proposes an occluded object detection algorithm based on multi-level feature fusion and enhanced feature representation. Firstly, a co-occurrence spatial graph module is designed to enhance the features within and between parts, improving the feature representation of occluded objects and the semantic recovery ability of occluded regions. Secondly, a cross-level scale feature pyramid network is adopted to enhance the context information and the correlation between features of the core features. Finally, a parallel adaptive feature fusion module is used to fuse features of different levels, alleviating feature loss and improving the performance of object detection. The proposed method is compared with existing detection algorithms on the datasets Pascal VOC, BDD100K, and a self-made dataset. Compared with the top 2 algorithms, the increase in the recall rate and mAP of unknown classes ranges from 1.4% to 1.9% and 0.5% to 1.5% (relative). This algorithm aims to improve the detection accuracy of occluded objects in open-world autonomous driving scenarios by learning the feature correlations of occluded objects, effectively alleviating the adverse effects of feature sparsity and semantic ambiguity. Experiments verify that this algorithm can effectively improve the detection performance of occluded objects, demonstrate good generalization ability, and enhance the detection ability of unknown objects.

Key words: Open World, Occluded Object Detection, Feature Fusion, Graph Convolutional Neural Network

摘要: 解决遮挡目标检测问题是开放世界自动驾驶目标检测研究中一项充满挑战的任务。本文主要研究在开放世界自动驾驶场景中,由于未知遮挡模式的多样性,被遮挡目标在图像中呈现的特征稀疏性与语义模糊性以及目标形态的不确定性,利用不同维度特征图的信息差异性以及目标整体与部分间和部分内的关系进行遮挡目标检测。提出了一种基于多层次特征融合和增强特征表示的遮挡目标检测算法。首先,设计了共现空间图模块完成部分内与部分间的特征增强,提高遮挡目标的特征表现,增强遮挡区域的语义恢复能力。其次,采用跨层次尺度的特征金字塔网络增强核心特征的上下文信息和特征之间的关联性。最后,通过并行自适应特征融合模块融合不同层次的特征,缓解特征丢失,提高目标检测性能。所提方法在数据集Pascal VOC,BDD100K和自制数据集上与现有检测算法进行比较,相较Top2算法的未知类召回率和mAP的增值范围在1.4%-1.9%和0.5%-1.5%(相对)。该算法旨在提升开放世界自动驾驶场景中遮挡目标的检测精度,通过学习遮挡目标的特征关联性,有效缓解了特征稀疏与语义模糊带来的不利影响。实验验证了本算法可以有效提升遮挡目标的检测性能,展现了很好的泛化能力,并增强对未知目标的检测能力。

关键词: 开放世界, 遮挡目标检测, 特征融合, 图卷积神经网络