计算机科学与探索

• 学术研究 •    下一篇

基于Transformer和多关系图卷积网络的行人轨迹预测

刘桂红,周宗润,孟祥福   

  1. 辽宁工程技术大学,辽宁 葫芦岛 125105

Pedestrian trajectory prediction based on Transformers and Multi-Relation Graph Convolutional Networks

LIU Guihong, ZHOU Zongrun, MENG Xiangfu   

  1. Liaoning Technical University, Huludao, Liaoning 125105, China

摘要: 在自动导航应用领域,行人轨迹相对复杂,有效且合理预测行人未来轨迹对自动驾驶和出行安全至关重要。行人轨迹随机性和动态性极高且与交通环境有着复杂相互作用,因此需要对行人的时间依赖性和空间相互作用进行合理建模。为了解决该问题,本文提出了一种基于Transformer和多关系图卷积网络(GCN)的行人轨迹预测模型,该模型由交互捕获模块、锚点控制模块和轨迹修正补全模块构成,交互捕获模块由T-Transformer和多关系图卷积网络组成,分别提取每个行人在时间序列和空间序列上的运动特征,并结合锚点控制模块推断行人的中间目的地以减少递归累计误差,最后由修正补全模块进行最终轨迹细化。此外,在提取特征时添加逆关系可得到更为优化的结果,使用高斯剪枝减少虚假路径的生成也可提高模型效率。在ETH与UCY数据集上的实验结果表明,在平均位移误差(ADE)和最终位移误差(FDE)方面,该模型具有比现有大部分主流模型更好性能。由于在行人轨迹预测上的出色性能,可避免不必要的轨迹变更和碰撞风险,为行人轨迹预测应用提供了更为可能的解决方案。

关键词: T-Transformer, 图卷积网络, 锚点控制, 行人轨迹预测

Abstract: In the field of autonomous navigation, predicting pedestrian trajectories accurately is crucial for ensuring safe travel and autonomous driving. Pedestrian trajectories are highly complex, dynamic, and influenced by their surroundings, necessitating effective modeling of their temporal and spatial interactions. To address this, a model combining Transformer and Multi-Relation Graph Convolutional Networks (GCN) is proposed. It comprises an Interaction Capture Module, Anchor Control Module, and Trajectory Refinement Module. The Interaction Capture Module extracts motion features using T-Transformer and GCN, while the Anchor Control Module reduces errors by inferring intermediate destinations. Finally, the Trajectory Refinement Module enhances predictions. Experimental results on ETH and UCY datasets show superior performance in Average and Final Displacement Errors compared to mainstream models. This model's accuracy minimizes unnecessary trajectory changes and collision risks, offering a promising solution for pedestrian trajectory prediction applications.

Key words: T-Transformer , Graph Convolutional Networks , anchor control , pedestrian trajectory prediction