计算机科学与探索 ›› 2021, Vol. 15 ›› Issue (1): 141-149.DOI: 10.3778/j.issn.1673-9418.2003005

• 人工智能 • 上一篇    下一篇

深度图注意力CNN的三维模型识别

党吉圣,杨军   

  1. 兰州交通大学 电子与信息工程学院,兰州 730070
  • 出版日期:2021-01-01 发布日期:2021-01-07

3D Model Recognition Based on Deep Graph Attention CNN

DANG Jisheng, YANG Jun   

  1. School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
  • Online:2021-01-01 Published:2021-01-07

摘要:

针对现有基于深度学习的三维模型识别方法缺乏结合三维模型的上下文细粒度局部特征,可能造成几何形状极其相似,局部细节信息略有不同的类识别混淆的问题,提出一种基于深度图注意力卷积神经网络的三维模型识别方法。首先,通过引入邻域选择机制挖掘三维模型的细粒度局部特征。其次,通过空间上下文编码机制捕捉多尺度空间上下文信息,且与细粒度局部特征相互补偿以增强特征的完备性。最后,采用一种多头部机制,使图注意力卷积层聚合多个单头部的特征以增强特征的丰富性。此外,设计选择性丢弃算法,根据度量权重值对神经元重要性进行排序,智能地丢弃重要性较低的神经元来防止网络过拟合。算法在ModelNet40数据集上的三维模型识别准确率达到了92.6%,且网络复杂度较低,在三维模型识别准确率和网络复杂度之间达到最佳平衡,优于当前主流方法。

关键词: 机器视觉, 三维模型识别, 图注意力卷积层, 卷积神经网络(CNN), 选择性丢弃

Abstract:

In order to solve the problem that the existing 3D model recognition methods based on deep learning lack the contextual fine-grained local features of 3D models, which may cause the confusion of recognition with very similar geometric shapes and slightly different local details, a 3D model recognition method based on deep graph attention convolutional neural network is proposed. Firstly, the fine-grained local features of 3D models are mined by introducing a neighborhood selection mechanism. Secondly, the multi-scale spatial context information is captured by a spatial context coding mechanism, and is compensated with fine-grained local features to enhance the completeness of features. Finally, a multi-head mechanism is adopted to make the graph attention convolution layer aggregate features of multiple single-head to enhance the richness of features. In addition, a selective dropout algorithm is designed to prevent network overfitting, which ranks the importance of neurons according to value of the measure-ment weight, and intelligently discards those with lower importance. The accuracy of 3D model recognition on the ModelNet40 dataset of the algorithm in this paper reaches 92.6%, and the network complexity is low. The trade-off between accuracy rate of 3D model recognition and network complexity achieved by the proposed algorithm is superior to contemporary mainstream methods.

Key words: machine vision, 3D model recognition, graph attention convolution layer, convolutional neural network(CNN), selectable dropout