Journal of Frontiers of Computer Science and Technology ›› 2025, Vol. 19 ›› Issue (2): 465-475.DOI: 10.3778/j.issn.1673-9418.2402040

• Artificial Intelligence·Pattern Recognition • Previous Articles     Next Articles

Method of Credit Fraud Detection by Combining Sub-graph Selection and Neighborhood Filtering

TANG Xiaoyong, WANG Haodong   

  1. School of Computer and Communication Engineering, Changsha University of Science & Technology, Changsha 410114, China
  • Online:2025-02-01 Published:2025-01-23

融合子图选择和邻域过滤的信贷欺诈审核方法

唐小勇,王浩东   

  1. 长沙理工大学 计算机与通信工程学院,长沙 410114

Abstract: Credit fraud detection is a hot and difficult research topic in the field of financial fraud detection, especially fraud detection in the scenario of large-scale financial credit transactions. However, the extremely uneven distribution of fraudster nodes in the credit fraud review process and the problem of fraudster nodes disguising themselves have always been important challenges. Therefore, researchers propose a graph neural network model that integrates reconstruction subgraph selection and reinforced neighborhood filtering (RSRF-GNN) for large-scale Internet financial credit dynamic graphs. In order to improve the effectiveness of credit fraud audit, this method first defines the unbalanced distribution of the number of fraudsters and fraud camouflage problem from the data perspective. Then, according to the node category and access degree information design, the balance subgraph selection module is reconstructed to solve the unbalanced distribution of the number of fraudsters. Next, for the fraud camouflage problem, researchers introduce the reinforcement learning framework and design a neighborhood filtering module embedded in dynamic filtering neighborhood nodes. In addition, researchers design an edge aggregation module to aggregate the neighborhood edge embedding of central nodes, further enriching the expression of neighborhood embedding information of central nodes. Finally, experimental verification is conducted on a real dataset DGraph-Fin, and the results show that the RSRF-GNN model proposed in this paper has significantly improved the effectiveness compared with existing models. The RSRF-GNN model is improved by 5 to 8 percentage points in AUC and 18 to 29 percentage points in AP score, which is a significant advantage in model performance.

Key words: credit fraud detection, large-scale dynamic graph, distribution imbalance, fraud disguise, graph neural network

摘要: 信贷欺诈审核是金融欺诈检测领域的研究热点与难点,尤其是大规模金融信贷交易场景下的欺诈检测问题。然而,信贷欺诈审核过程中的欺诈者类节点数量分布极不平衡和欺诈者节点伪装自身问题一直是其所面临的重要挑战。基于此,面向大规模互联网金融信贷动态图提出融合重构子图选择和强化邻域过滤的图神经网络(RSRF-GNN)模型,以提高信贷欺诈审核的有效性。该方法从数据角度定义欺诈者数量分布不平衡和欺诈伪装问题。依据节点类别和出入度信息设计重构平衡子图选择模块以解决欺诈者数量分布不平衡问题。针对欺诈伪装问题,设计动态过滤邻域节点嵌入的邻域过滤模块。设计了边聚合模块聚合中心节点的邻域边嵌入,进一步提高中心节点邻域嵌入信息表达效果。为验证RSRF-GNN模型的有效性,在真实数据集DGraph-Fin上进行实验,结果表明RSRF-GNN模型在有效性方面比现有模型有较大提升。RSRF-GNN模型在AUC上提高了5~8个百分点,AP分数表现提高了18~29个百分点,模型性能优势显著。

关键词: 信贷欺诈审核, 大规模动态图, 分布不平衡, 欺诈伪装, 图神经网络