计算机科学与探索

• 学术研究 •    下一篇

基于预训练的人脸图像伪造算法识别模型

丁博文,芦天亮,彭舒凡,王珑皓   

  1. 中国人民公安大学 信息网络安全学院,北京 100038

Recognition Model of Face Image Forgery Algorithm Based on Pre-training

DING Bowen,  LU Tianliang,  PENG Shufan,  WANG Longhao   

  1. College of Information Network Security, People's Public Security University of China, Beijing 100038, China

摘要: 随着深度学习的发展,人脸伪造手段逐渐增多,现有的深伪人脸检测模型多集中于真假二分类阶段,泛化性较差,难以适应如今不断迭代的伪造算法,并且对受到不同清晰度和噪声等干扰的人脸图像研究不够深入。因此,为解决上述问题,提出了一种基于预训练的多分类人脸图像伪造算法识别方法。具体而言,整个模型分为预训练和主体训练两大模块。在预训练阶段,设计了基于掩码策略改进的PR-MAE(Perception and Reconstruction-MAE),深入挖掘掩码特征来构造感知损失,增强模型全局的感知学习能力;构建了UDenseNet训练网络,利用U型化的稠密块提升模型对细节的捕捉能力,以适应更加复杂的任务;加入多噪声融合模块MNF(Multi-Noise Fusion),以动态注入策略提高模型对多类噪声的抵抗能力,提升模型鲁棒性。在主体训练过程,引入CLIP(Contrastive Language-Image Pretraining)补充学习模块,与预训练后的UDenseNet进行对比学习,以进一步提升模型归类的准确率。实验结果表明,在最新的MCFF(Multi-Classification Fake Faces)和DF40多分类深伪人脸数据集上准确率(Accuracy, ACC)指标达到89.12%和90.25%,在低清晰度图像中ACC平均值达到87.22%,在各类噪声下检测率达到83.36%,泛化性测试中指标最高也能达到90%以上。

关键词: 伪造算法识别, 人脸伪造, 深度学习

Abstract: With the development of deep learning, face forgery methods are gradually increasing. The existing deepfake face detection models are mostly concentrated on the true or false binary classification, with poor generalization, which is difficult to adapt to the current iterative forgery algorithms. What’s more, the researches on the face images with different sharpness and noise interference are not deep enough. Therefore, in order to solve the above problems, this paper proposes a low quality face image forging algorithm recognition method based on pre-training model. Specifically, the entire model is divided into two major modules: pre-training and main training.In the pre-training stage, Perception and Reconstruction-MAE (PR-MAE) based on mask strategy is designed to further excavate mask features to construct perceptual loss and enhance the global perceptual learning ability of the model. And the UDenseNet training network is constructed to improve the ability to capture details by U-shaped dense blocks, in order to adapt to more complex tasks. Meanwhile, the Multi-Noise Fusion (MNF) module is added to improve the model's resistance to multiple types of noise by dynamic injection strategy and enhance the robustness. In the main training process, CLIP supplementary learning module is introduced to perform the comparative learning with the pre-trained UDenseNet to improve the accuracy of the model. The experimental results show that the ACC index of this method can reach 89.12% and 90.25% on the latest MCFF and DF40 classification datasets, the average ACC index can reach 87.22% in the tests of the low definition images, 83.36% in the tests of the various kinds of noise and the index can reach more than 90% at the highest in the generalization test.

Key words: Forgery algorithm recognition, Face forgery, Deep learning