Journal of Frontiers of Computer Science and Technology

• Science Researches •     Next Articles

A Review of Machine Unlearning

HE Lisong,  YANG Yang   

  1. 1. School of Management, Xi’an Jiaotong University, Xi’an 710049, China
    2. School of Finance and Data Science, Xi’an Eurasia University, Xi’an 710065, China

遗忘学习综述

何黎松, 杨洋   

  1. 1. 西安交通大学 管理学院,西安 710049
    2. 西安欧亚学院 金融与数据科学学院,西安 710065

Abstract: To effectively protect data privacy and implement the "right to be forgotten," it is necessary to eliminate the influence of specific subsets of training data from machine learning models and ensure that this data cannot be reverse-engineered. To address this issue, the research field of "machine unlearning" has emerged in recent years. This paper reviews the progress in machine unlearning research from three aspects: definitions, metrics, and algorithms. Firstly, it systematically outlines the core concepts, definitions, and evaluation metrics of machine unlearning, emphasizing the critical significance of certifiability metrics. Secondly, it categorizes unlearning algorithms into six major classes based on their design principles: structured initial training, influence functions approximate, gradient updates, noise unlearning, knowledge distillation unlearning, and boundary unlearning. It provides detailed descriptions of nine representative machine unlearning algorithms and their evolution. Based on a comparison of existing algorithms' strengths and weaknesses, the paper discusses the potential and significance of constructing a unified framework for machine unlearning based on certification, and analyzes the theoretical and practical relationships between machine unlearning research and privacy protection. Finally, the paper outlines future research directions for machine unlearning, including the need to extend unlearning algorithms to subfields such as fair machine learning, transfer learning, and reinforcement learning; the potential for integrating various design approaches into future unlearning algorithms; the need for collaboration between technology and regulation in unlearning practices; and the benefits of integrating machine unlearning with incremental learning to improve the management and operation efficiency of machine learning models.

Key words: Data Privacy Protection, Machine Unlearning, Certifiability, Unified Framework, Algorithmic Design

摘要: 为了有效保护数据隐私并实现“被遗忘的权力”,需要从机器学习模型中消除特定训练数据子集的影响,并确保这些数据不会被反向推测。为了解决这一问题,近年来形成了“遗忘学习”的研究领域。本文从定义、度量方法和算法三个方面全面介绍遗忘学习的研究进展。首先,梳理了遗忘学习的核心概念定义和评价指标,并着重分析了可认证性指标的重要意义。接着,按照算法设计原理将遗忘算法划分为结构化初训练、影响函数近似估计、梯度更新、噪声遗忘、知识蒸馏遗忘和边界遗忘6大类,并详细介绍了其中9种代表性的遗忘学习算法及其演变。在总结比较已有算法优劣基础上,讨论了遗忘学习统一框架的意义,并分析了遗忘学习研究与隐私保护的理论和实践关系。最后,展望了遗忘学习未来的研究方向,包括:机器学习的公平性、迁移学习和强化学习等子领域尚需拓展遗忘学习算法;未来遗忘算法有可能综合多种设计思路;遗忘实践需要技术与法规的协同合作;以及遗忘学习与增量学习的统一将有助于提高机器学习模型的管理和运营效率。

关键词: 数据隐私保护, 遗忘学习, 可认证性, 统一框架, 算法设计