Journal of Frontiers of Computer Science and Technology ›› 2025, Vol. 19 ›› Issue (2): 334-343.DOI: 10.3778/j.issn.1673-9418.2403008

• Frontiers·Surveys • Previous Articles     Next Articles

Survey of Research on Curriculum Learning in Neural Machine Translation

HU Chunyue, SI Qintu, WANG Siriguleng   

  1. College of Computer Science and Technology, Inner Mongolia Normal University, Hohhot 010022, China
  • Online:2025-02-01 Published:2025-01-23

神经机器翻译中课程学习研究综述

胡春月,斯琴图,王斯日古楞   

  1. 内蒙古师范大学 计算机科学技术学院,呼和浩特 010022

Abstract: Curriculum learning, as an emerging technology, has gradually attracted attention in recent years. It is in line with human learning habits, from simple to difficult, from shallow to deep. Its core idea is to allow the model to learn from simple and basic concepts, and gradually transition to more complex and higher-level content. In the translation of neural machines, curriculum learning is a training strategy to help models learn in accordance with certain laws. Curriculum learning has been proven to accelerate the convergence of the model and improve the quality and stability of the model translation. This paper first introduces the definition and basic framework of curriculum learning from the perspective of machine learning, and further explores its application in the field of neural machine translation. Two approaches of curriculum learning, namely predefined curriculum learning and dynamic curriculum learning, are discussed in detail from the perspectives of sample difficulty evaluation and model training scheduling strategies. Predefined curriculum learning guides the model to gradually learn tasks from simple to complex by pre-determining the difficulty order of samples. In contrast, dynamic curriculum learning adjusts the difficulty of samples dynamically based on the model’s current learning state, offering a more flexible training approach. Additionally, this paper analyzes the future research trends of curriculum learning in neural machine translation and proposes three promising research directions.

Key words: curriculum learning, neural machine translation, difficulty measure, training schedule

摘要: 课程学习作为一种新兴技术,近年来逐渐受到关注,它符合人类的学习习惯,由简到难、由浅至深。其核心思想是让模型从简单的、基础的概念开始学习,逐渐过渡到更复杂、更高层次的内容。在神经机器翻译中,课程学习作为一种训练策略,旨在帮助模型按照一定规律学习。课程学习现已被证明可以加速模型收敛,提高模型翻译质量和稳定性。从机器学习的角度介绍了课程学习的定义及其基础框架,并进一步探讨了课程学习在神经机器翻译领域中的应用。从样本难度评估与模型训练调度策略两个方面,详细阐述了预定义课程学习和动态课程学习两种方法。预定义课程学习通过事先确定样本的难度顺序,引导模型从简单到复杂的任务逐步学习;而动态课程学习则依据模型当前的学习状态动态调整样本的难度,提供了更灵活的训练方式。分析了课程学习在神经机器翻译领域的未来研究趋势,并提出了三个值得进一步探索的研究方向。

关键词: 课程学习, 神经机器翻译, 难度评估, 训练调度