Journal of Frontiers of Computer Science and Technology ›› 2023, Vol. 17 ›› Issue (6): 1225-1248.DOI: 10.3778/j.issn.1673-9418.2210114

• Frontiers·Surveys • Previous Articles     Next Articles

Survey of Research on Automatic Music Annotation and Classification Methods

ZHANG Rulin, WANG Hailong, LIU Lin, PEI Dongmei   

  1. School of Computer Science and Technology, Inner Mongolia Normal University, Hohhot 010022, China
  • Online:2023-06-01 Published:2023-06-01

音乐自动标注分类方法研究综述

张如琳,王海龙,柳林,裴冬梅   

  1. 内蒙古师范大学 计算机科学技术学院,呼和浩特 010022

Abstract: Music is one of the most popular forms of art and entertainment, and it is an artistic language to express or entrust people??s feelings. However, with the rapid increase of digital music, it is very difficult to manage and filter music through shallow information. As an effective means to organize massive music and enrich the music information, automatic music annotation can overcome the semantic gap in music information retrieval, improve music information, make music more intuitive in expression, and promote the in-depth research of music information retrieval tasks such as music classification, music recommendation, and instrument identification. The current automatic music annotation mainly focuses on solving two problems: feature extraction and model selection. Combined with the current research focus, this paper expounds the relevant knowledge of automatic music annotation. This paper systematically sorts out various audio feature representation and feature extraction methods in the field of music automatic annotation, and conducts quantitative and qualitative analysis of each extraction method. This paper summarizes the related research results in this field, and focuses on the differences between different model methods from the perspectives of machine learning and deep learning. The commonly used datasets and performance evaluation indicators are introduced, the characteristics of different datasets are summarized, and the evaluation indicators are classified and analyzed. Finally, the difficulties and challenges faced by the research in the field of music automatic annotation are pointed out and the future is prospected.

Key words: music information retrieval, automatic music annotation, feature extraction, deep learning

摘要: 音乐是目前最受欢迎的艺术和娱乐形式之一,是表达或寄托人们感情的艺术语言,然而随着数字音乐急剧增加,通过浅层的信息管理与筛选音乐十分困难。音乐自动标注作为一种组织海量音乐与丰富音乐信息的有效手段,可克服音乐信息检索语义鸿沟,健全音乐信息,使音乐具有更直观的表达,并推动音乐分类、音乐推荐、乐器识别等音乐信息检索任务的深入研究。当前音乐自动标注主要聚焦于解决特征提取、模型选择两类问题,结合目前研究重点,阐述了音乐自动标注的相关知识;系统地梳理了音乐自动标注领域的各类音频特征表示及特征提取方法,并对每类提取方法进行了定量分析与定性分析;归纳了该领域相关研究成果,从机器学习与深度学习两个角度着重分析了不同模型方法的差异性;介绍了常用数据集与性能评价指标,总结了不同数据集特点,并对评价指标进行了归类分析;最后指出了音乐自动标注领域研究面临的难点与挑战,并对未来研究方向进行了展望。

关键词: 音乐信息检索, 音乐自动标注, 特征提取, 深度学习