计算机科学与探索

• 学术研究 •    下一篇

机器学习在社交媒体用户自杀意念检测中的应用研究综述

蒙秀扬, 王世屹, 李渡渡, 王春玲   

  1. 1. 北京林业大学 信息学院, 北京 100083
    2. 国家林业草原林业智能信息处理工程技术研究中心, 北京 100083
    3. 中央民族大学 信息工程学院,北京 100081

Review on Application of Machine Learning in Detecting Suicidal Ideation for Social Media Users

MENG Xiuyang,  WANG Shiyi,  LI Dudu,  WANG Chunling   

  1. 1. School of Information Science and Technology, Beijing Forestry University, Beijing 100083, China
    2. Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration, Beijing 100083, China
    3. School of Information Engineering, Minzu University of China, Beijing 100081, China

摘要: 自杀是严峻的社会问题,是突出的全球性公共卫生挑战,也是全球死亡的重要因素之一。近年,互联网迅猛发展,社交媒体平台成为人类发布情感甚至是自杀意念、企图和行为的崭新阵地,使其成为自杀意念检测的重要数据平台和关键评估依据。随着人工智能技术的兴起,关于机器学习在社交媒体用户自杀意念的检测中的应用研究已然成为热点。但在国内,该领域相关研究较为匮乏,且尚未形成完整体系。为系统梳理其研究现状及发展脉络,对机器学习技术赋能自杀意念检测的研究进行了全面总结,是近三年来国内第一篇关于此领域的综述。首先,概述了自杀意念检测的定义、流程、常见方法及评价指标,总结了当下自杀意念检测任务中常用的数据集和现有特征工程及其技术。其次,分别从传统的机器学习和深度学习的角度对自杀意念检测进行了系统总结,对比分析了每种方法的关键技术、核心思想及优缺点。此外,归纳了当前该领域中亟待解决的问题及创新解决方法,特别介绍了ChatGPT等大语言模型、多模态模型在该领域的应用。最后,讨论了机器学习在社交媒体自杀意念检测中的应用研究中的局限性,并提出了未来的研究方向,以期进一步推动形成数据驱动、人机协同、跨学科融合、跨文化畛域的数智化自杀意念检测新范式,以期为相关领域的研究人员提供借鉴和参考。

关键词: 自杀意念检测, 社交媒体, 机器学习, 深度学习, 特征提取

Abstract: Suicide constitutes a grave societal quandary, an eminent worldwide public health predicament, and a pivotal determinant of global mortality. In recent years, with the rapid development of the Internet, social media have emerged as a novel domain for individuals to express their emotions, including suicidal ideation, attempts, and behaviors. Consequently, these platforms have evolved into crucial data repositories and essential assessment criteria for detecting suicidal ideation. With the advent of artificial intelligence technology, the utilization of machine learning in detecting suicidal ideation among social media users has emerged as a scintillating subject. The field in China, however, lacks sufficient research and has yet to establish a comprehensive system. To systematically review the research status and development context of suicide ideation detection, this paper presents a systematically summary of machine learning technology's application in empowering suicide ideation detection, marking the first review in this field conducted in China over the past three years. Firstly, this paper provides an overview of the definition, process, commonly employed methods, and evaluation indicators for detecting suicidal ideation. Secondly, this paper provides a comprehensive overview of suicide ideation detection techniques, encompassing both traditional machine learning and deep learning approaches. The key methodologies, fundamental concepts, merits, and limitations of each method are thoroughly compared and analyzed. Furthermore, the urgent issues and innovative solutions in this field are summarized, with a particular focus on the application of large language models such as ChatGPT and multi-modal models. Finally, the limitations of machine learning in the application research of suicide ideation detection on social media are comprehensively discussed, and future research directions are proposed, in order to further promote the formation of a new paradigm of data-driven, human-computer collaboration, interdisciplinary integration, and cross-cultural domain of suicide ideation detection, so as to provide reference and reference for researchers in related fields.

Key words: Suicidal ideation detection, Social media, Machine learning, Deep learning, Feature extraction