计算机科学与探索 ›› 2021, Vol. 15 ›› Issue (12): 2345-2352.DOI: 10.3778/j.issn.1673-9418.2008098

• 人工智能 • 上一篇    下一篇

面向案件审判难度预测的神经网络模型研究

王悦,王平辉,许诺,陈龙,杨鹏,吴用   

  1. 西安交通大学 智能网络与网络安全教育部重点实验室,西安 710049
  • 出版日期:2021-12-01 发布日期:2021-12-09

Research on Neural Network for Trial Difficulty Prediction

WANG Yue, WANG Pinghui, XU Nuo, CHEN Long, YANG Peng, WU Yong   

  1. Ministry of Education Key Lab for Intelligent Networks and Network Security, Xi'an Jiaotong University, Xi'an 710049, China
  • Online:2021-12-01 Published:2021-12-09

摘要:

审判难度预测(TDP)是指在给定案情描述文本的情况下,自动预测案件审判难易程度,其在司法智能化系统中具有广阔的应用前景。现阶段,案件审判难度预测工具严重依赖专家经验规则,存在较大偏差,相关的研究工作较少。针对此问题,将其归结为自然语言处理中的文本分类问题,通过分析发现传统分类方法未考虑起诉状中审判要素间的结构独特性和逻辑依赖性,导致难以准确预测案件难易程度。为解决上述挑战,通过对起诉状的研究,结合案件繁简审判要素,提出一种新的神经网络模型MAT-TAN。具体地,该模型首先采用一种掩码注意力网络(MAT)对案情描述文本进行细粒度分析。其中的掩码机制扮演智能门控者的角色,起到聚焦审判要素特定位置的作用,结合自注意力机制,实现了对各审判要素全面、准确的特征提取。其次,提出一种拓扑关联网络(TAN)对要素间的司法逻辑依赖关系进行建模,并有效融合不同要素的特征,最终实现案件审判难度预测。在法院真实数据上的实验结果表明,与基准的文本分类方法相比,该模型宏平均F1值提升了0.036,在审判难度预测上具备较好的使用效果。

关键词: 审判难度预测(TDP), 审判要素, 掩码注意力网络(MAT), 拓扑关联网络(TAN)

Abstract:

Trial difficulty prediction (TDP) is the task of automatically predicting the difficulty of a trial given the case text, which has a broad application prospect in judicial intelligent system. In practice, the tools of TDP rely heavily on the experience of experts, which leads different conclusions in predicting the difficulty of the trial. However, there are few related research work. To address these issues, this paper regards it as a text classification problem in natural language processing. Through the analysis, it is found that, traditional text classification methods don??t consider the structural uniqueness and logical dependence among trial elements in complaint, which makes it difficult to predict the difficulty of a trial accurately. In order to solve the mentioned challenges, this paper carefully studies indictments and considers the complex and simple trial elements for judging cases, presents an end-to-end model, MAT-TAN (mask-attention and topological association network). Specifically, this paper proposes a novel mask-attention network (MAT), to carry out fine-grained analysis of a case description text in indictments. The masking mechanism plays a role of the intelligent gatekeeper, focusing on the specific position of the trial elements in indictments. Together with the self-attention mechanism, it extracts the comprehensive and accurate characteristics of each trial element. This paper proposes a novel topological association network (TAN), which models the judicial logic dependency relationship between different elements, and effectively integrates the characteristics of different elements. Finally, the TDP is realized. The experimental results conducted on real-world datasets demonstrate that the MAT-TAN can improve the macro averaged F1 up to 0.036 compared with baselines, showing that it has a better performance in TDP.

Key words: trial difficulty prediction (TDP), trial elements, mask-attention network (MAT), topological association network (TAN)