Progress of Automatic Extraction of Uyghur Phrases

Journal of Frontiers of Computer Science and Technology, 2015, Vol. 9, Issue (12): 1420-1429. DOI: 10.3778/j.issn.1673-9418.1509005

ZHANG Haijun 1,2+

1. College of Computer Science and Technology, Xinjiang Normal University, Urumqi 830054
2. College of Elementary Education, Xinjiang Normal University, Urumqi 830054

Online: 2015-12-01 Published: 2015-12-04

Abstract

Abstract: Phrase identification is a technical foundation of machine translation and information retrieval and therefore has significant research value. Focusing on research progress in Uyghur phrase identification, this paper describes the linguistic characteristics of Uyghur, analyzes how these characteristics affect Uyghur phrase identification, summarizes recent linguistic research relevant to Uyghur phrase identification, and reviews in detail the methods used for automatic extraction of Uyghur phrases. The survey finds that substantial progress has been made in both the theory and the implementation techniques of automatic Uyghur phrase extraction, but that much work has yet to be carried out effectively in areas such as phrase annotation standards, research corpora, and research domains, and these areas deserve attention. It is hoped that this paper can serve as a reference for research on Uyghur phrase extraction.

Key words: Uyghur, phrase, rules, statistics, term, named entity

ZHANG Haijun. Progress of Automatic Extraction of Uyghur Phrases[J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(12): 1420-1429.
