Progress of Automatic Extraction of Uyghur Phrases

doi:10.3778/j.issn.1673-9418.1509005

Abstract: Phrase extraction underlies machine translation and information retrieval and plays an important role in natural language processing. This paper surveys research progress on Uyghur phrase extraction. To frame the discussion, it describes the linguistic features of Uyghur phrases and analyzes how these features affect phrase extraction. It then summarizes the linguistic theories of phrase identification in Uyghur and reviews the techniques used for automatic extraction of Uyghur phrases. Considerable progress has been made in both theory and technology. However, much work remains, such as formulating tagging standards, studying tagged corpora, and expanding the research domains. It is hoped that this paper can serve as a reference for research on phrase extraction in Uyghur.

Key words: Uyghur, phrase, rules, statistics, term, named entity

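Abstract (translated from the Chinese): Phrase identification is the technical foundation of machine translation and information retrieval and has significant research value. Focusing on research progress in Uyghur phrase identification, this paper describes the linguistic characteristics of Uyghur, analyzes their impact on Uyghur phrase identification, summarizes recent linguistic research on the topic, and reviews in particular the methods used for automatic extraction of Uyghur phrases. The review finds that research on automatic Uyghur phrase extraction has made considerable progress in both theory and implementation techniques, but that a great deal of work on phrase tagging standards, research corpora, and research domains has yet to be carried out effectively and deserves attention. It is hoped that this paper can serve as a reference for related research on Uyghur phrase extraction.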


ZHANG Haijun. Progress of Automatic Extraction of Uyghur Phrases[J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(12): 1420-1429.

张海军. 维吾尔语短语自动抽取研究进展[J]. 计算机科学与探索, 2015, 9(12): 1420-1429.
