混合模型的微博交叉话题发现

doi:10.3778/j.issn.1673-9418.1305004

计算机科学与探索 ›› 2013, Vol. 7 ›› Issue (8): 747-753.DOI: 10.3778/j.issn.1673-9418.1305004

混合模型的微博交叉话题发现

詹勇，杨燕+，王红军

西南交通大学信息科学与技术学院，成都 610031

出版日期:2013-08-01 发布日期:2013-08-06

Extracting Overlapping Topics from Micro-Blog Based on Mixture Model

ZHAN Yong, YANG Yan+, WANG Hongjun

School of Information Science and Technology, Southwest Jiaotong University, Chengdu 610031, China

Online:2013-08-01 Published:2013-08-06

摘要/Abstract

摘要： 微博具有信息量庞大，信息分散多样等特点，已经成为快速分享和传播信息的新平台。传统话题发现算法大部分都是基于划分的，没有考虑话题之间的关联性，存在一定的局限性，因此研究了大规模微博文本集上的话题发现问题。采用具有分词准确率较高、歧义识别特点的西南交通大学思维与智慧研究所中文分词系统对文本进行分词处理，并提出了基于混合模型的微博交叉话题发现算法。实验结果表明，该算法具有一定可行性和有效性。

关键词: 微博, 交叉话题发现, 混合模型

Abstract: Micro-blog is a new platform to share and disseminate information quickly. It is characterized by huge amount of scattered and diverse information. The most of traditional topics extraction algorithms are partitioning method, which do not consider the relationship between the topics, so there are some limitations. This paper focuses on the task of news topics extraction from large-scale short posts of micro-blog service. The word segmentation is processed according to the characteristics of the micro-blog text using the Chinese word segmentation software with high accuracy and ambiguity recognition, which is developed by Institute of Noetics and Wisdom, Southwest Jiaotong University. And then, this paper proposes an overlapping topic detection algorithm based on mixture model. The experimental results prove the feasibility and validity of the algorithm.

Key words: micro-blog, overlapping topic detection, mixture model

詹勇，杨燕，王红军. 混合模型的微博交叉话题发现[J]. 计算机科学与探索, 2013, 7(8): 747-753.

ZHAN Yong, YANG Yan, WANG Hongjun. Extracting Overlapping Topics from Micro-Blog Based on Mixture Model[J]. Journal of Frontiers of Computer Science and Technology, 2013, 7(8): 747-753.

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	46

来源	本网站	其他网站

次数	42	4
比例	91%	9%

摘要

308

最新录用	在线预览	正式出版

0	0	308

	来源	本网站

	次数	308
	比例	100%

[1]	张墨华，彭建华. 面向图像复原的分层贝叶斯局部高斯混合模型[J]. 计算机科学与探索, 2020, 14(2): 325-335.
[2]	李聪，葛洪伟. 非线性幂变换Gammachirp滤波器的鲁棒语音特征提取[J]. 计算机科学与探索, 2019, 13(8): 1351-1359.
[3]	黄畅，郭文忠，郭昆. 面向微博热点话题发现的改进BBTM模型研究[J]. 计算机科学与探索, 2019, 13(7): 1102-1113.
[4]	周南，杜军平，姚旭，梁美玉，薛哲，LEE JangMyung. 基于卷积神经网络的微博话题内容搜索方法[J]. 计算机科学与探索, 2019, 13(5): 753-764.
[5]	张绍武，刘华丽，杨亮，邵华，林鸿飞. 基于图排序模型的微博观点信息识别[J]. 计算机科学与探索, 2018, 12(2): 292-299.
[6]	刘培玉，侯秀艳，朱振方，刘芳，蔡肖红. 基于热度联合排序的微博热点话题发现[J]. 计算机科学与探索, 2016, 10(4): 573-581.
[7]	刘超，徐雅斌，武装. 微博社区快速发现方法[J]. 计算机科学与探索, 2015, 9(9): 1100-1107.
[8]	黄磊，李寿山，王晶晶. 基于认证用户信息的微博用户类型识别方法[J]. 计算机科学与探索, 2015, 9(6): 719-725.
[9]	王峰，余伟，李石君. 新浪微博平台上的用户可信度评估[J]. 计算机科学与探索, 2013, 7(12): 1125-1134.
[10]	杨平，王丹，赵文兵. 微博网站中面向主题的权威信息搜索技术研究[J]. 计算机科学与探索, 2013, 7(12): 1135-1145.
[11]	童薇，陈威，孟小峰. EDM：高效的微博事件检测算法[J]. 计算机科学与探索, 2012, 6(12): 1076-1086.
[12]	王晟，王子琪，张铭. 个性化微博推荐算法[J]. 计算机科学与探索, 2012, 6(10): 895-902.

混合模型的微博交叉话题发现

Extracting Overlapping Topics from Micro-Blog Based on Mixture Model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 12

编辑推荐 0

Metrics