Micro Blog Evolutionary Network to Classification Method of Negative Information

doi:10.3778/j.issn.1673-9418.1509090

Abstract

Abstract: Aiming at the relationship of the Sina micro blogging, this paper establishes the evolving network by user's transmit blog, which classifies blog by SMO SVM (sequential minimal optimization support vector machine) algorithm, and implements the classification of malicious posts, spam, trash marketing information. The method enables users to accurately block the unwanted posts and blogger. The first step, classifying the entire Sina micro blogs based on the evolving network of transmit relationship and SVM classification algorithm; The second step, annotating the bloggers of often sending malicious advertisements by using the complex network technology; When the malicious bloggers sending message, blocking them in the network; Finally, finding out the source of spam, and discerning the blogger malicious or not, on the macro to better curb the spread of spam.?The results of this paper are compared with user feedback actual situation from the UCI data set, the experimental results of machine learning classification reaches 89%.

Key words: sequential minimal optimization (SMO), support vector machine (SVM), evolutionary network, UCI data set, negative information

摘要： 针对Sina微博博文的转发关系，建立起用户转发博文之间的演化网络，从而利用SMO SVM（sequential minimal optimization support vector machine）分类算法对博文进行分类，筛选出恶意博文、垃圾广告、垃圾营销信息，使用户能够精确地屏蔽不想要的博文和博主。第一步基于微博转发关系的演化网络和SVM分类算法对整个Sina微博进行分类；第二步利用复杂网络等技术对经常发送恶意广告的博主进行标注，从而在网络中对他们进行屏蔽；最后找出垃圾信息的来源以及分辨出博主是不是恶意转发者，在宏观上能更好地遏制垃圾信息的传播。与用户从UCI数据集中实际反馈情况进行比较，实验结果表明，机器学习分类的实验结果吻合度达到89%。

关键词: 序列最小优化（SMO）, 支持向量机（SVM）, 演化网络, UCI数据集, 负信息

ZHAO Yi, HE Keqing, LI Zhao, HUANG Yiwang. Micro Blog Evolutionary Network to Classification Method of Negative Information[J]. Journal of Frontiers of Computer Science and Technology, 2017, 11(1): 91-98.

赵一，何克清，李昭，黄贻望. 微博演化网络的负信息分类方法[J]. 计算机科学与探索, 2017, 11(1): 91-98.

[1]	LIN Hao, LI Leixiao, WANG Hui. Survey on Research and Application of Support Vector Machines in Intelligent Transportation System [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(6): 901-917.
[2]	FU Kang'an, WANG Wenjian, GUO Husheng. Categorical Data Classification Approach Based on Space Correlation Analysis [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(7): 1165-1173.
[3]	WANG Lijuan, DING Shifei. SVM-ELM Model Based on Particle Swarm Optimization [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(4): 657-665.
[4]	HE Li, HAN Keping, LIU Ying. Self-Adaptive SVM Incremental Learning Algorithm [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(4): 647-656.
[5]	WU Yifan, LIANG Jiye, WANG Junhong. Classification Algorithm Based on Hybrid Sampling for Unbalanced Data [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(2): 342-349.
[6]	PENG Qing, JI Guishu, XIE Linjiang1 ZHANG Shaobo. Application of Convolutional Neural Network in Vehicle Recognition [J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(2): 282-291.
[7]	LU Shuxia, ZHOU Mi, JIN Zhao. Imbalanced Weighted Stochastic Gradient Descent Online Algorithm for SVM [J]. Journal of Frontiers of Computer Science and Technology, 2017, 11(10): 1662-1671.
[8]	WANG Hongqiao, CAI Yanning, FU Guangyuan, WANG Shicheng. Probability Density Estimation for Non-flat Functions [J]. Journal of Frontiers of Computer Science and Technology, 2016, 10(4): 589-599.
[9]	TANG Li, GONG Xiujun, HE Li. Survey on PAC-Bayes Theory and Application Research [J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(1): 1-13.
[10]	GUO Yanxiang, CHEN Yaowu. Vehicle License Plate Location Method Based on Edge Detection and Color-Texture Histogram [J]. Journal of Frontiers of Computer Science and Technology, 2014, 8(6): 719-726.
[11]	ZHANG Yanping, ZHA Yongliang, ZHAO Shu, DU Xiuquan. Protein Structure Class Prediction Based on Autocorrelation Coefficient and PseAAC [J]. Journal of Frontiers of Computer Science and Technology, 2014, 8(1): 103-110.
[12]	ZHANG Ling, QIAN Fulan, HE Fugui. Granular Computing and Statistical Learning [J]. Journal of Frontiers of Computer Science and Technology, 2013, 7(8): 754-761.
[13]	TIAN Hao, LI Guohui, LIAN Lin, JIA Li. Hierarchical Matching Kernel for Buildings Classification in Remote Sensing Images [J]. Journal of Frontiers of Computer Science and Technology, 2011, 5(7): 588-594.
[14]	ZHAI Junhai, WANG Tingting, WANG Xizhao . Instance Reduction Support Vector Machine [J]. Journal of Frontiers of Computer Science and Technology, 2011, 5(12): 1131-1138.
[15]	LE THI Hoai An+， NGUYEN Van-Vinh， OUCHANI Samir. Gene Selection for Cancer Classification Using DCA [J]. Journal of Frontiers of Computer Science and Technology, 2009, 3(6): 612-620.

Micro Blog Evolutionary Network to Classification Method of Negative Information

微博演化网络的负信息分类方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics