混合秩矩阵分解模型

doi:10.3778/j.issn.1673-9418.1806023

摘要/Abstract

摘要： 随着推荐系统的发展，矩阵近似算法成为研究热点，而以概率矩阵分解为代表的低秩矩阵近似模型因其具有较高的推荐精度而广受关注。但是，随着大数据时代的到来，评分矩阵越来越复杂，简单的单个矩阵近似模型会使一些隐藏在数据中的信息被忽视。为了解决这个问题，提出了一种基于boosting框架的混合秩矩阵近似算法（mixture rank matrix factorization，MRMF）。该算法基于boosting框架融合多个不同秩矩阵获取丰富的评分信息。具体方法为首先从整体结构出发，获取矩阵的整体信息，然后基于boosting求偏差获得残差矩阵，抓取局部的相关性。同时为了更好地学习局部特征，引入服从拉普拉斯先验分布的样本权重，构建自适应权重的概率矩阵模型（adaptive weight matrix factorization，AWMF）。在获取残差矩阵之后，通过EM算法学习残差矩阵的权重，避免模型过拟合以及减少人工调差的复杂度。实验结果验证，所提出的算法在四个真实数据集（Ciao、Epinions、Douban、Movielens（10M））上均具有较好的推荐精度。

关键词: 矩阵近似, 梯度提升, 自适应秩, 样本权重, 推荐系统, 权重矩阵分解

Abstract: With the development of recommendation system, matrix approximation algorithm has become a research hotspot, and low-rank matrix approximation model represented by probability matrix decomposition has attracted wide attention because of its high recommendation accuracy. However, with the arrival of the era of large data, scoring matrices become more and more complex. Simple single matrix approximation model will make some hidden information in data ignored. To solve this problem, a hybrid rank matrix factorization (MRMF) algorithm based on boosting framework is proposed. The algorithm combines multiple different rank matrices to obtain rich scoring information. The specific method is to obtain the overall information of the matrix from the overall structure, and then obtain the residual matrix based on boosting deviation to capture the local phase. At the same time, in order to learn local features better, sample weights obeying Laplacian prior distribution are introduced to construct an adaptive weight matrix factorization (AWMF). After obtaining the residual matrix, the weight of the residual matrix is learnt by EM algorithm to avoid over-fitting of the model and reduce the complexity of manual adjustment. The proposed method has good recommendation accuracy on four real data sets (Ciao, Epinions, Douban, Movielens (10M)).

Key words: matrix approximation, gradient boosting, adaptive rank, weighted sample, recommendation systems, weight matrix factorization

李幸幸，刘华锋，景丽萍. 混合秩矩阵分解模型[J]. 计算机科学与探索, 2019, 13(7): 1114-1122.

LI Xingxing, LIU Huafeng, JING Liping. Mixture Rank Matrix Factorization Model[J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(7): 1114-1122.

106

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	106

来源	本网站	其他网站

次数	105	1
比例	99%	1%

摘要

175

最新录用	在线预览	正式出版

0	0	175

	来源	本网站

	次数	175
	比例	100%

[1]	武家伟，孙艳春. 融合知识图谱和深度学习方法的问诊推荐系统[J]. 计算机科学与探索, 2021, 15(8): 1432-1440.
[2]	高仰，刘渊. 融合知识图谱和短期偏好的推荐算法[J]. 计算机科学与探索, 2021, 15(6): 1133-1144.
[3]	邢长征，郭亚兰，张全贵，赵宏宝. 融合短文本层级注意力和时间信息的推荐方法[J]. 计算机科学与探索, 2021, 15(11): 2222-2232.
[4]	李广丽，滑瑾，袁天，朱涛，邬任重，姬东鸿，张红斌. 基于用户偏好挖掘生成对抗网络的推荐系统[J]. 计算机科学与探索, 2020, 14(5): 803-814.
[5]	王玮皓，陈松灿. 双曲因子分解机[J]. 计算机科学与探索, 2020, 14(4): 590-597.
[6]	刘忠慧，邹璐，杨梅，闵帆. 启发式概念构造的组推荐方法[J]. 计算机科学与探索, 2020, 14(4): 703-711.
[7]	杜师帅，邱天，李灵巧，胡锦泉，郑安兵，冯艳春，胡昌勤，杨辉华. 多层梯度提升树在药品鉴别中的应用[J]. 计算机科学与探索, 2020, 14(2): 260-273.
[8]	王绍卿，李鑫鑫，孙福振，方春. 个性化新闻推荐技术研究综述[J]. 计算机科学与探索, 2020, 14(1): 18-29.
[9]	陈虹，陈建虎，肖成龙，万广雪，肖振久. 深度学习模型下多分类器的入侵检测方法[J]. 计算机科学与探索, 2019, 13(7): 1123-1133.
[10]	王宇琛，王宝亮，侯永宏. 融合协同过滤与上下文信息的Bandits推荐算法[J]. 计算机科学与探索, 2019, 13(3): 361-373.
[11]	和凤珍，石进平. 非均匀划分拟阵约束下的多样性推荐方法[J]. 计算机科学与探索, 2019, 13(2): 226-238.
[12]	庄福振，罗丹，何清. 基于集成局部性特征学习的推荐算法[J]. 计算机科学与探索, 2018, 12(6): 851-858.
[13]	郭宁宁，王宝亮，侯永宏，常鹏. 融合社交网络特征的协同过滤推荐算法[J]. 计算机科学与探索, 2018, 12(2): 208-217.
[14]	李佳琪，刘红岩，何军，王蓓，杜小勇. 应用商城中用户年龄的推断及在推荐中的应用[J]. 计算机科学与探索, 2018, 12(11): 1729-1739.
[15]	房倩琦，柳玲，文俊浩，曾骏，高旻. 社交关系在基于模型社会化推荐系统中的影响[J]. 计算机科学与探索, 2018, 12(1): 82-91.

混合秩矩阵分解模型

Mixture Rank Matrix Factorization Model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐 0

Metrics