Hyperbolic Factorization Machine

doi:10.3778/j.issn.1673-9418.1905025

Journal of Frontiers of Computer Science and Technology ›› 2020, Vol. 14 ›› Issue (4): 590-597. DOI: 10.3778/j.issn.1673-9418.1905025

WANG Weihao, CHEN Songcan

1. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
2. MIIT Key Laboratory of Pattern Analysis and Machine Intelligence, Nanjing 211106, China

Online:2020-04-01 Published:2020-04-10

Abstract:

Since its introduction, the factorization machine (FM) has been widely applied in recommender systems. To capture second-order feature interactions, FM represents the second-order coefficient of any two features as the inner product of their corresponding embedding vectors in Euclidean space. However, the objects in recommendation scenarios, such as items, users, attributes, and contexts, can be described by a heterogeneous network with hierarchical structure, and flat Euclidean space cannot capture such structure, which limits the feature representation ability of FM. This paper therefore proposes the hyperbolic factorization machine (HFM). HFM represents each feature as a vector in hyperbolic space rather than Euclidean space and uses the hyperbolic distance to measure the strength of second-order feature interactions. Hyperbolic space is chosen because it has been shown to be better suited to embedding hierarchically structured objects such as trees, graphs, and vocabularies. Two HFMs are designed, one based on the Poincaré ball model and one on the hyperboloid model, and the corresponding Riemannian gradient descent optimization algorithms are derived. Experiments on multiple datasets show that, with the same number of trainable parameters, HFM outperforms the original FM and also reveals hierarchical relations among features that FM lacks, which gives the model partial interpretability.

Key words: factorization machine, hyperbolic space, recommender system, representation learning, manifold learning
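
The contrast drawn in the abstract, an inner product in Euclidean space versus a distance in hyperbolic space, can be illustrated with a minimal sketch. Everything below is an assumption for illustration rather than the paper's implementation: the function names are hypothetical, curvature is fixed at -1, and the abstract does not state how HFM turns the distance into a second-order coefficient, so the sketch only computes the distances themselves. The update step is the commonly used rescale-and-project form of Riemannian gradient descent on the Poincaré ball, which may differ from the algorithm the authors derive.

```python
import math

def dot(a, b):
    """Euclidean inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))

def fm_interaction(v_i, v_j):
    """Standard FM second-order coefficient: inner product of two embeddings."""
    return dot(v_i, v_j)

def poincare_distance(u, v):
    """Geodesic distance in the Poincare ball; assumes ||u|| < 1 and ||v|| < 1."""
    diff_sq = sum((x - y) ** 2 for x, y in zip(u, v))
    denom = (1.0 - dot(u, u)) * (1.0 - dot(v, v))
    return math.acosh(1.0 + 2.0 * diff_sq / denom)

def to_hyperboloid(u):
    """Map a Poincare-ball point to the hyperboloid model (index 0 is time-like)."""
    n = dot(u, u)
    return [(1.0 + n) / (1.0 - n)] + [2.0 * x / (1.0 - n) for x in u]

def hyperboloid_distance(x, y):
    """Geodesic distance on the hyperboloid: acosh of minus the Lorentz product."""
    lorentz = -x[0] * y[0] + dot(x[1:], y[1:])
    # Clamp to 1.0 to guard against rounding just below the acosh domain.
    return math.acosh(max(-lorentz, 1.0))

def poincare_rsgd_step(u, euclid_grad, lr=0.1, max_norm=1.0 - 1e-5):
    """One assumed Riemannian gradient step on the Poincare ball."""
    # Rescale the Euclidean gradient by the inverse of the conformal metric factor.
    scale = (1.0 - dot(u, u)) ** 2 / 4.0
    new_u = [x - lr * scale * g for x, g in zip(u, euclid_grad)]
    # Project back inside the open unit ball if the step left it.
    norm = math.sqrt(dot(new_u, new_u))
    if norm >= max_norm:
        new_u = [x / norm * max_norm for x in new_u]
    return new_u

if __name__ == "__main__":
    u, v = [0.1, 0.2], [-0.3, 0.4]
    print("FM inner product:", fm_interaction(u, v))
    print("Poincare distance:", poincare_distance(u, v))
    print("Hyperboloid distance:",
          hyperboloid_distance(to_hyperboloid(u), to_hyperboloid(v)))
```

The two distance functions agree on corresponding points because the Poincaré ball and the hyperboloid are isometric models of the same hyperbolic space; they differ in parameterization and in how the optimization step is carried out.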

WANG Weihao, CHEN Songcan. Hyperbolic Factorization Machine[J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(4): 590-597.

