Survey on Representation Learning Methods Oriented to Heterogeneous Information Network

doi:10.3778/j.issn.1673-9418.1903056

Abstract

Abstract: Network representation learning aims to learn a series of low-dimensional vectors for the components (node, edge, subgraph, etc.) in a network. Meanwhile, the characters of the components in the original network should be largely retained in these vectors. Heterogeneous information network is the network composed of various types of nodes, link relationships and attribute information. It is characterized by dynamics, large scale and heterogeneity, and is ubiquitous in the real life. Network representation learning by integrating various heterogeneous information can not only alleviate the problem of data sparsity, but also help to learn the representation vectors with high discriminative and inferential ability. At the same time, it also faces the challenge of dealing with complex data relationships and balancing heterogeneous information. In recent years, researchers have designed different representation learning algorithms for heterogeneous information networks, which have greatly promoted the development of this field. In view of these algorithms, this paper first designs a unified classification framework, then generalizes and compares the representative algorithms in each category, including their time complexities, advantages, etc. In addition, the information of the commonly used data sets is summarized into a table. Some challenges and possible research directions are provided at the end of this paper.

Key words: network representation learning, heterogeneous information network, network analysis

摘要： 网络表示学习旨在为网络中的组件（节点、边、子网络等）学习出低维的表征向量，使得这些向量能够在最大程度上保留组件在原网络中的特性。异质信息网络是由多种类型的节点、链接关系以及属性信息组成的网络，具有动态性、大规模和异质性等特点，在现实生活中普遍存在。融合多种异质信息进行网络表示学习，能在一定程度上解决数据稀疏问题，同时有助于训练出具有高区别力和推理能力的表征向量。但与此同时，也面临着如何有效处理复杂数据关系以及平衡异质信息的挑战。近年来，研究者们针对异质信息网络设计了不同的表示学习算法，在很大程度上推动了该领域的发展。针对这些算法，首先设计一个统一的分类框架，接着对各类别下的代表性算法进行概括介绍和比较，分析它们的时间复杂度和优缺点。此外，分类汇总了实验中的常用数据集。最后给出了该领域的挑战和未来可能的研究方向。

关键词: 网络表示学习, 异质信息网络, 网络分析

ZHOU Hui, ZHAO Zhongying, LI Chao. Survey on Representation Learning Methods Oriented to Heterogeneous Information Network[J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(7): 1081-1093.

周慧，赵中英，李超. 面向异质信息网络的表示学习方法研究综述[J]. 计算机科学与探索, 2019, 13(7): 1081-1093.

[1]	LIU Wenxing, FAN Min, LI Jinhai. Research on Community Division Method Under Network Formal Context [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(8): 1441-1449.
[2]	DUAN Xiangyu, YUAN Guan, MENG Fanrong. Dynamic Community Detection: A Survey [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(4): 612-630.
[3]	ZHAO Xueli, LU Guangyue, LV Shaoqing, ZHANG Pan. Attributed Bipartite Network Representation Learning [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(3): 495-505.
[4]	ZHAO Chuan, ZHANG Kaihan, LIANG Jiye. Asymmetric Recommendation Algorithm in Heterogeneous Information Network [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(6): 939-946.
[5]	CHEN Kejia, CHEN Liming, WU Tong. Survey on Community Detection in Multi-layer Networks [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(11): 1801-1812.
[6]	XU Lei, HUANG Ling, WANG Changdong. Motif-Preserving Network Representation Learning [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(8): 1261-1271.
[7]	PU Jianyu, CHEN Lei, SHAO Kai. Exploiting Katz Method to Boost Inductive Matrix Completion for Predicting Gene-Disease Associations [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(7): 1154-1164.
[8]	YANG Xiaocui, SONG Jiaxiu, ZHANG Xihuang. Link Prediction Algorithm Based on Network Representation Learning [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(5): 812-821.
[9]	ZHANG Lei, QIAN Feng, ZHAO Shu, CHEN Jie, ZHANG Yanping. Network Representation Learning via Variational Auto-Encoder [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(10): 1733-1744.
[10]	WANG Hongxu, WU Bin, LIU Yang. Parallel Graph Data Analysis System Based on Spark [J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(9): 1066-1074.
[11]	XU Bin, YANG Dan, ZHANG Yu, LI Feng, GAO Kening. Learners’ Activities Based Study Buddies Recommendation Towards MOOCs [J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(1): 71-79.
[12]	WU Lei, ZHANG Wensheng, WANG Jue. Fusion Probabilistic Graphical Model on Heterogeneous Information Network Data [J]. Journal of Frontiers of Computer Science and Technology, 2014, 8(6): 712-718.
[13]	XU Bin, YANG Dan, ZHANG Yu, LI Feng, GAO Kening. Relationship Bind Topic Model Toward Tag Recommendation for Micro-Blog Users [J]. Journal of Frontiers of Computer Science and Technology, 2014, 8(3): 288-295.
[14]	HE Di¹, PENG Zhiyong², MEI Xiaorong². Brief Survey of Web Community Management Research [J]. Journal of Frontiers of Computer Science and Technology, 2011, 5(2): 97-113.
[15]	QIAN Tieyun¹⁺， LI Qing^1，2， SHEU Phillip¹. A Core Group Method for Segmenting the Life Cycle of Scientific Topics [J]. Journal of Frontiers of Computer Science and Technology, 2010, 4(2): 170-179.

Survey on Representation Learning Methods Oriented to Heterogeneous Information Network

面向异质信息网络的表示学习方法研究综述

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics