AttrHIN: Network Representation Learning Method for Heterogeneous Information Network

Link prediction on heterogeneous information networks (HINs) is considered a challenging problem due to the complexity and diversity of node and link types. Meta-path-based link prediction in HINs still faces open challenges. Previous work on link prediction in HINs via network embedding has mainly focused on exploiting node features rather than the existing relations, in the form of meta-paths, between nodes. In fact, predicting the existence of new links between non-linked nodes in this way is hardly convincing. Moreover, recent HIN-based embedding models lack thorough evaluations of the topic similarity between text-based nodes along given meta-paths. To tackle these challenges, in this paper we propose a novel topic-driven, multiple meta-path-based HIN representation learning framework, namely W-MMP2Vec. Our model improves the quality of node representations by combining multiple meta-paths and by calculating a topic similarity weight for each meta-path during network embedding learning in content-based HINs. To validate our approach, we apply the W-MMP2Vec model to several link prediction tasks in both content-based and non-content-based HINs (DBLP, IMDB and BlogCatalog). The experimental results demonstrate the effectiveness of the proposed model, which outperforms recent state-of-the-art HIN representation learning models.
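The idea of weighting each meta-path by topic similarity can be illustrated with a minimal sketch. It is not the W-MMP2Vec implementation: the function names, the use of cosine similarity over precomputed topic vectors, and the weighted average used to merge per-meta-path embeddings are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def path_topic_weight(path, topics):
    """Mean cosine similarity between consecutive text-based nodes on one path instance.

    `topics` maps a text-based node (e.g. a paper) to a topic vector; nodes
    without a topic vector (e.g. authors) are skipped. Hypothetical helper.
    """
    text_nodes = [n for n in path if n in topics]
    pairs = list(zip(text_nodes, text_nodes[1:]))
    if not pairs:
        return 0.0
    return sum(cosine(topics[a], topics[b]) for a, b in pairs) / len(pairs)


def meta_path_weight(path_instances, topics):
    """Average topic weight over all sampled instances of one meta-path."""
    if not path_instances:
        return 0.0
    return sum(path_topic_weight(p, topics) for p in path_instances) / len(path_instances)


def combine_embeddings(per_path_vectors, weights):
    """Weighted average of one node's embeddings learned under different meta-paths."""
    total = sum(weights[name] for name in per_path_vectors)
    if total == 0:
        raise ValueError("at least one meta-path needs a positive weight")
    dim = len(next(iter(per_path_vectors.values())))
    combined = [0.0] * dim
    for name, vec in per_path_vectors.items():
        for i, x in enumerate(vec):
            combined[i] += (weights[name] / total) * x
    return combined
```

Under these assumptions, a meta-path whose text-based nodes share topics contributes more to the final node vector than one linking topically unrelated nodes.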


SERL: Semantic-Path Biased Representation Learning of Heterogeneous Information Network

Knowledge Science, Engineering and Management - Lecture Notes in Computer Science ◽ 10.1007/978-3-319-99365-2_26 ◽ 2018 ◽ pp. 287-298

Author(s): Haining Tan ◽ Weiqiang Tang ◽ Xinxin Fan ◽ Quanliang Jing ◽ Jingping Bi

Keyword(s): Representation Learning ◽ Information Network ◽ Heterogeneous Information Network ◽ Heterogeneous Information


Author Name Disambiguation on Heterogeneous Information Network with Adversarial Representation Learning

Proceedings of the AAAI Conference on Artificial Intelligence ◽ 10.1609/aaai.v34i01.5356 ◽ 2020 ◽ Vol 34 (01) ◽ pp. 238-245

Author(s): Haiwen Wang ◽ Ruijie Wan ◽ Chuan Wen ◽ Shuhao Li ◽ Yuting Jia ◽ ...

Keyword(s): Representation Learning ◽ High Order ◽ Information Network ◽ Feature Engineering ◽ Name Disambiguation ◽ Heterogeneous Information Network ◽ Heterogeneous Information ◽ Training Strategy ◽ Author Name Disambiguation ◽ Academic Information

Author name ambiguity causes inadequacy and inconvenience in academic information retrieval, which raises the need for author name disambiguation (AND). Existing AND methods can be divided into two categories: models that focus on content information to distinguish whether two papers are written by the same author, and models that focus on relation information, representing it as edges on a network to quantify the similarity among papers. However, the former require adequate labeled samples and informative negative samples, and are also ineffective at measuring the high-order connections among papers, while the latter need complicated feature engineering or supervision to construct the network. We propose a novel generative adversarial framework to grow the two categories of models together: (i) the discriminative module distinguishes whether two papers are from the same author, and (ii) the generative module selects possibly homogeneous papers directly from the heterogeneous information network, which eliminates the complicated feature engineering. In this way, the discriminative module guides the generative module to select homogeneous papers, and the generative module generates high-quality negative samples that train the discriminative module to be aware of high-order connections among papers. Furthermore, a self-training strategy for the discriminative module and a random-walk-based generating algorithm are designed to make the training stable and efficient. Extensive experiments on two real-world AND benchmarks demonstrate that our model provides significant performance improvement over state-of-the-art methods.


HIN_DRL: A random walk based dynamic network representation learning method for heterogeneous information networks

Expert Systems with Applications ◽ 10.1016/j.eswa.2020.113427 ◽ 2020 ◽ Vol 158 ◽ pp. 113427

Author(s): LU Meilian ◽ YE Danna

Keyword(s): Random Walk ◽ Dynamic Network ◽ Representation Learning ◽ Information Networks ◽ Learning Method ◽ Heterogeneous Information ◽ Heterogeneous Information Networks ◽ Network Representation


HEPre: Click frequency prediction of applications based on heterogeneous information network embedding

Journal of Intelligent & Fuzzy Systems ◽ 10.3233/jifs-211488 ◽ 2021 ◽ pp. 1-16

Author(s): Chao Li ◽ Yeyu Yan ◽ Zhongying Zhao ◽ Jun Luo ◽ Qingtian Zeng

Keyword(s): Prediction Model ◽ Mobile Applications ◽ Mobile Application ◽ User Behavior ◽ Absolute Error ◽ Research Direction ◽ Information Network ◽ Heterogeneous Information Network ◽ Heterogeneous Information ◽ Network Representation

Owing to the continuous enrichment of mobile application resources, mobile applications carry almost all user behaviors and preferences. The analysis of user behavior on mobile terminals has become an important research direction. The frequency with which users click on mobile applications reflects their preferences to a certain extent. In this study, we propose a mobile application click-frequency prediction model based on heterogeneous information network representation. This model first constructs a heterogeneous information network between users’ mobile devices and mobile applications. To generate a meaningful sequence of nodes for network embedding, we perform a random walk guided by a specified meta-path. Finally, the prediction of users’ mobile application click frequency is completed using representation fusion and matrix factorization. Experiments show that our method outperforms the baseline methods in terms of mean absolute error and root mean square error. Therefore, applying a heterogeneous information network representation method to the prediction model is effective. This study is relevant to research on the behavior of mobile terminal users.
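The meta-path-guided random walk mentioned in the abstract can be sketched as follows. This is an illustrative sketch, not the HEPre code: the function name `metapath_walk`, the adjacency-dictionary graph layout, and the `"device"` / `"app"` type labels are assumptions. The walk only moves to neighbors whose type matches the next position in the meta-path, and a symmetric meta-path is repeated until the requested length is reached.

```python
import random


def metapath_walk(graph, node_type, start, metapath, walk_length, rng=None):
    """Random walk that only follows edges matching the node types in `metapath`.

    graph:      dict mapping each node to a list of neighbor nodes
    node_type:  dict mapping each node to its type label
    metapath:   list of type labels whose first and last entries match,
                e.g. ["device", "app", "device"] (hypothetical labels)
    """
    if metapath[0] != metapath[-1]:
        raise ValueError("meta-path must start and end with the same type to be repeated")
    if node_type[start] != metapath[0]:
        raise ValueError("start node type does not match the meta-path")
    rng = rng or random.Random()
    period = len(metapath) - 1  # number of hops in one pass over the meta-path
    walk = [start]
    step = 0
    while len(walk) < walk_length:
        wanted = metapath[(step % period) + 1]
        candidates = [n for n in graph[walk[-1]] if node_type[n] == wanted]
        if not candidates:
            break  # dead end: no neighbor of the required type
        walk.append(rng.choice(candidates))
        step += 1
    return walk


# Toy bipartite network of devices and applications (made-up data).
graph = {
    "d1": ["a1", "a2"],
    "d2": ["a1"],
    "a1": ["d1", "d2"],
    "a2": ["d1"],
}
node_type = {"d1": "device", "d2": "device", "a1": "app", "a2": "app"}

walk = metapath_walk(graph, node_type, "d1", ["device", "app", "device"], 5,
                     rng=random.Random(0))
print(walk)
```

Walks produced this way are typically fed to a skip-gram style model to learn node vectors; that step, and the later representation fusion and matrix factorization, are outside this sketch.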
