scholarly journals AttrHIN: Network Representation Learning Method for Heterogeneous Information Network

IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Qingbiao Zhou ◽  
Chen Wang ◽  
Qi Li
2021 ◽  
Vol 25 (3) ◽  
pp. 711-738
Author(s):  
Phu Pham ◽  
Phuc Do

Link prediction on heterogeneous information network (HIN) is considered as a challenge problem due to the complexity and diversity in types of nodes and links. Currently, there are remained challenges of meta-path-based link prediction in HIN. Previous works of link prediction in HIN via network embedding approach are mainly focused on exploiting features of node rather than existing relations in forms of meta-paths between nodes. In fact, predicting the existence of new links between non-linked nodes is absolutely inconvincible. Moreover, recent HIN-based embedding models also lack of thorough evaluations on the topic similarity between text-based nodes along given meta-paths. To tackle these challenges, in this paper, we proposed a novel approach of topic-driven multiple meta-path-based HIN representation learning framework, namely W-MMP2Vec. Our model leverages the quality of node representations by combining multiple meta-paths as well as calculating the topic similarity weight for each meta-path during the processes of network embedding learning in content-based HINs. To validate our approach, we apply W-TMP2Vec model in solving several link prediction tasks in both content-based and non-content-based HINs (DBLP, IMDB and BlogCatalog). The experimental outputs demonstrate the effectiveness of proposed model which outperforms recent state-of-the-art HIN representation learning models.


2020 ◽  
Vol 34 (01) ◽  
pp. 238-245
Author(s):  
Haiwen Wang ◽  
Ruijie Wan ◽  
Chuan Wen ◽  
Shuhao Li ◽  
Yuting Jia ◽  
...  

Author name ambiguity causes inadequacy and inconvenience in academic information retrieval, which raises the necessity of author name disambiguation (AND). Existing AND methods can be divided into two categories: the models focusing on content information to distinguish whether two papers are written by the same author, the models focusing on relation information to represent information as edges on the network and to quantify the similarity among papers. However, the former requires adequate labeled samples and informative negative samples, and are also ineffective in measuring the high-order connections among papers, while the latter needs complicated feature engineering or supervision to construct the network. We propose a novel generative adversarial framework to grow the two categories of models together: (i) the discriminative module distinguishes whether two papers are from the same author, and (ii) the generative module selects possibly homogeneous papers directly from the heterogeneous information network, which eliminates the complicated feature engineering. In such a way, the discriminative module guides the generative module to select homogeneous papers, and the generative module generates high-quality negative samples to train the discriminative module to make it aware of high-order connections among papers. Furthermore, a self-training strategy for the discriminative module and a random walk based generating algorithm are designed to make the training stable and efficient. Extensive experiments on two real-world AND benchmarks demonstrate that our model provides significant performance improvement over the state-of-the-art methods.


2021 ◽  
pp. 1-16
Author(s):  
Chao Li ◽  
Yeyu Yan ◽  
Zhongying Zhao ◽  
Jun Luo ◽  
Qingtian Zeng

Owing the continuous enrichment of mobile application resources, mobile applications carry almost all user behaviors and preferences. The analysis of user behavior regarding mobile terminals has become an important research direction. The frequency with which users click on mobile applications reflects their preferences to a certain extent. In this study, we propose a mobile application click-frequency prediction model based on heterogeneous information network representation. This model first constructs a heterogeneous information network between users’ mobile devices and mobile applications. To generate a meaningful sequence of network-embedded nodes, we perform a random walk on a specified meta-path. Finally, the prediction of users’ mobile application click frequency is completed using representation fusion and matrix factorization. Experiments show that our method outperforms other baseline methods in terms of the mean absolute error and root mean square error. Therefore, the application of a heterogeneous information network representation method to the prediction model is effective. This study is significant to the behavior research of mobile terminal users.


Sign in / Sign up

Export Citation Format

Share Document