Author Name Disambiguation on Heterogeneous Information Network with Adversarial Representation Learning

Haiwen Wang; Ruijie Wan; Chuan Wen; Shuhao Li; Yuting Jia; Weinan Zhang; Xinbing Wang

doi:10.1609/aaai.v34i01.5356

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5356 ◽

2020 ◽

Vol 34 (01) ◽

pp. 238-245

Author(s):

Haiwen Wang ◽

Ruijie Wan ◽

Chuan Wen ◽

Shuhao Li ◽

Yuting Jia ◽

...

Keyword(s):

Representation Learning ◽

High Order ◽

Information Network ◽

Feature Engineering ◽

Name Disambiguation ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Training Strategy ◽

Author Name Disambiguation ◽

Academic Information

Author name ambiguity makes academic information retrieval inadequate and inconvenient, which raises the necessity of author name disambiguation (AND). Existing AND methods can be divided into two categories: models that focus on content information to distinguish whether two papers are written by the same author, and models that focus on relation information, representing it as edges on a network to quantify the similarity among papers. However, the former require adequate labeled samples and informative negative samples and are also ineffective in measuring the high-order connections among papers, while the latter need complicated feature engineering or supervision to construct the network. We propose a novel generative adversarial framework to grow the two categories of models together: (i) the discriminative module distinguishes whether two papers are from the same author, and (ii) the generative module selects possibly homogeneous papers directly from the heterogeneous information network, which eliminates the complicated feature engineering. In this way, the discriminative module guides the generative module to select homogeneous papers, and the generative module generates high-quality negative samples that train the discriminative module to be aware of high-order connections among papers. Furthermore, a self-training strategy for the discriminative module and a random-walk-based generating algorithm are designed to make the training stable and efficient. Extensive experiments on two real-world AND benchmarks demonstrate that our model provides significant performance improvement over the state-of-the-art methods.

W-MMP2Vec: Topic-driven network embedding model for link prediction in content-based heterogeneous information network

Intelligent Data Analysis ◽

10.3233/ida-205168 ◽

2021 ◽

Vol 25 (3) ◽

pp. 711-738

Author(s):

Phu Pham ◽

Phuc Do

Keyword(s):

Link Prediction ◽

Representation Learning ◽

Information Network ◽

Network Embedding ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Learning Framework ◽

Novel Approach ◽

Proposed Model ◽

Meta Path

Link prediction on heterogeneous information networks (HINs) is considered a challenging problem due to the complexity and diversity of node and link types, and challenges remain for meta-path-based link prediction in HINs. Previous work on link prediction in HINs via network embedding has mainly focused on exploiting node features rather than the existing relations between nodes in the form of meta-paths. Predicting the existence of new links between non-linked nodes on that basis alone is unconvincing. Moreover, recent HIN-based embedding models also lack thorough evaluation of the topic similarity between text-based nodes along given meta-paths. To tackle these challenges, we propose a novel topic-driven, multiple meta-path-based HIN representation learning framework, namely W-MMP2Vec. Our model improves the quality of node representations by combining multiple meta-paths and by calculating a topic similarity weight for each meta-path during network embedding learning in content-based HINs. To validate our approach, we apply the W-MMP2Vec model to several link prediction tasks in both content-based and non-content-based HINs (DBLP, IMDB and BlogCatalog). The experimental results demonstrate the effectiveness of the proposed model, which outperforms recent state-of-the-art HIN representation learning models.
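
As a rough illustration of what a "topic similarity weight" along a meta-path could look like, the sketch below scores one path instance by the mean cosine similarity between the topic vectors of consecutive text-bearing nodes. This is an assumption made for illustration only, not the formula used by W-MMP2Vec; the names `cosine`, `path_topic_weight`, and the toy `topics` vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def path_topic_weight(path, topics):
    """Mean cosine similarity between consecutive text-bearing nodes on a path.

    `path` is a list of node ids; `topics` maps text-bearing node ids to
    topic vectors. Nodes without a topic vector (e.g. authors) are skipped.
    This scoring rule is an illustrative assumption, not the paper's definition.
    """
    text_nodes = [n for n in path if n in topics]
    pairs = list(zip(text_nodes, text_nodes[1:]))
    if not pairs:
        return 0.0
    return sum(cosine(topics[a], topics[b]) for a, b in pairs) / len(pairs)


# Hypothetical topic distributions for three papers.
topics = {"p1": [1.0, 0.0], "p2": [1.0, 0.0], "p3": [0.0, 1.0]}

same_topic = path_topic_weight(["p1", "a1", "p2"], topics)   # papers share a topic
cross_topic = path_topic_weight(["p1", "a1", "p3"], topics)  # papers differ in topic
```

Under this reading, a paper-author-paper instance linking two papers on the same topic would receive a higher weight than one linking papers on unrelated topics.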

SERL: Semantic-Path Biased Representation Learning of Heterogeneous Information Network

Knowledge Science, Engineering and Management - Lecture Notes in Computer Science ◽

10.1007/978-3-319-99365-2_26 ◽

2018 ◽

pp. 287-298

Author(s):

Haining Tan ◽

Weiqiang Tang ◽

Xinxin Fan ◽

Quanliang Jing ◽

Jingping Bi

Keyword(s):

Representation Learning ◽

Information Network ◽

Heterogeneous Information Network ◽

Heterogeneous Information

W-MetaPath2Vec: The topic-driven meta-path-based model for large-scaled content-based heterogeneous information network representation learning

Expert Systems with Applications ◽

10.1016/j.eswa.2019.01.015 ◽

2019 ◽

Vol 123 ◽

pp. 328-344 ◽

Cited By ~ 7

Author(s):

Phu Pham ◽

Phuc Do

Keyword(s):

Representation Learning ◽

Information Network ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Network Representation ◽

Meta Path

Network Schema Preserving Heterogeneous Information Network Embedding

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/190 ◽

2020 ◽

Author(s):

Jianan Zhao ◽

Xiao Wang ◽

Chuan Shi ◽

Zekuan Liu ◽

Yanfang Ye

Keyword(s):

Dimensional Space ◽

High Order ◽

Order Structure ◽

Information Network ◽

Heterogeneous Structure ◽

High Order Structure ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Structure Information ◽

Real World Datasets

As heterogeneous networks have become increasingly ubiquitous, Heterogeneous Information Network (HIN) embedding, which aims to project nodes into a low-dimensional space while preserving the heterogeneous structure, has drawn increasing attention in recent years. Many of the existing HIN embedding methods adopt meta-path guided random walks to retain both the semantic and structural correlations between different types of nodes. However, the selection of meta-paths is still an open problem, which either depends on domain knowledge or is learned from label information. As a uniform blueprint of a HIN, the network schema comprehensively embraces the high-order structure and contains rich semantics. In this paper, we make the first attempt to study network schema preserving HIN embedding, and propose a novel model named NSHE. In NSHE, a network schema sampling method is first proposed to generate sub-graphs (i.e., schema instances), and then a multi-task learning task is built to preserve the heterogeneous structure of each schema instance. Besides preserving pairwise structure information, NSHE is able to retain high-order structure (i.e., network schema). Extensive experiments on three real-world datasets demonstrate that our proposed model NSHE significantly outperforms the state-of-the-art methods.
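
For readers unfamiliar with the meta-path guided random walk that this abstract refers to as the common baseline, here is a minimal sketch on a toy author-paper-venue graph. It is a generic illustration, not code from NSHE; the function `metapath_walk`, the node ids, and the toy edges are all hypothetical.

```python
import random
from collections import defaultdict


def metapath_walk(adj, node_type, start, metapath, walk_length, rng=None):
    """Random walk that only steps to neighbours of the type the meta-path expects.

    `metapath` is a list of node types whose first and last entries coincide
    (e.g. ["A", "P", "A"]), so the pattern can be repeated cyclically.
    The walk stops early if no neighbour of the required type exists.
    """
    rng = rng or random.Random()
    if node_type[start] != metapath[0]:
        raise ValueError("start node type must match metapath[0]")
    cycle = metapath[:-1]  # drop the repeated end type before cycling
    walk = [start]
    while len(walk) < walk_length:
        next_type = cycle[len(walk) % len(cycle)]
        candidates = [n for n in adj[walk[-1]] if node_type[n] == next_type]
        if not candidates:
            break
        walk.append(rng.choice(candidates))
    return walk


# Toy HIN: authors (A), papers (P), one venue (V); edges are undirected.
node_type = {"a1": "A", "a2": "A", "p1": "P", "p2": "P", "v1": "V"}
edges = [("a1", "p1"), ("a2", "p1"), ("a2", "p2"), ("p1", "v1"), ("p2", "v1")]
adj = defaultdict(list)
for u, v in edges:
    adj[u].append(v)
    adj[v].append(u)

apa_walk = metapath_walk(adj, node_type, "a1", ["A", "P", "A"], 5, random.Random(0))
apvpa_walk = metapath_walk(adj, node_type, "a1", ["A", "P", "V", "P", "A"], 5, random.Random(0))
```

The walks produced this way are typically fed to a skip-gram style model to learn node embeddings; which meta-paths to use is the choice the abstract describes as an open problem.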

First-order and High-order Information Fusion over Heterogeneous Information Network for Top-N Recommendation System

2021 IEEE 24th International Conference on Computer Supported Cooperative Work in Design (CSCWD) ◽

10.1109/cscwd49262.2021.9437779 ◽

2021 ◽

Author(s):

Nan Mu ◽

Daren Zha

Keyword(s):

Information Fusion ◽

Recommendation System ◽

High Order ◽

Information Network ◽

Order Information ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

First Order

Malware classification based on heterogeneous information network representation learning

2020 International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE) ◽

10.1109/icbaie49996.2020.00018 ◽

2020 ◽

Author(s):

Yu Chen ◽

Bin Qin ◽

Changchun Ma ◽

Ming Xu

Keyword(s):

Representation Learning ◽

Information Network ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Malware Classification ◽

Network Representation

AttrHIN: Network Representation Learning Method for Heterogeneous Information Network

IEEE Access ◽

10.1109/access.2021.3110200 ◽

2021 ◽

pp. 1-1

Author(s):

Qingbiao Zhou ◽

Chen Wang ◽

Qi Li

Keyword(s):

Representation Learning ◽

Information Network ◽

Learning Method ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Network Representation

Semantic Based Heterogeneous Information Network Embedding for Patent Citation Recommendation

2020 International Conference on Artificial Intelligence and Computer Engineering (ICAICE) ◽

10.1109/icaice51518.2020.00106 ◽

2020 ◽

Author(s):

Yanping Zhang ◽

Shuang Li ◽

Xi Chen ◽

Fulan Qian ◽

Shu Zhao ◽

...

Keyword(s):

Patent Citation ◽

Information Network ◽

Network Embedding ◽

Heterogeneous Information Network ◽

Heterogeneous Information

FallbackWalk: A Random Walk Based Fallback for Heterogeneous Information Network

2021 IEEE 6th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA) ◽

10.1109/icccbda51879.2021.9442589 ◽

2021 ◽

Author(s):

Zhengjun Liu ◽

Ying Liang ◽

Xiaojie Xie ◽

Zisen Wang ◽

Yongkang Du

Keyword(s):

Random Walk ◽

Information Network ◽

Heterogeneous Information Network ◽

Heterogeneous Information

Citation Recommendation Based on Weighted Heterogeneous Information Network Containing Semantic Linking

2019 IEEE International Conference on Multimedia and Expo (ICME) ◽

10.1109/icme.2019.00014 ◽

2019 ◽

Cited By ~ 3

Author(s):

Jie Chen ◽

Yang Liu ◽

Shu Zhao ◽

Yanping Zhang

Keyword(s):

Information Network ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Semantic Linking
