A hybrid code representation learning approach for predicting method names

Motivated by the proliferation of trajectory data produced by advanced GPS-enabled devices, trajectory is gaining in complexity and beginning to embroil additional attributes beyond simply the coordinates. As a consequence, this creates the potential to define the similarity between two attribute-aware trajectories. However, most existing trajectory similarity approaches focus only on location based proximities and fail to capture the semantic similarities encompassed by these additional asymmetric attributes (aspects) of trajectories. In this paper, we propose multi-aspect embedding for attribute-aware trajectories (MAEAT), a representation learning approach for trajectories that simultaneously models the similarities according to their multiple aspects. MAEAT is built upon a sentence embedding algorithm and directly learns whole trajectory embedding via predicting the context aspect tokens when given a trajectory. Two kinds of token generation methods are proposed to extract multiple aspects from the raw trajectories, and a regularization is devised to control the importance among aspects. Extensive experiments on the benchmark and real-world datasets show the effectiveness and efficiency of the proposed MAEAT compared to the state-of-the-art and baseline methods. The results of MAEAT can well support representative downstream trajectory mining and management tasks, and the algorithm outperforms other compared methods in execution time by at least two orders of magnitude.

Download Full-text

Merging Omnichannel Behavioral Data: A Similarity-based Multi-view Representation Learning Approach

SSRN Electronic Journal ◽

10.2139/ssrn.3778419 ◽

2021 ◽

Author(s):

Chenshuo Sun ◽

Anindya Ghose ◽

Xiao Liu

Keyword(s):

Representation Learning ◽

Behavioral Data ◽

Learning Approach

Download Full-text

Co-embedding: a semi-supervised multi-view representation learning approach

Neural Computing and Applications ◽

10.1007/s00521-021-06599-y ◽

2021 ◽

Author(s):

Xiaodong Jia ◽

Xiao-Yuan Jing ◽

Xiaoke Zhu ◽

Ziyun Cai ◽

Chang-Hui Hu

Keyword(s):

Representation Learning ◽

Learning Approach

Download Full-text

SIR-GN: A Fast Structural Iterative Representation Learning Approach For Graph Nodes

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3450315 ◽

2021 ◽

Vol 15 (6) ◽

pp. 1-39

Author(s):

Mikel Joaristi ◽

Edoardo Serra

Keyword(s):

Structural Information ◽

Representation Learning ◽

Graph Representation ◽

Structural Function ◽

Learning Approach ◽

Numerical Representation ◽

Learning Methods ◽

Computational Performance ◽

Latent Space ◽

Graph Nodes

Graph representation learning methods have attracted an increasing amount of attention in recent years. These methods focus on learning a numerical representation of the nodes in a graph. Learning these representations is a powerful instrument for tasks such as graph mining, visualization, and hashing. They are of particular interest because they facilitate the direct use of standard machine learning models on graphs. Graph representation learning methods can be divided into two main categories: methods preserving the connectivity information of the nodes and methods preserving nodes’ structural information. Connectivity-based methods focus on encoding relationships between nodes, with connected nodes being closer together in the resulting latent space. While methods preserving structure generate a latent space where nodes serving a similar structural function in the network are encoded close to each other, independently of them being connected or even close to each other in the graph. While there are a lot of works that focus on preserving node connectivity, only a few works focus on preserving nodes’ structure. Properly encoding nodes’ structural information is fundamental for many real-world applications as it has been demonstrated that this information can be leveraged to successfully solve many tasks where connectivity-based methods usually fail. A typical example is the task of node classification, i.e., the assignment or prediction of a particular label for a node. Current limitations of structural representation methods are their scalability, representation meaning, and no formal proof that guaranteed the preservation of structural properties. We propose a new graph representation learning method, called Structural Iterative Representation learning approach for Graph Nodes ( SIR-GN ). In this work, we propose two variations ( SIR-GN: GMM and SIR-GN: K-Means ) and show how our best variation SIR-GN: K-Means : (1) theoretically guarantees the preservation of graph structural similarities, (2) provides a clear meaning about its representation and a way to interpret it with a specifically designed attribution procedure, and (3) is scalable and fast to compute. In addition, from our experiment, we show that SIR-GN: K-Means is often better or, in the worst-case comparable than the existing structural graph representation learning methods present in the literature. Also, we empirically show its superior scalability and computational performance when compared to other existing approaches.

Download Full-text