scholarly journals Deep Interest Evolution Network for Click-Through Rate Prediction

Author(s):  
Guorui Zhou ◽  
Na Mou ◽  
Ying Fan ◽  
Qi Pi ◽  
Weijie Bian ◽  
...  

Click-through rate (CTR) prediction, whose goal is to estimate the probability of a user clicking on the item, has become one of the core tasks in the advertising system. For CTR prediction model, it is necessary to capture the latent user interest behind the user behavior data. Besides, considering the changing of the external environment and the internal cognition, user interest evolves over time dynamically. There are several CTR prediction methods for interest modeling, while most of them regard the representation of behavior as the interest directly, and lack specially modeling for latent interest behind the concrete behavior. Moreover, little work considers the changing trend of the interest. In this paper, we propose a novel model, named Deep Interest Evolution Network (DIEN), for CTR prediction. Specifically, we design interest extractor layer to capture temporal interests from history behavior sequence. At this layer, we introduce an auxiliary loss to supervise interest extracting at each step. As user interests are diverse, especially in the e-commerce system, we propose interest evolving layer to capture interest evolving process that is relative to the target item. At interest evolving layer, attention mechanism is embedded into the sequential structure novelly, and the effects of relative interests are strengthened during interest evolution. In the experiments on both public and industrial datasets, DIEN significantly outperforms the state-of-the-art solutions. Notably, DIEN has been deployed in the display advertisement system of Taobao, and obtained 20.7% improvement on CTR.

2020 ◽  
Vol 34 (01) ◽  
pp. 156-163 ◽  
Author(s):  
Zequn Lyu ◽  
Yu Dong ◽  
Chengfu Huo ◽  
Weijun Ren

Click-through rate (CTR) prediction is a core task in the field of recommender system and many other applications. For CTR prediction model, personalization is the key to improve the performance and enhance the user experience. Recently, several models are proposed to extract user interest from user behavior data which reflects user's personalized preference implicitly. However, existing works in the field of CTR prediction mainly focus on user representation and pay less attention on representing the relevance between user and item, which directly measures the intensity of user's preference on target item. Motivated by this, we propose a novel model named Deep Match to Rank (DMR) which combines the thought of collaborative filtering in matching methods for the ranking task in CTR prediction. In DMR, we design User-to-Item Network and Item-to-Item Network to represent the relevance in two forms. In User-to-Item Network, we represent the relevance between user and item by inner product of the corresponding representation in the embedding space. Meanwhile, an auxiliary match network is presented to supervise the training and push larger inner product to represent higher relevance. In Item-to-Item Network, we first calculate the item-to-item similarities between user interacted items and target item by attention mechanism, and then sum up the similarities to obtain another form of user-to-item relevance. We conduct extensive experiments on both public and industrial datasets to validate the effectiveness of our model, which outperforms the state-of-art models significantly.


Author(s):  
Yufei Feng ◽  
Fuyu Lv ◽  
Weichen Shen ◽  
Menghan Wang ◽  
Fei Sun ◽  
...  

Click-Through Rate (CTR) prediction plays an important role in many industrial applications, such as online advertising and recommender systems. How to capture users' dynamic and evolving interests from their behavior sequences remains a continuous research topic in the CTR prediction. However, most existing studies overlook the intrinsic structure of the sequences: the sequences are composed of sessions, where sessions are user behaviors separated by their occurring time. We observe that user behaviors are highly homogeneous in each session, and heterogeneous cross sessions. Based on this observation, we propose a novel CTR model named Deep Session Interest Network (DSIN) that leverages users' multiple historical sessions in their behavior sequences. We first use self-attention mechanism with bias encoding to extract users' interests in each session. Then we apply Bi-LSTM to model how users' interests evolve and interact among sessions. Finally, we employ the local activation unit to adaptively learn the influences of various session interests on the target item. Experiments are conducted on both advertising and production recommender datasets and DSIN outperforms other state-of-the-art models on both datasets.


2021 ◽  
Vol 7 ◽  
pp. e716
Author(s):  
Qiao Lu ◽  
Silin Li ◽  
Tuo Yang ◽  
Chenheng Xu

Recent advances in communication enable individuals to use phones and computers to access information on the web. E-commerce has seen rapid development, e.g., Alibaba has nearly 12 hundred million customers in China. Click-Through Rate (CTR) forecasting is a primary task in the e-commerce advertisement system. From the traditional Logistic Regression algorithm to the latest popular deep neural network methods that follow a similar embedding and MLP, several algorithms are used to predict CTR. This research proposes a hybrid model combining the Deep Interest Network (DIN) and eXtreme Deep Factorization Machine (xDeepFM) to perform CTR prediction robustly. The cores of DIN and xDeepFM are attention and feature cross, respectively. DIN follows an adaptive local activation unit that incorporates the attention mechanism to adaptively learn user interest from historical behaviors related to specific advertisements. xDeepFM further includes a critical part, a Compressed Interactions Network (CIN), aiming to generate feature interactions at a vectorwise level implicitly. Furthermore, a CIN, plain DNN, and a linear part are combined into one unified model to form xDeepFM. The proposed end-to-end hybrid model is a parallel ensemble of models via multilayer perceptron. CIN and xDeepFM are trained in parallel, and their output is fed into a multilayer perceptron. We used the e-commerce Alibaba dataset with the focal loss as the loss function for experimental evaluation through online complex example mining (OHEM) in the training process. The experimental result indicates that the proposed hybrid model has better performance than other models.


2020 ◽  
pp. 1-11
Author(s):  
Wenqiang Zhu

First, the recommendation system and its advantages are introduced in detail, and based on the characteristics of the intelligent topic logical interest set resource and user behavior in the existing intelligent topic logical interest set resource platform, a personalized fuzzy logic model of the intelligent topic logical interest set resource is established and adapted to it. The personalized fuzzy logic user personalized fuzzy logic interest model of personalized fuzzy logic is designed, and the user personalized fuzzy logic interest transfer method is designed to simulate the user learning process. Secondly, on the basis of the established model, according to the idea of collaborative filtering, the personalized fuzzy logic user’s personalized fuzzy logic interest value and the user’s rating of resources are respectively predicted, and the two prediction results are combined to recommend resources to the user. Finally, the ontology is applied to user interest description, and a method based on personalized fuzzy logic user rough interest vector and nearest neighbor concept aggregation is proposed to find fine-grained user interest and recommend interest resources. Experimental tests show that this method can better describe the composition and development of user interests, making the recommendation effect of interest resources for specific users more accurate and reliable. The problem of collaborative recommendation in personalized fuzzy logic systems is further studied, the basic principles and typical technologies of collaborative recommendation are analyzed, and the collaborative recommendation method based on users with similar interests and the collaborative recommendation method based on weighted association rules are proposed.


Author(s):  
Jing Song ◽  
Hong Shen ◽  
Zijing Ou ◽  
Junyi Zhang ◽  
Teng Xiao ◽  
...  

Session-based recommendation is a challenging problem due to the inherent uncertainty of user behavior and the limited historical click information. Latent factors and the complex dependencies within the user’s current session have an important impact on the user's main intention, but the existing methods do not explicitly consider this point. In this paper, we propose a novel model, Interest Shift and Latent Factors Combination Model (ISLF), which can capture the user's main intention by taking into account the user’s interest shift (i.e. long-term and short-term interest) and latent factors simultaneously. In addition, we experimentally give an explicit explanation of this combination in our ISLF. Our experimental results on three benchmark datasets show that our model achieves state-of-the-art performance on all test datasets.


2019 ◽  
Vol 2019 ◽  
pp. 1-13
Author(s):  
Qiaoqiao Tan ◽  
Fang’ai Liu

Recommendations based on user behavior sequences are becoming more and more common. Some studies consider user behavior sequences as interests directly, ignoring the mining and representation of implicit features. However, user behaviors contain a lot of information, such as consumption habits and dynamic preferences. In order to better locate user interests, this paper proposes a Bi-GRU neural network with attention to model user’s long-term historical preferences and short-term consumption motivations. First, a Bi-GRU network is established to solve the long-term dependence problem in sequences, and attention mechanism is introduced to capture user interest changes related to the target item. Then, user’s short-term interaction trajectory based on self-attention is modeled to distinguish the importance of each potential feature. Finally, combined with long-term and short-term interests, the next behavior is predicted. We conducted extensive experiments on Amazon and MovieLens datasets. The experimental results demonstrate that the proposed model outperforms current state-of-the-art models in Recall and NDCG indicators. Especially in MovieLens dataset, compared with other RNN-based models, our proposed model improved at least 2.32% at Recall@20, which verifies the effectiveness of modeling long-term and short-term interest of users, respectively.


Author(s):  
Wantong Lu ◽  
Yantao Yu ◽  
Yongzhe Chang ◽  
Zhen Wang ◽  
Chenhui Li ◽  
...  

Factorization Machines (FMs) refer to a class of general predictors working with real valued feature vectors, which are well-known for their ability to estimate model parameters under significant sparsity and have found successful applications in many areas such as the click-through rate (CTR) prediction. However, standard FMs only produce a single fixed representation for each feature across different input instances, which may limit the CTR model’s expressive and predictive power. Inspired by the success of Input-aware Factorization Machines (IFMs), which aim to learn more flexible and informative representations of a given feature according to different input instances, we propose a novel model named Dual Input-aware Factorization Machines (DIFMs) that can adaptively reweight the original feature representations at the bit-wise and vector-wise levels simultaneously. Furthermore, DIFMs strategically integrate various components including Multi-Head Self-Attention, Residual Networks and DNNs into a unified end-to-end model. Comprehensive experiments on two real-world CTR prediction datasets show that the DIFM model can outperform several state-of-the-art models consistently.


2017 ◽  
Vol 45 (3) ◽  
pp. 130-138 ◽  
Author(s):  
Basit Shahzad ◽  
Ikramullah Lali ◽  
M. Saqib Nawaz ◽  
Waqar Aslam ◽  
Raza Mustafa ◽  
...  

Purpose Twitter users’ generated data, known as tweets, are now not only used for communication and opinion sharing, but they are considered an important source of trendsetting, future prediction, recommendation systems and marketing. Using network features in tweet modeling and applying data mining and deep learning techniques on tweets is gaining more and more interest. Design/methodology/approach In this paper, user interests are discovered from Twitter Trends using a modeling approach that uses network-based text data (tweets). First, the popular trends are collected and stored in separate documents. These data are then pre-processed, followed by their labeling in respective categories. Data are then modeled and user interest for each Trending topic is calculated by considering positive tweets in that trend, average retweet and favorite count. Findings The proposed approach can be used to infer users’ topics of interest on Twitter and to categorize them. Support vector machine can be used for training and validation purposes. Positive tweets can be further analyzed to find user posting patterns. There is a positive correlation between tweets and Google data. Practical implications The results can be used in the development of information filtering and prediction systems, especially in personalized recommendation systems. Social implications Twitter microblogging platform offers content posting and sharing to billions of internet users worldwide. Therefore, this work has significant socioeconomic impacts. Originality/value This study guides on how Twitter network structure features can be exploited in discovering user interests using tweets. Further, positive correlation of Twitter Trends with Google Trends is reported, which validates the correctness of the authors’ approach.


Author(s):  
Trung Minh Nguyen ◽  
Thien Huu Nguyen

The previous work for event extraction has mainly focused on the predictions for event triggers and argument roles, treating entity mentions as being provided by human annotators. This is unrealistic as entity mentions are usually predicted by some existing toolkits whose errors might be propagated to the event trigger and argument role recognition. Few of the recent work has addressed this problem by jointly predicting entity mentions, event triggers and arguments. However, such work is limited to using discrete engineering features to represent contextual information for the individual tasks and their interactions. In this work, we propose a novel model to jointly perform predictions for entity mentions, event triggers and arguments based on the shared hidden representations from deep learning. The experiments demonstrate the benefits of the proposed method, leading to the state-of-the-art performance for event extraction.


Sign in / Sign up

Export Citation Format

Share Document