STG2Seq: Spatial-Temporal Graph to Sequence Model for Multi-step Passenger Demand Forecasting

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/274 ◽

2019 ◽

Cited By ~ 5

Author(s):

Lei Bai ◽

Lina Yao ◽

Salil S. Kanhere ◽

Xianzhi Wang ◽

Quan Z. Sheng

Keyword(s):

State Of The Art ◽

Demand Forecasting ◽

Short Term ◽

Demand Prediction ◽

On Demand ◽

Passenger Demand ◽

Temporal Correlations ◽

Output Module ◽

Real World Datasets

Multi-step passenger demand forecasting is a crucial task in on-demand vehicle sharing services. However, predicting passenger demand is generally challenging due to the nonlinear and dynamic spatial-temporal dependencies. In this work, we propose to model multi-step citywide passenger demand prediction based on a graph and use a hierarchical graph convolutional structure to capture both spatial and temporal correlations simultaneously. Our model consists of three parts: 1) a long-term encoder to encode historical passenger demands; 2) a short-term encoder to derive the next-step prediction for generating multi-step prediction; 3) an attention-based output module to model the dynamic temporal and channel-wise information. Experiments on three real-world datasets show that our model consistently outperforms many baseline methods and state-of-the-art models.

Download Full-text

Where to Go Next: Modeling Long- and Short-Term User Preferences for Point-of-Interest Recommendation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5353 ◽

2020 ◽

Vol 34 (01) ◽

pp. 214-221 ◽

Cited By ~ 3

Author(s):

Ke Sun ◽

Tieyun Qian ◽

Tong Chen ◽

Yile Liang ◽

Quoc Viet Hung Nguyen ◽

...

Keyword(s):

State Of The Art ◽

User Preferences ◽

Short Term ◽

Preference Modeling ◽

Point Of Interest ◽

Proposed Model ◽

Poi Recommendation ◽

Novel Method ◽

Real World Datasets

Point-of-Interest (POI) recommendation has been a trending research topic as it generates personalized suggestions on facilities for users from a large number of candidate venues. Since users' check-in records can be viewed as a long sequence, methods based on recurrent neural networks (RNNs) have recently shown promising applicability for this task. However, existing RNN-based methods either neglect users' long-term preferences or overlook the geographical relations among recently visited POIs when modeling users' short-term preferences, thus making the recommendation results unreliable. To address the above limitations, we propose a novel method named Long- and Short-Term Preference Modeling (LSTPM) for next-POI recommendation. In particular, the proposed model consists of a nonlocal network for long-term preference modeling and a geo-dilated RNN for short-term preference learning. Extensive experiments on two real-world datasets demonstrate that our model yields significant improvements over the state-of-the-art methods.

Download Full-text

A Review-Driven Neural Model for Sequential Recommendation

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/397 ◽

2019 ◽

Cited By ~ 5

Author(s):

Chenliang Li ◽

Xichuan Niu ◽

Xiangyang Luo ◽

Zhenzhong Chen ◽

Cong Quan

Keyword(s):

Performance Improvement ◽

State Of The Art ◽

Neural Model ◽

Sequential Patterns ◽

Short Term ◽

User Reviews ◽

Individual Level ◽

Significant Performance ◽

Real World Datasets

Writing review for a purchased item is a unique channel to express a user's opinion in E-Commerce. Recently, many deep learning based solutions have been proposed by exploiting user reviews for rating prediction. In contrast, there has been few attempt to enlist the semantic signals covered by user reviews for the task of collaborative filtering. In this paper, we propose a novel review-driven neural sequential recommendation model (named RNS) by considering user's intrinsic preference (long-term) and sequential patterns (short-term). In detail, RNS is devised to encode each user or item with the aspect-aware representations extracted from the reviews. Given a sequence of historical purchased items for a user, we devise a novel hierarchical attention over attention mechanism to capture sequential patterns at both union-level and individual-level. Extensive experiments on three real-world datasets of different domains demonstrate that RNS obtains significant performance improvement over uptodate state-of-the-art sequential recommendation models.

Download Full-text

An Attention-Based Recommender System to Predict Contextual Intent Based on Choice Histories across and within Sessions

Applied Sciences ◽

10.3390/app8122426 ◽

2018 ◽

Vol 8 (12) ◽

pp. 2426 ◽

Cited By ~ 2

Author(s):

Ruo Huang ◽

Shelby McIntyre ◽

Meina Song ◽

Haihong E ◽

Zhonghong Ou

Keyword(s):

Recommender Systems ◽

State Of The Art ◽

User Profile ◽

Vital Role ◽

Short Term ◽

Novel Approach ◽

Learning Techniques ◽

Real World Datasets ◽

Near Future

Recent years have witnessed the growth of recommender systems, with the help of deep learning techniques. Recurrent Neural Networks (RNNs) play an increasingly vital role in various session-based recommender systems, since they use the user’s sequential history to build a comprehensive user profile, which helps improve the recommendation. However, a problem arises regarding how to be aware of the variation in the user’s contextual preference, especially the short-term intent in the near future, and make the best use of it to produce a precise recommendation at the start of a session. We propose a novel approach named Attention-based Short-term and Long-term Model (ASLM), to improve the next-item recommendation, by using an attention-based RNNs integrating both the user’s short-term intent and the long-term preference at the same time with a two-layer network. The experimental study on three real-world datasets and two sub-datasets demonstrates that, compared with other state-of-the-art methods, the proposed approach can significantly improve the next-item recommendation, especially at the start of sessions. As a result, our proposed approach is capable of coping with the cold-start problem at the beginning of each session.

Download Full-text

Short-Term Bus Passenger Demand Prediction Based on Time Series Model and Interactive Multiple Model Approach

Discrete Dynamics in Nature and Society ◽

10.1155/2015/682390 ◽

2015 ◽

Vol 2015 ◽

pp. 1-11 ◽

Cited By ~ 17

Author(s):

Rui Xue ◽

Daniel (Jian) Sun ◽

Shukai Chen

Keyword(s):

Time Series ◽

Demand Forecasting ◽

Performance Comparison ◽

Practical Significance ◽

Multiple Model ◽

Short Term ◽

Demand Prediction ◽

Passenger Demand ◽

Bus Scheduling ◽

Interactive Multiple Model

Although bus passenger demand prediction has attracted increased attention during recent years, limited research has been conducted in the context of short-term passenger demand forecasting. This paper proposes an interactive multiple model (IMM) filter algorithm-based model to predict short-term passenger demand. After aggregated in 15 min interval, passenger demand data collected from a busy bus route over four months were used to generate time series. Considering that passenger demand exhibits various characteristics in different time scales, three time series were developed, named weekly, daily, and 15 min time series. After the correlation, periodicity, and stationarity analyses, time series models were constructed. Particularly, the heteroscedasticity of time series was explored to achieve better prediction performance. Finally, IMM filter algorithm was applied to combine individual forecasting models with dynamically predicted passenger demand for next interval. Different error indices were adopted for the analyses of individual and hybrid models. The performance comparison indicates that hybrid model forecasts are superior to individual ones in accuracy. Findings of this study are of theoretical and practical significance in bus scheduling.

Download Full-text

Non Cash Payment and Demand for Real Money in Indonesia

Journal of Economics Business and Accountancy Ventura ◽

10.14414/jebav.v22i1.1575 ◽

2019 ◽

Vol 22 (1) ◽

Author(s):

Wasiaturrahma Wasiaturrahma ◽

Yuliana Tri Wahyuningtyas ◽

Shochrul Rohmatul Ajija

Keyword(s):

Error Correction ◽

Credit Card ◽

Error Correction Model ◽

Short Term ◽

Correction Model ◽

Real Money ◽

On Demand ◽

Cash Payment ◽

The Impact

The study analyses the impact of non-cash payment on demand for real money in Indonesia from 2010 to 2015. Utilizing the Error Correction Model (ECM), the results reveal that the use of both debit and credit card influence the demand for real money in the long term. Moreover, debit card also significantly affects the demand for real money in the short term, while the use of credit card does not have the implication.

Download Full-text

Hybrid Spatial–Temporal Graph Convolutional Networks for On-Street Parking Availability Prediction

Remote Sensing ◽

10.3390/rs13163338 ◽

2021 ◽

Vol 13 (16) ◽

pp. 3338

Author(s):

Xiao Xiao ◽

Zhiling Jin ◽

Yilong Hui ◽

Yueshen Xu ◽

Wei Shao

Keyword(s):

Smart Cities ◽

Convolutional Networks ◽

Temporal Correlations ◽

Spatial Features ◽

Temporal Features ◽

Prediction Scheme ◽

Temporal Graph ◽

Real World Datasets ◽

The Internet Of Things

With the development of sensors and of the Internet of Things (IoT), smart cities can provide people with a variety of information for a more convenient life. Effective on-street parking availability prediction can improve parking efficiency and, at times, alleviate city congestion. Conventional methods of parking availability prediction often do not consider the spatial–temporal features of parking duration distributions. To this end, we propose a parking space prediction scheme called the hybrid spatial–temporal graph convolution networks (HST-GCNs). We use graph convolutional networks and gated linear units (GLUs) with a 1D convolutional neural network to obtain the spatial features and the temporal features, respectively. Then, we construct a spatial–temporal convolutional block to obtain the instantaneous spatial–temporal correlations. Based on the similarity of the parking duration distributions, we propose an attention mechanism called distAtt to measure the similarity of parking duration distributions. Through the distAtt mechanism, we add the long-term spatial–temporal correlations to our spatial–temporal convolutional block, and thus, we can capture complex hybrid spatial–temporal correlations to achieve a higher accuracy of parking availability prediction. Based on real-world datasets, we compare the proposed scheme with the benchmark models. The experimental results show that the proposed scheme has the best performance in predicting the parking occupancy rate.

Download Full-text

Connectionist Temporal Modeling of Video and Language: a Joint Model for Translation and Sign Labeling

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/106 ◽

2019 ◽

Cited By ~ 1

Author(s):

Dan Guo ◽

Shengeng Tang ◽

Meng Wang

Keyword(s):

Dynamic Programming ◽

Objective Function ◽

Visual Representations ◽

Sequential Learning ◽

Short Term ◽

Temporal Modeling ◽

Temporal Correlations ◽

Feature Correlation ◽

3D Cnn

Online sign interpretation suffers from challenges presented by hybrid semantics learning among sequential variations of visual representations, sign linguistics, and textual grammars. This paper proposes a Connectionist Temporal Modeling (CTM) network for sentence translation and sign labeling. To acquire short-term temporal correlations, a Temporal Convolution Pyramid (TCP) module is performed on 2D CNN features to realize (2D+1D)=pseudo 3D' CNN features. CTM aligns the pseudo 3D' with the original 3D CNN clip features and fuses them. Next, we implement a connectionist decoding scheme for long-term sequential learning. Here, we embed dynamic programming into the decoding scheme, which learns temporal mapping among features, sign labels, and the generated sentence directly. The solution using dynamic programming to sign labeling is considered as pseudo labels. Finally, we utilize the pseudo supervision cues in an end-to-end framework. A joint objective function is designed to measure feature correlation, entropy regularization on sign labeling, and probability maximization on sentence decoding. The experimental results using the RWTH-PHOENIX-Weather and USTC-CSL datasets demonstrate the effectiveness of the proposed approach.

Download Full-text

Learning Long- and Short-Term User Literal-Preference with Multimodal Hierarchical Transformer Network for Personalized Image Caption

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6503 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9571-9578 ◽

Cited By ~ 1

Author(s):

Wei Zhang ◽

Yue Ying ◽

Pan Lu ◽

Hongyuan Zha

Keyword(s):

State Of The Art ◽

Natural Extension ◽

Target Image ◽

Short Term ◽

Image Representations ◽

High Level ◽

Image Descriptions ◽

Shed Light ◽

Image Caption

Personalized image caption, a natural extension of the standard image caption task, requires to generate brief image descriptions tailored for users' writing style and traits, and is more practical to meet users' real demands. Only a few recent studies shed light on this crucial task and learn static user representations to capture their long-term literal-preference. However, it is insufficient to achieve satisfactory performance due to the intrinsic existence of not only long-term user literal-preference, but also short-term literal-preference which is associated with users' recent states. To bridge this gap, we develop a novel multimodal hierarchical transformer network (MHTN) for personalized image caption in this paper. It learns short-term user literal-preference based on users' recent captions through a short-term user encoder at the low level. And at the high level, the multimodal encoder integrates target image representations with short-term literal-preference, as well as long-term literal-preference learned from user IDs. These two encoders enjoy the advantages of the powerful transformer networks. Extensive experiments on two real datasets show the effectiveness of considering two types of user literal-preference simultaneously and better performance over the state-of-the-art models.

Download Full-text

Learning from Interventions Using Hierarchical Policies for Safe Learning

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i06.6602 ◽

2020 ◽

Vol 34 (06) ◽

pp. 10352-10360

Author(s):

Jing Bi ◽

Vikas Dhiman ◽

Tianyou Xiao ◽

Chenliang Xu

Keyword(s):

Reaction Time ◽

State Of The Art ◽

The State ◽

Policy Framework ◽

Asymptotic Performance ◽

Short Term ◽

Learning From Demonstrations ◽

Hierarchical Levels ◽

Long Term Behavior

Learning from Demonstrations (LfD) via Behavior Cloning (BC) works well on multiple complex tasks. However, a limitation of the typical LfD approach is that it requires expert demonstrations for all scenarios, including those in which the algorithm is already well-trained. The recently proposed Learning from Interventions (LfI) overcomes this limitation by using an expert overseer. The expert overseer only intervenes when it suspects that an unsafe action is about to be taken. Although LfI significantly improves over LfD, the state-of-the-art LfI fails to account for delay caused by the expert's reaction time and only learns short-term behavior. We address these limitations by 1) interpolating the expert's interventions back in time, and 2) by splitting the policy into two hierarchical levels, one that generates sub-goals for the future and another that generates actions to reach those desired sub-goals. This sub-goal prediction forces the algorithm to learn long-term behavior while also being robust to the expert's reaction time. Our experiments show that LfI using sub-goals in a hierarchical policy framework trains faster and achieves better asymptotic performance than typical LfD.

Download Full-text

Long-term target tracking combined with re-detection

10.21203/rs.3.rs-51036/v3 ◽

2020 ◽

Author(s):

Juanjuan Wang ◽

HaoRan Yang ◽

Ning Xu ◽

Chengqin Wu ◽

ZengShun Zhao ◽

...

Keyword(s):

Correlation Energy ◽

State Of The Art ◽

Tracking Algorithm ◽

Correlation Filters ◽

Short Term ◽

Tracking Method ◽

Tracking Tasks ◽

Svm Model ◽

Confidence Degree

Abstract The long-term visual tracking undergoes more challenges and is closer to realistic applications than short-term tracking. However, the performances of most existing methods have been limited in the long-term tracking tasks. In this work, we present a reliable yet simple long-term tracking method, which extends the state-of-the-art Learning Adaptive Discriminative Correlation Filters (LADCF) tracking algorithm with a re-detection component based on the SVM model. The LADCF tracking algorithm localizes the target in each frame and the re-detector is able to efficiently re-detect the target in the whole image when the tracking fails. We further introduce a robust confidence degree evaluation criterion that combines the maximum response criterion and the average peak-to correlation energy (APCE) to judge the confidence level of the predicted target. When the confidence degree is generally high, the SVM is updated accordingly. If the confidence drops sharply, the SVM re-detects the target. We perform extensive experiments on the OTB-2015 and UAV123 datasets. The experimental results demonstrate the effectiveness of our algorithm in long-term tracking.

Download Full-text