A Self-Attention Augmented Graph Convolutional Clustering Networks for Skeleton-Based Video Anomaly Behavior Detection

In this paper, we propose a new method for detecting abnormal human behavior based on skeleton features using self-attention augment graph convolution. The skeleton data have been proved to be robust to the complex background, illumination changes, and dynamic camera scenes and are naturally constructed as a graph in non-Euclidean space. Particularly, the establishment of spatial temporal graph convolutional networks (ST-GCN) can effectively learn the spatio-temporal relationships of Non-Euclidean Structure Data. However, it only operates on local neighborhood nodes and thereby lacks global information. We propose a novel spatial temporal self-attention augmented graph convolutional networks (SAA-Graph) by combining improved spatial graph convolution operator with a modified transformer self-attention operator to capture both local and global information of the joints. The spatial self-attention augmented module is used to understand the intra-frame relationships between human body parts. As far as we know, we are the first group to utilize self-attention for video anomaly detection tasks by enhancing spatial temporal graph convolution. Moreover, to validate the proposed model, we performed extensive experiments on two large-scale publicly standard datasets (i.e., ShanghaiTech Campus and CUHK Avenue datasets) which reveal the state-of-art performance for our proposed approach when compared to existing skeleton-based methods and graph convolution methods.

Download Full-text

Modelling of Bi-Directional Spatio-Temporal Dependence and Users’ Dynamic Preferences for Missing POI Check-In Identification

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33015458 ◽

2019 ◽

Vol 33 ◽

pp. 5458-5465 ◽

Cited By ~ 2

Author(s):

Dongbo Xi ◽

Fuzhen Zhuang ◽

Yanchi Liu ◽

Jingjing Gu ◽

Hui Xiong ◽

...

Keyword(s):

Large Scale ◽

Real Life ◽

Location Prediction ◽

Temporal Dependence ◽

Great Opportunity ◽

Mobility Data ◽

Proposed Model ◽

Poi Recommendation ◽

Spatio Temporal ◽

Dynamic Preferences

Human mobility data accumulated from Point-of-Interest (POI) check-ins provides great opportunity for user behavior understanding. However, data quality issues (e.g., geolocation information missing, unreal check-ins, data sparsity) in real-life mobility data limit the effectiveness of existing POIoriented studies, e.g., POI recommendation and location prediction, when applied to real applications. To this end, in this paper, we develop a model, named Bi-STDDP, which can integrate bi-directional spatio-temporal dependence and users’ dynamic preferences, to identify the missing POI check-in where a user has visited at a specific time. Specifically, we first utilize bi-directional global spatial and local temporal information of POIs to capture the complex dependence relationships. Then, target temporal pattern in combination with user and POI information are fed into a multi-layer network to capture users’ dynamic preferences. Moreover, the dynamic preferences are transformed into the same space as the dependence relationships to form the final model. Finally, the proposed model is evaluated on three large-scale real-world datasets and the results demonstrate significant improvements of our model compared with state-of-the-art methods. Also, it is worth noting that the proposed model can be naturally extended to address POI recommendation and location prediction tasks with competitive performances.

Download Full-text

Sentence Compression Using BERT and Graph Convolutional Networks

Applied Sciences ◽

10.3390/app11219910 ◽

2021 ◽

Vol 11 (21) ◽

pp. 9910

Author(s):

Yo-Han Park ◽

Gyong-Ho Lee ◽

Yong-Seok Choi ◽

Kong-Joo Lee

Keyword(s):

Language Processing ◽

Large Scale ◽

Convolutional Network ◽

Convolutional Networks ◽

Compression Model ◽

Sentence Compression ◽

Dependency Tree ◽

Proposed Model ◽

Syntactic Information ◽

Input Sentence

Sentence compression is a natural language-processing task that produces a short paraphrase of an input sentence by deleting words from the input sentence while ensuring grammatical correctness and preserving meaningful core information. This study introduces a graph convolutional network (GCN) into a sentence compression task to encode syntactic information, such as dependency trees. As we upgrade the GCN to activate a directed edge, the compression model with the GCN layers can distinguish between parent and child nodes in a dependency tree when aggregating adjacent nodes. Furthermore, by increasing the number of GCN layers, the model can gradually collect high-order information of a dependency tree when propagating node information through the layers. We implement a sentence compression model for Korean and English, respectively. This model consists of three components: pre-trained BERT model, GCN layers, and a scoring layer. The scoring layer can determine whether a word should remain in a compressed sentence by relying on the word vector containing contextual and syntactic information encoded by BERT and GCN layers. To train and evaluate the proposed model, we used the Google sentence compression dataset for English and a Korean sentence compression corpus containing about 140,000 sentence pairs for Korean. The experimental results demonstrate that the proposed model achieves state-of-the-art performance for English. To the best of our knowledge, this sentence compression model based on the deep learning model trained with a large-scale corpus is the first attempt for Korean.

Download Full-text

Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation

2020 IEEE Winter Conference on Applications of Computer Vision (WACV) ◽

10.1109/wacv45572.2020.9093361 ◽

2020 ◽

Cited By ~ 2

Author(s):

Pallabi Ghosh ◽

Yi Yao ◽

Larry S. Davis ◽

Ajay Divakaran

Keyword(s):

Convolutional Networks ◽

Action Segmentation ◽

Temporal Graph ◽

Spatio Temporal

Download Full-text

Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Sensors ◽

10.3390/s20185260 ◽

2020 ◽

Vol 20 (18) ◽

pp. 5260 ◽

Cited By ~ 1

Author(s):

Fanjia Li ◽

Juanjuan Li ◽

Aichun Zhu ◽

Yonggang Xu ◽

Hongsheng Yin ◽

...

Keyword(s):

Action Recognition ◽

Large Scale ◽

Optimal Solution ◽

Human Action Recognition ◽

Human Action ◽

Convolutional Network ◽

Spatial Graph ◽

Serial Connection ◽

In Series ◽

Temporal Graph

In the skeleton-based human action recognition domain, the spatial-temporal graph convolution networks (ST-GCNs) have made great progress recently. However, they use only one fixed temporal convolution kernel, which is not enough to extract the temporal cues comprehensively. Moreover, simply connecting the spatial graph convolution layer (GCL) and the temporal GCL in series is not the optimal solution. To this end, we propose a novel enhanced spatial and extended temporal graph convolutional network (EE-GCN) in this paper. Three convolution kernels with different sizes are chosen to extract the discriminative temporal features from shorter to longer terms. The corresponding GCLs are then concatenated by a powerful yet efficient one-shot aggregation (OSA) + effective squeeze-excitation (eSE) structure. The OSA module aggregates the features from each layer once to the output, and the eSE module explores the interdependency between the channels of the output. Besides, we propose a new connection paradigm to enhance the spatial features, which expand the serial connection to a combination of serial and parallel connections by adding a spatial GCL in parallel with the temporal GCLs. The proposed method is evaluated on three large scale datasets, and the experimental results show that the performance of our method exceeds previous state-of-the-art methods.

Download Full-text

Exploring Gyro-Sulcal Functional Connectivity Differences Across Task Domains via Anatomy-Guided Spatio-Temporal Graph Convolutional Networks

10.1007/978-3-030-87589-3_14 ◽

2021 ◽

pp. 130-139

Author(s):

Mingxin Jiang ◽

Shimin Yang ◽

Zhongbo Zhao ◽

Jiadong Yan ◽

Yuzhong Chen ◽

...

Keyword(s):

Functional Connectivity ◽

Convolutional Networks ◽

Temporal Graph ◽

Spatio Temporal

Download Full-text

Spatio-Temporal Graph Convolutional Networks for Traffic Forecasting

10.1145/3474717.3484207 ◽

2021 ◽

Author(s):

Yuen Hoi Lau ◽

Raymond Chi-Wing Wong

Keyword(s):

Traffic Forecasting ◽

Convolutional Networks ◽

Temporal Graph ◽

Spatio Temporal

Download Full-text

Hybrid Low-Order and Higher-Order Graph Convolutional Networks

Computational Intelligence and Neuroscience ◽

10.1155/2020/3283890 ◽

2020 ◽

Vol 2020 ◽

pp. 1-9

Author(s):

Fangyuan Lei ◽

Xun Liu ◽

Qingyun Dai ◽

Bingo Wing-Kuen Ling ◽

Huimin Zhao ◽

...

Keyword(s):

Computational Complexity ◽

Large Scale ◽

Representation Learning ◽

Higher Order ◽

Semisupervised Learning ◽

Graph Representation ◽

Convolutional Network ◽

Convolutional Networks ◽

Proposed Model ◽

Low Order

With the higher-order neighborhood information of a graph network, the accuracy of graph representation learning classification can be significantly improved. However, the current higher-order graph convolutional networks have a large number of parameters and high computational complexity. Therefore, we propose a hybrid lower-order and higher-order graph convolutional network (HLHG) learning model, which uses a weight sharing mechanism to reduce the number of network parameters. To reduce the computational complexity, we propose a novel information fusion pooling layer to combine the high-order and low-order neighborhood matrix information. We theoretically compare the computational complexity and the number of parameters of the proposed model with those of the other state-of-the-art models. Experimentally, we verify the proposed model on large-scale text network datasets using supervised learning and on citation network datasets using semisupervised learning. The experimental results show that the proposed model achieves higher classification accuracy with a small set of trainable weight parameters.

Download Full-text

Dynamic Spatio-Temporal Graph Convolutional Networks For Cardiac Motion Analysis

2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) ◽

10.1109/isbi48211.2021.9433890 ◽

2021 ◽

Author(s):

Ping Lu ◽

Wenjia Bai ◽

Daniel Rueckert ◽

J.Alison Noble

Keyword(s):

Motion Analysis ◽

Cardiac Motion ◽

Convolutional Networks ◽

Temporal Graph ◽

Spatio Temporal

Download Full-text

Demand prediction for a public bike sharing program based on spatio-temporal graph convolutional networks

Multimedia Tools and Applications ◽

10.1007/s11042-020-08803-y ◽

2020 ◽

Cited By ~ 6

Author(s):

Guangnian Xiao ◽

Ruinan Wang ◽

Chunqin Zhang ◽

Anning Ni

Keyword(s):

Demand Prediction ◽

Convolutional Networks ◽

Bike Sharing ◽

Temporal Graph ◽

Spatio Temporal

Download Full-text

Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/505 ◽

2018 ◽

Cited By ~ 124

Author(s):

Bing Yu ◽

Haoteng Yin ◽

Zhanxing Zhu

Keyword(s):

Deep Learning ◽

Traffic Control ◽

Time Series Prediction ◽

Urban Traffic ◽

Traffic Forecasting ◽

Convolutional Networks ◽

Learning Framework ◽

Temporal Correlations ◽

Temporal Graph ◽

Spatio Temporal

Timely accurate traffic forecast is crucial for urban traffic control and guidance. Due to the high nonlinearity and complexity of traffic flow, traditional methods cannot satisfy the requirements of mid-and-long term prediction tasks and often neglect spatial and temporal dependencies. In this paper, we propose a novel deep learning framework, Spatio-Temporal Graph Convolutional Networks (STGCN), to tackle the time series prediction problem in traffic domain. Instead of applying regular convolutional and recurrent units, we formulate the problem on graphs and build the model with complete convolutional structures, which enable much faster training speed with fewer parameters. Experiments show that our model STGCN effectively captures comprehensive spatio-temporal correlations through modeling multi-scale traffic networks and consistently outperforms state-of-the-art baselines on various real-world traffic datasets.

Download Full-text