A Hierarchical Attention Graph Convolutional Network for Traffic Incident Impact Forecasting

Pixel-based semantic segmentation models fail to effectively express geographic objects and their topological relationships. Therefore, in semantic segmentation of remote sensing images, these models fail to avoid salt-and-pepper effects and cannot achieve high accuracy either. To solve these problems, object-based models such as graph neural networks (GNNs) are considered. However, traditional GNNs directly use similarity or spatial correlations between nodes to aggregate nodes’ information, which rely too much on the contextual information of the sample. The contextual information of the sample is often distorted, which results in a reduction in the node classification accuracy. To solve this problem, a knowledge and geo-object-based graph convolutional network (KGGCN) is proposed. The KGGCN uses superpixel blocks as nodes of the graph network and combines prior knowledge with spatial correlations during information aggregation. By incorporating the prior knowledge obtained from all samples of the study area, the receptive field of the node is extended from its sample context to the study area. Thus, the distortion of the sample context is overcome effectively. Experiments demonstrate that our model is improved by 3.7% compared with the baseline model named Cluster GCN and 4.1% compared with U-Net.

Download Full-text

SACN: A Novel Rotating Face Detector Based on Architecture Search

Electronics ◽

10.3390/electronics10050558 ◽

2021 ◽

Vol 10 (5) ◽

pp. 558

Author(s):

Anping Song ◽

Xiaokang Xu ◽

Xinyi Zhai

Keyword(s):

Face Detection ◽

Human Face ◽

Angle Error ◽

Rotation Invariant ◽

Convolutional Network ◽

Data Set ◽

Practical Applications ◽

Model Size ◽

Average Angle ◽

Face Detector

Rotation-Invariant Face Detection (RIPD) has been widely used in practical applications; however, the problem of the adjusting of the rotation-in-plane (RIP) angle of the human face still remains. Recently, several methods based on neural networks have been proposed to solve the RIP angle problem. However, these methods have various limitations, including low detecting speed, model size, and detecting accuracy. To solve the aforementioned problems, we propose a new network, called the Searching Architecture Calibration Network (SACN), which utilizes architecture search, fully convolutional network (FCN) and bounding box center cluster (CC). SACN was tested on the challenging Multi-Oriented Face Detection Data Set and Benchmark (MOFDDB) and achieved a higher detecting accuracy and almost the same speed as existing detectors. Moreover, the average angle error is optimized from the current 12.6° to 10.5°.

Download Full-text

Multi-Channel Temporal Graph Convolutional Network for Stock Return Prediction

2020 IEEE 18th International Conference on Industrial Informatics (INDIN) ◽

10.1109/indin45582.2020.9442196 ◽

2020 ◽

Author(s):

Jifeng Sun ◽

Jianwu Lin ◽

Yi Zhou

Keyword(s):

Stock Return ◽

Convolutional Network ◽

Stock Return Prediction ◽

Temporal Graph

Download Full-text

Proposing Gesture Recognition Algorithm Using Two-Stream Convolutional Network and LSTM

2020 IEEE Eighth International Conference on Communications and Electronics (ICCE) ◽

10.1109/icce48956.2021.9352147 ◽

2021 ◽

Author(s):

Phat Nguyen Huu ◽

Tien Luong Ngoc ◽

Quang Tran Minh

Keyword(s):

Gesture Recognition ◽

Recognition Algorithm ◽

Convolutional Network

Download Full-text

NPU RGB+D Dataset and a Feature-Enhanced LSTM-DGCN Method for Action Recognition of Basketball Players

Applied Sciences ◽

10.3390/app11104426 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4426

Author(s):

Chunyan Ma ◽

Ji Fan ◽

Jinghao Yao ◽

Tao Zhang

Keyword(s):

Action Recognition ◽

Large Scale ◽

Short Term Memory ◽

Evaluation Criteria ◽

Image Data ◽

Basketball Player ◽

Basketball Players ◽

Convolutional Network ◽

Atomic Actions ◽

New Feature

Computer vision-based action recognition of basketball players in basketball training and competition has gradually become a research hotspot. However, owing to the complex technical action, diverse background, and limb occlusion, it remains a challenging task without effective solutions or public dataset benchmarks. In this study, we defined 32 kinds of atomic actions covering most of the complex actions for basketball players and built the dataset NPU RGB+D (a large scale dataset of basketball action recognition with RGB image data and Depth data captured in Northwestern Polytechnical University) for 12 kinds of actions of 10 professional basketball players with 2169 RGB+D videos and 75 thousand frames, including RGB frame sequences, depth maps, and skeleton coordinates. Through extracting the spatial features of the distances and angles between the joint points of basketball players, we created a new feature-enhanced skeleton-based method called LSTM-DGCN for basketball player action recognition based on the deep graph convolutional network (DGCN) and long short-term memory (LSTM) methods. Many advanced action recognition methods were evaluated on our dataset and compared with our proposed method. The experimental results show that the NPU RGB+D dataset is very competitive with the current action recognition algorithms and that our LSTM-DGCN outperforms the state-of-the-art action recognition methods in various evaluation criteria on our dataset. Our action classifications and this NPU RGB+D dataset are valuable for basketball player action recognition techniques. The feature-enhanced LSTM-DGCN has a more accurate action recognition effect, which improves the motion expression ability of the skeleton data.

Download Full-text

Effects of Pooling Operations on Prediction of Ligand Rotation‐Dependent Protein–Ligand Binding in 3D Graph Convolutional Network

Bulletin of the Korean Chemical Society ◽

10.1002/bkcs.12267 ◽

2021 ◽

Author(s):

Yeji Kim ◽

Jihoo Kim ◽

Won June Kim ◽

Eok Kyun Lee ◽

Insung S. Choi

Keyword(s):

Ligand Binding ◽

Convolutional Network ◽

Dependent Protein

Download Full-text

Efficient End-to-End Sentence-Level Lipreading with Temporal Convolutional Networks

Applied Sciences ◽

10.3390/app11156975 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6975

Author(s):

Tao Zhang ◽

Lun He ◽

Xudong Li ◽

Guoqing Feng

Keyword(s):

Performance Improvement ◽

State Of The Art ◽

Error Rates ◽

Convolutional Network ◽

Convolutional Networks ◽

Sentence Level ◽

End To End ◽

High Level ◽

Improved Accuracy ◽

Talking Face

Lipreading aims to recognize sentences being spoken by a talking face. In recent years, the lipreading method has achieved a high level of accuracy on large datasets and made breakthrough progress. However, lipreading is still far from being solved, and existing methods tend to have high error rates on the wild data and have the defects of disappearing training gradient and slow convergence. To overcome these problems, we proposed an efficient end-to-end sentence-level lipreading model, using an encoder based on a 3D convolutional network, ResNet50, Temporal Convolutional Network (TCN), and a CTC objective function as the decoder. More importantly, the proposed architecture incorporates TCN as a feature learner to decode feature. It can partly eliminate the defects of RNN (LSTM, GRU) gradient disappearance and insufficient performance, and this yields notable performance improvement as well as faster convergence. Experiments show that the training and convergence speed are 50% faster than the state-of-the-art method, and improved accuracy by 2.4% on the GRID dataset.

Download Full-text

Ensemble manifold regularized multi-modal graph convolutional network for cognitive ability prediction

IEEE Transactions on Biomedical Engineering ◽

10.1109/tbme.2021.3077875 ◽

2021 ◽

pp. 1-1

Author(s):

Gang Qu ◽

Li Xiao ◽

Wenxing Hu ◽

Junqi Wang ◽

Kun Zhang ◽

...

Keyword(s):

Cognitive Ability ◽

Convolutional Network

Download Full-text

A Hierarchical Attention Graph Convolutional Network for Traffic Incident Impact Forecasting

Practically Feasible Design for Convolutional Network Code

Traffic incident detection system based on video image processing

Knowledge and Geo-Object Based Graph Convolutional Network for Remote Sensing Semantic Segmentation

SACN: A Novel Rotating Face Detector Based on Architecture Search

Multi-Channel Temporal Graph Convolutional Network for Stock Return Prediction

Proposing Gesture Recognition Algorithm Using Two-Stream Convolutional Network and LSTM

NPU RGB+D Dataset and a Feature-Enhanced LSTM-DGCN Method for Action Recognition of Basketball Players

Effects of Pooling Operations on Prediction of Ligand Rotation‐Dependent Protein–Ligand Binding in 3D Graph Convolutional Network

Efficient End-to-End Sentence-Level Lipreading with Temporal Convolutional Networks

Ensemble manifold regularized multi-modal graph convolutional network for cognitive ability prediction

Export Citation Format