Road Region Detection by Spatio-Temporal Graph Segmentation of Optical Flows using On-Board Camera

Author(s):  
Kenji Nishida ◽  
Jun Fujiki ◽  
Takumi Kobayashi ◽  
Chikao Tsuchiya ◽  
Shinya Tanaka ◽  
...  
2020 ◽  
Vol 34 (07) ◽  
pp. 10713-10720
Author(s):  
Mingyu Ding ◽  
Zhe Wang ◽  
Bolei Zhou ◽  
Jianping Shi ◽  
Zhiwu Lu ◽  
...  

A major challenge for video semantic segmentation is the lack of labeled data. In most benchmark datasets, only one frame of a video clip is annotated, which makes most supervised methods fail to utilize information from the rest of the frames. To exploit the spatio-temporal information in videos, many previous works use pre-computed optical flows, which encode the temporal consistency to improve the video segmentation. However, the video segmentation and optical flow estimation are still considered as two separate tasks. In this paper, we propose a novel framework for joint video semantic segmentation and optical flow estimation. Semantic segmentation brings semantic information to handle occlusion for more robust optical flow estimation, while the non-occluded optical flow provides accurate pixel-level temporal correspondences to guarantee the temporal consistency of the segmentation. Moreover, our framework is able to utilize both labeled and unlabeled frames in the video through joint training, while no additional calculation is required in inference. Extensive experiments show that the proposed model makes the video semantic segmentation and optical flow estimation benefit from each other and outperforms existing methods under the same settings in both tasks.


Author(s):  
Yinong Zhang ◽  
Shanshan Guan ◽  
Cheng Xu ◽  
Hongzhe Liu

In the era of intelligent education, human behavior recognition based on computer vision is an important branch of pattern recognition. Human behavior recognition is a basic technology in the fields of intelligent monitoring and human-computer interaction in education. The dynamic changes of human skeleton provide important information for the recognition of educational behavior. Traditional methods usually use manual information to label or traverse rules only, resulting in limited representation capabilities and poor generalization performance of the model. In this paper, a kind of dynamic skeleton model with residual is adopted—a spatio-temporal graph convolutional network based on residual connections, which not only overcomes the limitations of previous methods, but also can learn the spatio-temporal model from the skeleton data. In the big bone NTU-RGB + D dataset, the network model not only improved the representation ability of human behavior characteristics, but also improved the generalization ability, and achieved better recognition effect than the existing model. In addition, this paper also compares the results of behavior recognition on subsets of different joint points, and finds that spatial structure division have better effects.


2021 ◽  
Vol 30 ◽  
pp. 7760-7775
Author(s):  
Maosen Li ◽  
Siheng Chen ◽  
Yangheng Zhao ◽  
Ya Zhang ◽  
Yanfeng Wang ◽  
...  

2021 ◽  
pp. 125-134
Author(s):  
Ahmed El-Gazzar ◽  
Rajat Mani Thomas ◽  
Guido van Wingen

2021 ◽  
Author(s):  
Yi-Fan Li ◽  
Bo Dong ◽  
Latifur Khan ◽  
Bhavani Thuraisingham ◽  
Patrick T. Brandt ◽  
...  

2019 ◽  
Vol 23 (2) ◽  
pp. 905-926
Author(s):  
Junhua Fang ◽  
Jiafeng Ding ◽  
Pengpeng Zhao ◽  
Jiajie Xu ◽  
An Liu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document