Real-Time Multi-View Face Detection and Pose Estimation in Video Stream

Author(s):
Yan Wang, Yanghua Liu, Linmi Tao, Guangyou Xu

Author(s):
Shicun Chen, Yong Zhang, Baocai Yin, Boyue Wang

Abstract: Nowadays, face detection and head pose estimation have many applications, such as face recognition, gaze estimation, and attention modeling. These two tasks are usually handled by two separate models. However, the head pose estimation model typically depends on a region of interest (ROI) detected in advance, which means a face detector must run serially before it. Even the lightest face detector slows down the overall forward inference and prevents real-time performance when estimating the head pose of multiple people. Since both face detection and head pose estimation rely on facial features, a shared face feature map can serve both tasks. In this paper, a multi-task learning model is proposed that solves both problems simultaneously. We directly detect the location of the center point of the face bounding box; at this location, we regress the size of the bounding box and the head pose. We evaluate our model's performance on the AFLW dataset. The proposed model is highly competitive with multi-stage face attribute analysis models, and it achieves real-time performance.
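The scheme described in the abstract (predict a face-center heatmap, then read the box size and head pose off the shared feature map at each detected center) can be sketched as a single network with one backbone and three small heads. The following is a minimal, illustrative PyTorch sketch; the layer sizes, head names, and backbone are assumptions for exposition, not the authors' published architecture.

import torch
import torch.nn as nn

class MultiTaskFaceNet(nn.Module):
    """Shared backbone with per-pixel heads for center, box size, and pose."""
    def __init__(self, feat_ch=64):
        super().__init__()
        # Stand-in backbone; any extractor producing a dense feature map works.
        self.backbone = nn.Sequential(
            nn.Conv2d(3, feat_ch, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(feat_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.center_head = nn.Conv2d(feat_ch, 1, 1)  # face-center heatmap
        self.size_head = nn.Conv2d(feat_ch, 2, 1)    # box width and height
        self.pose_head = nn.Conv2d(feat_ch, 3, 1)    # yaw, pitch, roll

    def forward(self, x):
        f = self.backbone(x)  # one shared feature map for all three tasks
        return {
            "center": torch.sigmoid(self.center_head(f)),
            "size": self.size_head(f),
            "pose": self.pose_head(f),
        }

model = MultiTaskFaceNet()
out = model(torch.randn(1, 3, 256, 256))
# Peaks in out["center"] locate faces; out["size"] and out["pose"] are read
# at those same spatial positions, so one forward pass serves both tasks.

Because all heads share one backbone pass, adding pose estimation costs only a 1x1 convolution per pixel rather than a second serial network, which is the basis of the real-time claim.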


2021, Vol. 2021 (1)
Author(s):
Samy Bakheet, Ayoub Al-Hamadi

Abstract: Robust vision-based hand pose estimation is highly sought after but remains a challenging task, partly due to the inherent difficulty of self-occlusion among the fingers. In this paper, an innovative framework for real-time static hand gesture recognition is introduced, based on an optimized shape representation built from multiple shape cues. The framework incorporates a dedicated module for hand pose estimation from depth map data: the hand silhouette is first extracted from the highly detailed and accurate depth map captured by a time-of-flight (ToF) depth sensor. A hybrid multi-modal descriptor that integrates multiple affine-invariant boundary-based and region-based features is then computed from the hand silhouette to obtain a reliable and representative description of individual gestures. Finally, an ensemble of one-vs.-all support vector machines (SVMs) is independently trained on each of these learned feature representations to perform gesture classification. When evaluated on a publicly available dataset containing a relatively large and diverse collection of egocentric hand gestures, the approach yields encouraging results that compare very favorably with those reported in the literature, while maintaining real-time operation.
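The silhouette-to-descriptor-to-SVM pipeline above can be illustrated with a short OpenCV/scikit-learn sketch. The feature functions below are simple stand-ins (Hu moments as a region cue, a centroid-distance histogram as a boundary cue), chosen for brevity; they are assumptions for illustration, not the paper's actual affine-invariant descriptor.

import numpy as np
import cv2
from sklearn.svm import SVC
from sklearn.multiclass import OneVsRestClassifier

def region_features(silhouette):
    """Region cue: Hu moments of the binary silhouette (stand-in feature)."""
    hu = cv2.HuMoments(cv2.moments(silhouette.astype(np.uint8))).ravel()
    # Log-scale the moments for numerical stability, preserving sign.
    return -np.sign(hu) * np.log10(np.abs(hu) + 1e-12)

def boundary_features(silhouette, n_bins=16):
    """Boundary cue: histogram of contour-point distances from the centroid."""
    contours, _ = cv2.findContours(silhouette.astype(np.uint8),
                                   cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)
    pts = max(contours, key=cv2.contourArea).reshape(-1, 2).astype(float)
    d = np.linalg.norm(pts - pts.mean(axis=0), axis=1)
    hist, _ = np.histogram(d / (d.max() + 1e-12), bins=n_bins, range=(0, 1))
    return hist / (hist.sum() + 1e-12)

def describe(silhouette):
    """Hybrid descriptor: concatenation of region and boundary cues."""
    return np.concatenate([region_features(silhouette),
                           boundary_features(silhouette)])

# One-vs.-all SVM ensemble over the hybrid descriptor, as in the abstract.
clf = OneVsRestClassifier(SVC(kernel="rbf", gamma="scale"))
# Training (hand silhouettes and labels assumed available):
#   X = np.stack([describe(s) for s in train_silhouettes])
#   clf.fit(X, train_labels)
#   pred = clf.predict(np.stack([describe(s) for s in test_silhouettes]))

Training one binary SVM per gesture class (one-vs.-all) keeps each decision boundary simple; at test time the ensemble assigns the class whose SVM yields the highest decision score.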

