Audiovisual Facial Action Unit Recognition using Feature Level Fusion

Author(s):  
Zibo Meng ◽  
Shizhong Han ◽  
Min Chen ◽  
Yan Tong

Recognizing facial actions is challenging, especially when they are accompanied by speech. Instead of relying on information from the visual channel alone, this work exploits both the visual and audio channels to recognize speech-related facial action units (AUs). Two feature-level fusion methods are proposed. The first is based on hand-crafted visual features; the second uses visual features learned by a deep convolutional neural network (CNN). In both methods, features are extracted independently from the visual and audio channels and then aligned to compensate for the difference in time scales and the time shift between the two signals. The temporally aligned features are integrated via feature-level fusion for AU recognition. Experimental results on a new audiovisual AU-coded dataset demonstrate that both fusion methods outperform their visual-only counterparts in recognizing speech-related AUs. The improvement is even more pronounced when the facial images are occluded, since occlusions do not affect the audio channel.
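
The fusion step described above can be illustrated with a short sketch: audio features (e.g., MFCCs) are resampled onto the video frame timeline, concatenated with the per-frame visual features, and fed to a per-AU classifier. The feature choices, frame rates, and the linear-SVM classifier below are illustrative assumptions, not the authors' exact pipeline.

import numpy as np
from sklearn.svm import LinearSVC

def align_to_video(audio_feats, audio_rate, n_video_frames, video_rate, shift_s=0.0):
    # Resample per-frame audio features onto the video timeline, with an optional
    # constant time shift between the two signals (both are assumptions here).
    video_t = np.arange(n_video_frames) / video_rate + shift_s
    audio_t = np.arange(len(audio_feats)) / audio_rate
    return np.stack([np.interp(video_t, audio_t, audio_feats[:, d])
                     for d in range(audio_feats.shape[1])], axis=1)

def fuse_and_train(visual_feats, audio_feats, labels,
                   video_rate=30.0, audio_rate=100.0):
    # Feature-level fusion: concatenate the aligned audio features with the visual
    # features (hand-crafted or CNN-learned), then train a binary per-AU classifier.
    aligned_audio = align_to_video(audio_feats, audio_rate, len(visual_feats), video_rate)
    fused = np.hstack([visual_feats, aligned_audio])
    return LinearSVC().fit(fused, labels)

In this sketch, robustness to facial occlusion comes only from the audio half of the fused vector; the paper's specific feature extractors and alignment procedure are not reproduced here.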


2010 ◽  
Vol 2 (1) ◽  
pp. 28-38 ◽  
Author(s):  
K. Kannan ◽  
S. Arumuga Perumal ◽  
K. Arulmozhi

2020 ◽  
Vol 12 (6) ◽  
pp. 1009
Author(s):  
Xiaoxiao Feng ◽  
Luxiao He ◽  
Qimin Cheng ◽  
Xiaoyi Long ◽  
Yuxin Yuan

Hyperspectral (HS) images usually have high spectral resolution but low spatial resolution (LSR), whereas multispectral (MS) images have high spatial resolution (HSR) but low spectral resolution. HS–MS image fusion can combine the advantages of both, which benefits accurate feature classification. However, in real cases heterogeneous sensors introduce temporal differences between the LSR-HS and HSR-MS images, so classical fusion methods cannot produce effective results. To address this problem, we present a fusion method based on spectral unmixing and an image mask. Considering the difference between the two images, we first extract the endmembers and their corresponding positions from the invariant regions of the LSR-HS image. We then obtain the endmembers of the HSR-MS image, based on the principle that the HSR-MS and LSR-HS images are, respectively, the spectral and spatial degradations of the HSR-HS image. The fused image is obtained from the two resulting matrices. A series of experiments on simulated and real datasets substantiates the effectiveness of our method, both quantitatively and visually.
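
As a rough illustration of the unmixing-based fusion idea, the sketch below follows the linear mixing model: endmembers extracted from the LSR-HS image are spectrally degraded by a known spectral response matrix, per-pixel abundances are estimated from the HSR-MS image, and the fused HSR-HS image is reconstructed as endmembers times abundances. The endmember extraction step, the invariant-region mask, and the exact solver used in the paper are not reproduced; the nonnegative least-squares step and all variable names are assumptions.

import numpy as np
from scipy.optimize import nnls

def estimate_abundances(pixels, endmembers):
    # Per-pixel nonnegative least squares: solve pixels ~ endmembers @ abundances.
    # pixels: (n_bands, n_pixels); endmembers: (n_bands, n_endmembers).
    abundances = np.zeros((endmembers.shape[1], pixels.shape[1]))
    for i in range(pixels.shape[1]):
        abundances[:, i], _ = nnls(endmembers, pixels[:, i])
    return abundances

def fuse_hs_ms(hs_endmembers, ms_image, spectral_response):
    # hs_endmembers:     (n_hs_bands, n_endmembers), extracted from the LSR-HS image
    # ms_image:          (n_ms_bands, n_hr_pixels), flattened HSR-MS pixels
    # spectral_response: (n_ms_bands, n_hs_bands), maps HS bands to MS bands
    ms_endmembers = spectral_response @ hs_endmembers      # spectrally degraded endmembers
    hr_abundances = estimate_abundances(ms_image, ms_endmembers)
    return hs_endmembers @ hr_abundances                   # fused (n_hs_bands, n_hr_pixels) image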

