Semi-Supervised FaceGAN for Face-Age Progression and Regression with Synthesized Paired Images

Quang T. M. Pham; Janghoon Yang; Jitae Shin

doi:10.3390/electronics9040603

Semi-Supervised FaceGAN for Face-Age Progression and Regression with Synthesized Paired Images

Electronics ◽

10.3390/electronics9040603 ◽

2020 ◽

Vol 9 (4) ◽

pp. 603

Author(s):

Quang T. M. Pham ◽

Janghoon Yang ◽

Jitae Shin

Keyword(s):

Real Data ◽

Similarity Score ◽

Generative Adversarial Network ◽

Age Progression ◽

Regression Methods ◽

Adversarial Network ◽

Face Images ◽

The Face ◽

Quantitative Results ◽

Human Faces

The performance of existing face age progression or regression methods is often limited by the lack of sufficient data to train the model. To deal with this problem, we introduce a novel framework that exploits synthesized images to improve the performance. A conditional generative adversarial network (GAN) is first developed to generate facial images with targeted ages. The semi-supervised GAN, called SS-FaceGAN, is proposed. This approach considers synthesized images with a target age and the face images from the real data so that age and identity features can be explicitly utilized in the objective function of the network. We analyze the performance of our method over previous studies qualitatively and quantitatively. The experimental results show that the SS-FaceGAN model can produce realistic human faces in terms of both identity preservation and age preservation with the quantitative results of a decent face detection rate of 97% and similarity score of 0.30 on average.

Download Full-text

3D-Aided Deep Pose-Invariant Face Recognition

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/165 ◽

2018 ◽

Cited By ~ 14

Author(s):

Jian Zhao ◽

Lin Xiong ◽

Yu Cheng ◽

Yi Cheng ◽

Jianshu Li ◽

...

Keyword(s):

Face Recognition ◽

Real Data ◽

Training Data ◽

Generative Adversarial Network ◽

High Data ◽

Adversarial Network ◽

The Arts ◽

The Face ◽

In The Wild ◽

Data Efficiency

Learning from synthetic faces, though perhaps appealing for high data efficiency, may not bring satisfactory performance due to the distribution discrepancy of the synthetic and real face images. To mitigate this gap, we propose a 3D-Aided Deep Pose-Invariant Face Recognition Model (3D-PIM), which automatically recovers realistic frontal faces from arbitrary poses through a 3D face model in a novel way. Specifically, 3D-PIM incorporates a simulator with the aid of a 3D Morphable Model (3D MM) to obtain shape and appearance prior for accelerating face normalization learning, requiring less training data. It further leverages a global-local Generative Adversarial Network (GAN) with multiple critical improvements as a refiner to enhance the realism of both global structures and local details of the face simulator’s output using unlabelled real data only, while preserving the identity information. Qualitative and quantitative experiments on both controlled and in-the-wild benchmarks clearly demonstrate superiority of the proposed model over state-of-the-arts.

Download Full-text

Modeling and evaluating beat gestures for social robots

Multimedia Tools and Applications ◽

10.1007/s11042-021-11289-x ◽

2021 ◽

Author(s):

Unai Zabala ◽

Igor Rodriguez ◽

José María Martínez-Otzeta ◽

Elena Lazkano

Keyword(s):

Motion Capture ◽

Motion Capture System ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Desirable Feature ◽

Quantitative Results ◽

Path Lengths ◽

End Effectors ◽

The One ◽

Made In

AbstractNatural gestures are a desirable feature for a humanoid robot, as they are presumed to elicit a more comfortable interaction in people. With this aim in mind, we present in this paper a system to develop a natural talking gesture generation behavior. A Generative Adversarial Network (GAN) produces novel beat gestures from the data captured from recordings of human talking. The data is obtained without the need for any kind of wearable, as a motion capture system properly estimates the position of the limbs/joints involved in human expressive talking behavior. After testing in a Pepper robot, it is shown that the system is able to generate natural gestures during large talking periods without becoming repetitive. This approach is computationally more demanding than previous work, therefore a comparison is made in order to evaluate the improvements. This comparison is made by calculating some common measures about the end effectors’ trajectories (jerk and path lengths) and complemented by the Fréchet Gesture Distance (FGD) that aims to measure the fidelity of the generated gestures with respect to the provided ones. Results show that the described system is able to learn natural gestures just by observation and improves the one developed with a simpler motion capture system. The quantitative results are sustained by questionnaire based human evaluation.

Download Full-text

Facial Age Progression using Conditional Generative Adversarial Network with Heritable Visual Features

10.1109/smc52423.2021.9659067 ◽

2021 ◽

Author(s):

Prarinya Siritanawan ◽

Hideki Ichikawa ◽

Kazunori Kotani

Keyword(s):

Visual Features ◽

Generative Adversarial Network ◽

Age Progression ◽

Adversarial Network ◽

Facial Age

Download Full-text

Matching Thermal to Visible Face Images Using a Semantic-Guided Generative Adversarial Network

2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019) ◽

10.1109/fg.2019.8756527 ◽

2019 ◽

Cited By ~ 3

Author(s):

Cunjian Chen ◽

Arun Ross

Keyword(s):

Generative Adversarial Network ◽

Adversarial Network ◽

Face Images

Download Full-text

1D conditional generative adversarial network for spectrum-to-spectrum translation of simulated chemical reflectance signatures

Journal of Spectral Imaging ◽

10.1255/jsi.2021.a2 ◽

2021 ◽

Author(s):

Cara Murphy ◽

John Kerekes

Keyword(s):

Classification Accuracy ◽

Domain Adaptation ◽

Real Data ◽

Training Set ◽

Generative Adversarial Network ◽

Average Classification Accuracy ◽

Adversarial Network ◽

Chemical Residues ◽

Reflectance Data

The classification of trace chemical residues through active spectroscopic sensing is challenging due to the lack of physics-based models that can accurately predict spectra. To overcome this challenge, we leveraged the field of domain adaptation to translate data from the simulated to the measured domain for training a classifier. We developed the first 1D conditional generative adversarial network (GAN) to perform spectrum-to-spectrum translation of reflectance signatures. We applied the 1D conditional GAN to a library of simulated spectra and quantified the improvement in classification accuracy on real data using the translated spectra for training the classifier. Using the GAN-translated library, the average classification accuracy increased from 0.622 to 0.723 on real chemical reflectance data, including data from chemicals not included in the GAN training set.

Download Full-text

RoboCoDraw: Robotic Avatar Drawing with GAN-Based Style Transfer and Time-Efficient Path Optimization

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i06.6609 ◽

2020 ◽

Vol 34 (06) ◽

pp. 10402-10409

Author(s):

Tianying Wang ◽

Wei Qi Toh ◽

Hao Zhang ◽

Xiuchao Sui ◽

Shaohua Li ◽

...

Keyword(s):

Face Image ◽

Path Optimization ◽

Robotic Arm ◽

Human Face ◽

Generative Adversarial Network ◽

Style Transfer ◽

Adversarial Network ◽

Face Images ◽

Drawing System ◽

Collaborative Robot

Robotic drawing has become increasingly popular as an entertainment and interactive tool. In this paper we present RoboCoDraw, a real-time collaborative robot-based drawing system that draws stylized human face sketches interactively in front of human users, by using the Generative Adversarial Network (GAN)-based style transfer and a Random-Key Genetic Algorithm (RKGA)-based path optimization. The proposed RoboCoDraw system takes a real human face image as input, converts it to a stylized avatar, then draws it with a robotic arm. A core component in this system is the AvatarGAN proposed by us, which generates a cartoon avatar face image from a real human face. AvatarGAN is trained with unpaired face and avatar images only and can generate avatar images of much better likeness with human face images in comparison with the vanilla CycleGAN. After the avatar image is generated, it is fed to a line extraction algorithm and converted to sketches. An RKGA-based path optimization algorithm is applied to find a time-efficient robotic drawing path to be executed by the robotic arm. We demonstrate the capability of RoboCoDraw on various face images using a lightweight, safe collaborative robot UR5.

Download Full-text

JANE: Jointly Adversarial Network Embedding

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/192 ◽

2020 ◽

Author(s):

Liang Yang ◽

Yuexue Wang ◽

Junhua Gu ◽

Chuan Wang ◽

Xiaochun Cao ◽

...

Keyword(s):

Link Prediction ◽

Real Data ◽

Semantic Space ◽

Network Embedding ◽

Generative Adversarial Network ◽

Adversarial Learning ◽

Adversarial Network ◽

Node Clustering ◽

Topology Information ◽

Embedding Methods

Motivated by the capability of Generative Adversarial Network on exploring the latent semantic space and capturing semantic variations in the data distribution, adversarial learning has been adopted in network embedding to improve the robustness. However, this important ability is lost in existing adversarially regularized network embedding methods, because their embedding results are directly compared to the samples drawn from perturbation (Gaussian) distribution without any rectification from real data. To overcome this vital issue, a novel Joint Adversarial Network Embedding (JANE) framework is proposed to jointly distinguish the real and fake combinations of the embeddings, topology information and node features. JANE contains three pluggable components, Embedding module, Generator module and Discriminator module. The overall objective function of JANE is defined in a min-max form, which can be optimized via alternating stochastic gradient. Extensive experiments demonstrate the remarkable superiority of the proposed JANE on link prediction (3% gains in both AUC and AP) and node clustering (5% gain in F1 score).

Download Full-text

Sound Field Reconstruction in Rooms with Deep Generative Models

INTER-NOISE and NOISE-CON Congress and Conference Proceedings ◽

10.3397/in-2021-1864 ◽

2021 ◽

Vol 263 (5) ◽

pp. 1527-1538

Author(s):

Xenofon Karakonstantis ◽

Efren Fernandez Grande

Keyword(s):

Plane Waves ◽

Sound Field ◽

Real Data ◽

Generative Models ◽

Random Wave ◽

Generative Adversarial Network ◽

Underlying Distribution ◽

Adversarial Network ◽

Reconstruction Methods ◽

Free Region

The characterization of Room Impulse Responses (RIR) over an extended region in a room by means of measurements requires dense spatial with many microphones. This can often become intractable and time consuming in practice. Well established reconstruction methods such as plane wave regression show that the sound field in a room can be reconstructed from sparsely distributed measurements. However, these reconstructions usually rely on assuming physical sparsity (i.e. few waves compose the sound field) or trait in the measured sound field, making the models less generalizable and problem specific. In this paper we introduce a method to reconstruct a sound field in an enclosure with the use of a Generative Adversarial Network (GAN), which s new variants of the data distributions that it is trained upon. The goal of the proposed GAN model is to estimate the underlying distribution of plane waves in any source free region, and map these distributions from a stochastic, latent representation. A GAN is trained on a large number of synthesized sound fields represented by a random wave field and then tested on both simulated and real data sets, of lightly damped and reverberant rooms.

Download Full-text

Photo-realistic face age progression/regression using a single generative adversarial network

Neurocomputing ◽

10.1016/j.neucom.2019.07.085 ◽

2019 ◽

Vol 366 ◽

pp. 295-304 ◽

Cited By ~ 3

Author(s):

Jiangfeng Zeng ◽

Xiao Ma ◽

Ke Zhou

Keyword(s):

Generative Adversarial Network ◽

Age Progression ◽

Adversarial Network

Download Full-text

Presentation Attack Face Image Generation Based on a Deep Generative Adversarial Network

Sensors ◽

10.3390/s20071810 ◽

2020 ◽

Vol 20 (7) ◽

pp. 1810

Author(s):

Dat Tien Nguyen ◽

Tuyen Danh Pham ◽

Ganbayar Batchuluun ◽

Kyoung Jun Noh ◽

Kang Ryoung Park

Keyword(s):

Recognition Task ◽

Recognition System ◽

Attack Detection ◽

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Face Images ◽

Problem Presentation ◽

Recognition Systems ◽

Public Datasets

Although face-based biometric recognition systems have been widely used in many applications, this type of recognition method is still vulnerable to presentation attacks, which use fake samples to deceive the recognition system. To overcome this problem, presentation attack detection (PAD) methods for face recognition systems (face-PAD), which aim to classify real and presentation attack face images before performing a recognition task, have been developed. However, the performance of PAD systems is limited and biased due to the lack of presentation attack images for training PAD systems. In this paper, we propose a method for artificially generating presentation attack face images by learning the characteristics of real and presentation attack images using a few captured images. As a result, our proposed method helps save time in collecting presentation attack samples for training PAD systems and possibly enhance the performance of PAD systems. Our study is the first attempt to generate PA face images for PAD system based on CycleGAN network, a deep-learning-based framework for image generation. In addition, we propose a new measurement method to evaluate the quality of generated PA images based on a face-PAD system. Through experiments with two public datasets (CASIA and Replay-mobile), we show that the generated face images can capture the characteristics of presentation attack images, making them usable as captured presentation attack samples for PAD system training.

Download Full-text