ExpNet: Landmark-Free, Deep, 3D Facial Expressions (2018-02-02)

TL;DR

ExpNet produces expression coefficients that discriminate between facial emotions better than those obtained with state-of-the-art facial landmark detectors, and it is more robust to scale changes than landmark detectors.

Abstract

We describe a deep-learning-based method for estimating 3D facial expression coefficients. Unlike previous work, our process does not rely on facial landmark detection methods as a proxy step. Recent methods have shown that a CNN can be trained to regress accurate and discriminative 3D morphable model (3DMM) representations directly from image intensities. By foregoing landmark detection, these methods were able to estimate shapes for occluded faces appearing in unprecedented viewing conditions. We build on those methods by showing that facial expressions can also be estimated by a robust, deep, landmark-free approach. Our ExpNet CNN is applied directly to the intensities of a face image and regresses a 29D vector of 3D expression coefficients. We propose a unique method for collecting data to train our network, leveraging the robustness of deep networks to training label noise. We further offer a novel means of evaluating the accuracy of estimated expression coefficients: by measuring how well they capture facial emotions on the CK+ and EmotiW-17 emotion recognition benchmarks. We show that our ExpNet produces expression coefficients which better discriminate between facial emotions than those obtained using state-of-the-art facial landmark detectors. Moreover, this advantage grows as image scales drop, demonstrating that our ExpNet is more robust to scale changes than landmark detectors. Finally, our ExpNet is orders of magnitude faster than its alternatives.
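To make the role of the 29D expression vector concrete, here is a minimal sketch, assuming the usual linear 3DMM formulation in which expression coefficients weight a set of per-vertex deformation components added to a neutral shape. The abstract does not give this formula or any code, so the function names (`apply_expression`, `nearest_emotion`), the toy 4-vertex mesh, the random basis, and the 1-nearest-neighbour classifier are all hypothetical stand-ins chosen for illustration. The classifier only illustrates the idea of judging coefficients by how well they separate emotions; it is not claimed to be the protocol used on CK+ or EmotiW-17.

```python
import random

EXPR_DIM = 29  # size of the expression vector stated in the abstract


def apply_expression(neutral_shape, expr_basis, expr_coeffs):
    """Return neutral_shape + sum_k expr_coeffs[k] * expr_basis[k].

    neutral_shape: flat list of 3*N vertex coordinates (x1, y1, z1, ...).
    expr_basis:    one list per coefficient, each as long as neutral_shape.
    expr_coeffs:   one float per basis vector, e.g. a regressor's output.
    """
    if len(expr_coeffs) != len(expr_basis):
        raise ValueError("one coefficient is needed per basis vector")
    deformed = list(neutral_shape)
    for coeff, component in zip(expr_coeffs, expr_basis):
        if len(component) != len(neutral_shape):
            raise ValueError("basis vector and shape differ in length")
        for i, offset in enumerate(component):
            deformed[i] += coeff * offset
    return deformed


def nearest_emotion(query, labelled_coeffs):
    """Label of the closest gallery vector under squared Euclidean distance.

    labelled_coeffs: list of (coefficient_vector, emotion_label) pairs.
    This is an assumed stand-in classifier, not the paper's protocol.
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(labelled_coeffs, key=lambda item: sq_dist(query, item[0]))[1]


# Toy data (hypothetical): a 4-vertex "face" and a random expression basis.
rng = random.Random(0)
num_coords = 3 * 4
neutral = [rng.uniform(-1.0, 1.0) for _ in range(num_coords)]
basis = [[rng.gauss(0.0, 0.1) for _ in range(num_coords)]
         for _ in range(EXPR_DIM)]

zero_expr = [0.0] * EXPR_DIM       # no expression: shape stays neutral
smile_like = [0.0] * EXPR_DIM      # made-up vector activating component 0
smile_like[0] = 2.0

deformed = apply_expression(neutral, basis, smile_like)

# Classify a probe vector against a two-entry labelled gallery.
gallery = [(zero_expr, "neutral"), (smile_like, "happy")]
probe = [0.0] * EXPR_DIM
probe[0] = 1.7
label = nearest_emotion(probe, gallery)
```

In this sketch the expression vector is the only per-image quantity that changes the mesh, which is why a regressor that outputs it directly from pixel intensities would not need landmarks as an intermediate step.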

Authors

G. Medioni

R. Nevatia

I. Masi

