This task deals with lip-syncing a video or an image to a desired target speech. Approaches to this task work only for a specific identity or a limited set of identities, languages, and speakers/voices. See also: Unconstrained lip-synchronization - https://paperswithcode.com/task/lip-sync
Proposes a conditional video generation network in which the audio input conditions a recurrent adversarial network, incorporating temporal dependency to produce smooth transitions in lip and facial movement.
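A minimal PyTorch sketch of this idea, not the paper's code: per-frame audio features are fed as a condition to a recurrent generator, whose hidden state carries temporal context between frames and thus smooths the generated motion. All module names and layer sizes below are illustrative assumptions.

    # Sketch: audio-conditioned recurrent generator (illustrative sizes).
    import torch
    import torch.nn as nn

    class AudioConditionedGenerator(nn.Module):
        def __init__(self, audio_dim=128, noise_dim=64, hidden_dim=256, frame_dim=64 * 64):
            super().__init__()
            self.noise_dim = noise_dim
            # Recurrent core consumes per-frame audio condition plus noise.
            self.rnn = nn.GRU(audio_dim + noise_dim, hidden_dim, batch_first=True)
            # Decoder maps each hidden state to a flattened grayscale frame.
            self.decoder = nn.Sequential(nn.Linear(hidden_dim, frame_dim), nn.Tanh())

        def forward(self, audio_feats):
            # audio_feats: (batch, num_frames, audio_dim)
            b, t, _ = audio_feats.shape
            z = torch.randn(b, t, self.noise_dim, device=audio_feats.device)
            h, _ = self.rnn(torch.cat([audio_feats, z], dim=-1))
            frames = self.decoder(h)          # (batch, num_frames, frame_dim)
            return frames.view(b, t, 64, 64)  # (batch, num_frames, H, W)

    gen = AudioConditionedGenerator()
    fake_frames = gen(torch.randn(2, 25, 128))  # 2 clips, 25 frames each
    print(fake_frames.shape)                    # torch.Size([2, 25, 64, 64])

In an adversarial setup, a discriminator would score these frame sequences against real talking-face clips; that part is omitted here.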
This work presents an audio-to-video method for automating speech-to-lips alignment, stretching and compressing the audio signal to match the lip movements based on deep audio-visual features.
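The alignment idea can be illustrated with classic dynamic time warping over feature sequences. This is a substitute sketch, not the paper's deep audio-visual method: the per-frame audio and visual features are assumed to come from encoders defined elsewhere, and the warping path indicates where the audio must be stretched or compressed to match the lips.

    # Sketch: DTW alignment between audio and visual feature sequences.
    import numpy as np

    def dtw_path(audio_feats, visual_feats):
        # audio_feats: (Ta, d), visual_feats: (Tv, d)
        ta, tv = len(audio_feats), len(visual_feats)
        cost = np.full((ta + 1, tv + 1), np.inf)
        cost[0, 0] = 0.0
        for i in range(1, ta + 1):
            for j in range(1, tv + 1):
                d = np.linalg.norm(audio_feats[i - 1] - visual_feats[j - 1])
                cost[i, j] = d + min(cost[i - 1, j],      # audio stretched
                                     cost[i, j - 1],      # audio compressed
                                     cost[i - 1, j - 1])  # frames advance together
        # Backtrack to recover the warping path.
        path, i, j = [], ta, tv
        while i > 0 and j > 0:
            path.append((i - 1, j - 1))
            i, j = min([(i - 1, j), (i, j - 1), (i - 1, j - 1)],
                       key=lambda ij: cost[ij])
        return path[::-1]

    path = dtw_path(np.random.randn(40, 32), np.random.randn(30, 32))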
This paper presents the first publicly available set of Deepfake videos generated from videos in the VidTIMIT database, and demonstrates that GAN-generated Deepfake videos are challenging for both face recognition systems and existing detection methods.
A deep learning based interactive system that automatically generates live lip sync for layered 2D characters, using a Long Short-Term Memory (LSTM) model that takes streaming audio as input and produces viseme sequences with under 200 ms of latency.
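A minimal sketch of how such a model could work, not the authors' released system: a unidirectional LSTM maps per-frame audio features to viseme class scores, and carrying the hidden state across small chunks is what keeps streaming latency low. The feature dimension and viseme inventory below are assumptions.

    # Sketch: streaming LSTM viseme predictor (illustrative dimensions).
    import torch
    import torch.nn as nn

    class VisemePredictor(nn.Module):
        def __init__(self, feat_dim=26, hidden_dim=128, num_visemes=12):
            super().__init__()
            self.lstm = nn.LSTM(feat_dim, hidden_dim, batch_first=True)
            self.head = nn.Linear(hidden_dim, num_visemes)

        def forward(self, feats, state=None):
            # feats: (batch, frames, feat_dim); state carries context between chunks
            out, state = self.lstm(feats, state)
            return self.head(out), state  # per-frame viseme logits

    model = VisemePredictor()
    state = None
    # Simulate streaming: feed small chunks, reusing the hidden state.
    for _ in range(5):
        chunk = torch.randn(1, 4, 26)       # 4 new audio frames
        logits, state = model(chunk, state)
        visemes = logits.argmax(dim=-1)     # predicted viseme per frame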
The proposed detection of deepfake videos, based on the dissimilarity between the audio and visual modalities and termed the Modality Dissonance Score (MDS), outperforms the state of the art by up to 7%.
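An illustrative sketch of the general idea behind a modality-dissonance style score, not the authors' exact formulation: embed temporally aligned audio and video chunks into a shared space and average the per-chunk distance. Real videos should score low and lip-synced fakes high; the encoders producing the embeddings are assumed to exist elsewhere.

    # Sketch: audio-visual dissonance score over aligned chunk embeddings.
    import torch
    import torch.nn.functional as F

    def modality_dissonance(audio_emb, video_emb):
        # audio_emb, video_emb: (num_chunks, dim) embeddings of aligned chunks
        audio_emb = F.normalize(audio_emb, dim=-1)
        video_emb = F.normalize(video_emb, dim=-1)
        # Per-chunk Euclidean distance, averaged over the clip.
        return (audio_emb - video_emb).norm(dim=-1).mean()

    # Toy usage with random "embeddings" standing in for real encoders.
    score = modality_dissonance(torch.randn(10, 256), torch.randn(10, 256))
    print(float(score))  # higher = more audio-visual mismatch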
The first architecture that generates both audio and synchronized photo-realistic lip-sync video from arbitrary new text is presented; the authors claim it is the first to be composed of fully trainable neural modules.