awesome-talking-head-generation

Papers for Talking Head Generation, released codes collections.

This repo mainly focus on the image-driven talking head generation task, but any addition or bug about other domain talking head generation,please open an issue, pull requests or e-mail me by fhongac@cse.ust.hk. If you are researching in talking head generation task, you can add my discord account: Fa-Ting Hong#6563 for better communication and cooperations.

Related Group

MMLab@NTU

Datasets

VoxCeleb1 [Download link].
VoxCeleb2 [Download link].
Faceforensics++ [Download link].
CelebV [Download link].
TalkingHead-1KH [Download link].
LRW (Lip Reading in the Wild) [Download link].
MEAD [Download link].
CelebV-HQ [Download link].

Image-driven

Audio-driven

2016

[LRW] Lip Reading in the Wild, ACCV 2016.

2017

[Synthesizing-Obama] Synthesizing Obama: Learning Lip Sync From Audio, SIGGRAPH 2017. [Project].
[You-Said-That?] You Said That?: Synthesising Talking Faces From Audio, IJCV 2019. [Code].
Audio-Driven Facial Animation by Joint End-to-End Learning of Pose and Emotion, SIGGRAPH 2017.
A Deep Learning Approach for Generalized Speech Animation, SIGGRAPH 2017.

2018

Lip Movements Generation at a Glance, ECCV 2018. [Code].
[VisemeNet] VisemeNet: Audio-Driven Animator-Centric Speech Animation, SIGGRAPH 2018.

2019

[DAVS] Talking Face Generation by Adversarially Disentangled Audio-Visual Representation, AAAI 2019. [Code].
[ATVGnet] Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss, CVPR 2019. [Code]

2020

[Wav2Lip] A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild, ACM Multimedia 2020. [Code], [Project].
[RhythmicHead]Talking-head Generation with Rhythmic Head Motion, ECCV 2020. [Code].
[MakeItTalk] MakeItTalk: Speaker-Aware Talking-Head Animation, SIGGRAPH Asia 2020. [Code], [Project].
Neural Voice Puppetry: Audio-driven Facial Reenactment, ECCV 2020. [Project].
[MEAD] MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation, ECCV 2020. [Code], [Project].
Realistic Speech-Driven Facial Animation with GANs, IJCV 2020.

2021

[PC-AVS] Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation, CVPR 2021. [Code], [Project].
[IATS]Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis,ACM Multimedia 2021..
[EVP] Audio-Driven Emotional Video Portraits, CVPR 2021. [Code]
[FAU] Talking Head Generation with Audio and Speech Related Facial Action Units, arxiv 2021.
[Speech2Talking-Face] Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation, IJCAI 2021.
[IATS] Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis, ACM MM 2021.
[LSP] Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation, ACM TOG 2021.
[Audio2head] Audio2head: Audio-driven one-shot talking-head generation with natural head motion, ArXiv 2021.

2022

[GC-AVT] Expressive Talking Head Generation with Granular Audio-Visual Control , CVPR 2022.
Talking Face Generation with Multilingual TTS, CVPR 2022. [Demo Track].
[EAMM] EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model, SIGGRAPH 2022.
[SPACEx] SPACEx 🚀: Speech-driven Portrait Animation with Controllable Expression, Arxiv 2022. [Project]

2023

🔥Diffusion🔥 1. [Diffused Heads] Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation, Arxiv 2023. [Project]

🔥Diffusion🔥 2. [DiffTalk] DiffTalk: Crafting Diffusion Models for Generalized Talking Head Synthesis, Arxiv 2023. [Project]

Nerf & 3D

2021

[DFA-NeRF] DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering, arxiv, 2021.
[NerFACE] NerFACE: Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction, CVPR 2021 Oral. [Code], [Project]

2022

[SSP-NeRFF] Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation, arxiv, 2022.
[HeadNeRF] HeadNeRF: A Real-time NeRF-based Parametric Head Model, CVPR 2022. [Code], [Project]
[IMavatar] I M Avatar: Implicit Morphable Head Avatars from Videos, CVPR 2022. [Code]
[ROME] Realistic One-shot Mesh-based Head Avatars, ECCV 2022.
[FNeVR] FNeVR: Neural Volume Rendering for Face Animation, Arxiv 2022. [Code]
[3DFaceShop] 3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation, Arxiv 2022. [Code],[Project]
[Next3D] Generative Neural Texture Rasterization for 3D-Aware Head Avatars, Arxiv 2022.[Project]
[NeRFInvertor] NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation, Arxiv 2022.

Parameter-Based

2020

[DiscoFaceGAN ] Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning , CVPR 2020 Oral. [Code].

Survey

2020

What comprises a good talking-head video generation?: A Survey and Benchmark.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

readme.md

Repository files navigation

awesome-talking-head-generation

Related Group

Datasets

Image-driven

2016

2018

2019

2020

2021

2022

2023

Audio-driven

2016

2017

2018

2019

2020

2021

2022

2023

Nerf & 3D

2021

2022

Parameter-Based

2020

Survey

2020

About

Releases

Packages

wangsuzhen/awesome-talking-head-generation

Folders and files

Latest commit

History

readme.md

readme.md

Repository files navigation

awesome-talking-head-generation

Related Group

Datasets

Image-driven

2016

2018

2019

2020

2021

2022

2023

Audio-driven

2016

2017

2018

2019

2020

2021

2022

2023

Nerf & 3D

2021

2022

Parameter-Based

2020

Survey

2020

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages