Facial Animation

PhonemeNet: A Transformer Pipeline for Text-Driven Facial Animation featured image

PhonemeNet: A Transformer Pipeline for Text-Driven Facial Animation

A transformer pipeline for text-driven facial animation exploiting phoneme-level speech structure, achieving real-time performance and best-in-class lip synchronization accuracy. …

p.-witzig

EmoSpaceTime: Decoupling Emotion and Content through Contrastive Learning for Expressive 3D Speech Animation

EmoSpaceTime decouples emotion and content in 3D speech animation through contrastive learning, enabling fine-grained control over emotional expressivity independent of spoken …

p.-witzig
Facial Animation Synthesis featured image

Facial Animation Synthesis

Efficient transformer-based pipelines for text-driven and speech-driven 3D facial animation — achieving real-time performance through phoneme-level speech modeling and contrastive …