PhonemeNet: A Transformer Pipeline for Text-Driven Facial Animation
A transformer pipeline for text-driven facial animation exploiting phoneme-level speech structure, achieving real-time performance and best-in-class lip synchronization accuracy. …
p.-witzig

