PhonemeNet: A Transformer Pipeline for Text-Driven Facial Animation
A transformer pipeline for text-driven facial animation exploiting phoneme-level speech structure, achieving real-time performance and best-in-class lip synchronization accuracy. …
A dynamic cognitive framework for narrative agents in interactive storytelling, combining BDI representations with LLM generation to balance story coherence with player agency. …
BEE is a modular cognitive framework for conversational agents featuring belief management, value alignment, transparent reasoning, and extensibility. Best Paper Honorable Mention …
A joint framework modeling personality and emotion for personality-consistent conversational agents, using contrastive learning to decouple emotion from semantic content. IVA 2025. …
A full-pipeline platform for interactive AI character experiences, demonstrated through Digital Einstein and deployed at scientific conferences, technology events, and public …
SIGGRAPH Asia 2024 Emerging Technologies demonstration describing the physical installation and AI integration of Digital Einstein at the Tokyo venue.
EmoSpaceTime decouples emotion and content in 3D speech animation through contrastive learning, enabling fine-grained control over emotional expressivity independent of spoken …
Systematic study of multimodal emotion recognition in natural human-chatbot interactions, evaluating text, acoustic, and behavioral signal fusion strategies. ICMI 2024.
Dynamic personality infusion for chatbots: modulating expressed Big Five personality traits at inference time to improve user engagement and interaction quality. CUI 2024.
A large-scale empirical characterization of the personality dimensions GPT-3 expresses during human-chatbot interaction, using Big Five psychometrics. Published in ACM IMWUT 2024.