{"ID":2879457,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2508.16401","arxiv_id":"2508.16401","title":"Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars","abstract":"Audio-driven facial animation presents an effective solution for animating digital avatars. In this paper, we detail the technical aspects of NVIDIA Audio2Face-3D, including data acquisition, network architecture, retargeting methodology, evaluation metrics, and use cases. The Audio2Face-3D system enables real-time interaction between human users and interactive avatars, facilitating facial animation authoring for game characters. To assist digital avatar creators and game developers in generating realistic facial animations, we have open-sourced Audio2Face-3D networks, SDK, training framework, and example dataset.","short_abstract":"Audio-driven facial animation presents an effective solution for animating digital avatars. In this paper, we detail the technical aspects of NVIDIA Audio2Face-3D, including data acquisition, network architecture, retargeting methodology, evaluation metrics, and use cases. The Audio2Face-3D system enables real-time interac...","url_abs":"https://arxiv.org/abs/2508.16401","url_pdf":"https://arxiv.org/pdf/2508.16401v1","authors":"[\"NVIDIA\",\"Chaeyeon Chung\",\"Ilya Fedorov\",\"Michael Huang\",\"Aleksey Karmanov\",\"Dmitry Korobchenko\",\"Roger Ribera\",\"Yeongho Seol\"]","published":"2025-08-22T14:02:24Z","proceeding":"cs.GR","tasks":"[\"cs.GR\",\"cs.HC\",\"cs.LG\",\"cs.SD\",\"eess.AS\"]","methods":"[]","has_code":false}