Latest from Apple researchers: Deep learning approach for driving animated faces using both acoustic and visual information

For project and code or API requests: https://www.catalyzex.com/paper/arxiv:2005.13616

To ensure that the model exploits both modalities during training, batches are generated that contain audio-only, video-only, and audiovisual input features

Posted by uniqueone
,