Microsoft VASA tech can create realistic deepfakes using a single photo and one audio track
TechSpot

The Visual Affective Skills Animator, or VASA, is a machine-learning framework that analyzes a photo of a face and animates it to match a voice recording, syncing lip and mouth movements to the audio. It also simulates facial expressions, head movements, and even unseen body movements.
