The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

G06T 13/40 (2011.01); G06T 13/20 (2011.01); G06T 13/80 (2011.01); G06V 10/774 (2022.01); G06V 10/82 (2022.01); G06V 10/98 (2022.01); G06V 40/20 (2022.01); G10L 21/06 (2013.01);

U.S. Cl.

CPC ...

G06T 13/40 (2013.01); G06T 13/205 (2013.01); G06T 13/80 (2013.01); G06V 10/774 (2022.01); G06V 10/82 (2022.01); G06V 10/98 (2022.01); G06V 40/20 (2022.01); G10L 21/06 (2013.01);

Abstract

Embodiments of the present disclosure provide a method for audio-driven character lip sync, a model for audio-driven character lip sync, and a training method therefor. A target dynamic image is obtained by acquiring a character image of a target character and speech for generating a target dynamic image, processing the character image and the speech as image-audio data that may be trained, respectively, and mixing the image-audio data with auxiliary data for training. When a large amount of sample data needs to be obtained for training in different scenarios, a video when another character speaks is used as an auxiliary video for processing, so as to obtain the auxiliary data. The auxiliary data, which replaces non-general sample data, and other data are input into a model in a preset ratio for training. The auxiliary data may improve a process of training a synthetic lip sync action of the model, so that there are no parts unrelated to the synthetic lip sync action during the training process. In this way, a problem that a large amount of sample data is required during the training process is resolved.

Find Patent Forward Citations