
G06T 13/40 (2011.01); G06N 3/0455 (2023.01); G06T 13/20 (2011.01); G06V 10/80 (2022.01); G06V 40/16 (2022.01); G10L 25/63 (2013.01);

U.S. Cl.

CPC ...

G06T 13/40 (2013.01); G06N 3/0455 (2023.01); G06T 13/205 (2013.01); G06V 10/806 (2022.01); G06V 40/171 (2022.01); G10L 25/63 (2013.01);

Abstract

This disclosure relates generally to methods and systems for emotion-controllable generalized talking face generation of an arbitrary face image. Most conventional techniques for realistic talking face generation may not control the emotion on the face effectively and have limited scope for generalization to an arbitrary unknown target face. The present disclosure proposes a graph convolutional network that uses speech content features along with an independent emotion input to generate emotion- and speech-induced motion on a facial geometry-aware landmark representation. The facial geometry-aware landmark representation is further used by an optical flow-guided texture generation network for producing the texture. A two-branch optical flow-guided texture generation network with motion and texture branches is designed to consider the motion and texture content independently. The optical flow-guided texture generation network then renders emotional talking face animation from a single image of any arbitrary target face.
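As a rough illustration of the landmark stage described in the abstract, the sketch below applies a generic graph convolution to facial landmarks conditioned on speech and emotion vectors. It is not the patented network. The layer form (a symmetrically normalized adjacency, in the style of Kipf and Welling) is an assumption swapped in for illustration, as are all function names, feature sizes, and the choice to tile the conditioning vectors across every landmark node.

```python
import numpy as np

def normalized_adjacency(edges, num_nodes):
    """Build D^-1/2 (A + I) D^-1/2 for an undirected landmark graph."""
    a = np.eye(num_nodes)              # self-loops
    for i, j in edges:
        a[i, j] = 1.0
        a[j, i] = 1.0
    d_inv_sqrt = 1.0 / np.sqrt(a.sum(axis=1))
    return a * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_layer(a_hat, h, w, activate=True):
    """One graph convolution: aggregate neighbours, then project."""
    out = a_hat @ h @ w
    return np.maximum(out, 0.0) if activate else out

def predict_landmark_motion(landmarks, edges, speech_feat, emotion_feat, w1, w2):
    """Hypothetical two-layer GCN that displaces 2-D landmarks.

    landmarks:    (N, 2) neutral landmark coordinates
    speech_feat:  (S,) speech content feature for one frame (assumed given)
    emotion_feat: (E,) independent emotion input, e.g. one-hot (assumed)
    w1, w2:       weights of shape (2 + S + E, H) and (H, 2)
    """
    n = landmarks.shape[0]
    a_hat = normalized_adjacency(edges, n)
    # Assumption: every node sees the same speech and emotion condition.
    cond = np.concatenate([speech_feat, emotion_feat])
    h = np.concatenate([landmarks, np.tile(cond, (n, 1))], axis=1)
    h = gcn_layer(a_hat, h, w1)
    delta = gcn_layer(a_hat, h, w2, activate=False)
    return landmarks + delta           # displaced landmarks
```

In this sketch the emotion vector enters as a separate input from the speech feature, mirroring the independent emotion control the abstract describes. The displaced landmarks would then be handed to a texture generation stage, which is not modeled here.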
