The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 20, 2025

Filed:

Feb. 02, 2023
Applicant:

Tata Consultancy Services Limited, Mumbai, IN;

Inventors:

Sanjana Sinha, Kolkata, IN;

Sandika Biswas, Kolkata, IN;

Brojeshwar Bhowmick, Kolkata, IN;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06T 13/40 (2011.01); G06N 3/0455 (2023.01); G06T 13/20 (2011.01); G06V 10/80 (2022.01); G06V 40/16 (2022.01); G10L 25/63 (2013.01);
U.S. Cl.
CPC ...
G06T 13/40 (2013.01); G06N 3/0455 (2023.01); G06T 13/205 (2013.01); G06V 10/806 (2022.01); G06V 40/171 (2022.01); G10L 25/63 (2013.01);
Abstract

This disclosure relates generally to methods and systems for emotion-controllable generalized talking face generation of an arbitrary face image. Most of the conventional techniques for the realistic talking face generation may not be efficient to control the emotion over the face and have limited scope of generalization to an arbitrary unknown target face. The present disclosure proposes a graph convolutional network that uses speech content feature along with an independent emotion input to generate emotion and speech-induced motion on facial geometry-aware landmark representation. The facial geometry-aware landmark representation is further used in by an optical flow-guided texture generation network for producing the texture. A two-branch optical flow-guided texture generation network with motion and texture branches is designed to consider the motion and texture content independently. The optical flow-guided texture generation network then renders emotional talking face animation from a single image of any arbitrary target face.


Find Patent Forward Citations

Loading…