The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 18, 2025

Filed:

Dec. 16, 2022
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Gourav Datta, Los Angeles, CA (US);

Vivek Yadav, Lakewood, CO (US);

Yue Wu, Torrance, CA (US);

Ayush Jaiswal, Redondo Beach, CA (US);

Rajiv M Reddy, Bellevue, WA (US);

Prateek Singhal, San Francisco, CA (US);

Karthik Ramakrishnan, Bellevue, WA (US);

Premkumar Natarajan, Rolling Hills Estates, CA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06T 7/20 (2017.01); G06T 7/70 (2017.01); G06T 13/20 (2011.01); G06T 13/40 (2011.01); G06V 40/16 (2022.01); G10L 15/22 (2006.01); G10L 25/57 (2013.01); G10L 25/60 (2013.01);
U.S. Cl.
CPC ...
G06T 13/205 (2013.01); G06T 7/20 (2013.01); G06T 7/70 (2017.01); G06T 13/40 (2013.01); G06V 40/176 (2022.01); G10L 15/22 (2013.01); G10L 25/57 (2013.01); G10L 25/60 (2013.01); G06T 2207/30201 (2013.01);
Abstract

A system configured to perform style-aware listener animation. By representing different listening styles (e.g., facial expressions) using an embedding space, a single model can be trained to generate unique facial animations for a number of distinct listeners. Thus, individual listening styles can be associated with a listener identifier, enabling the system to (i) animate a plurality of different listeners with unique nonverbal behavior and/or (ii) select a particular listener identifier or desired type of listener style with which to animate. This enables the model to be generalized to new listeners to generate additional listener facial responses without needing training data for each new listener. The model may process a listener representation style or listener identifier, along with input data corresponding to a speaker talking, to generate unique facial animation responsive to the speech.


Find Patent Forward Citations

Loading…