The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 09, 2024

Filed:

Feb. 28, 2022
Applicant:

Electronic Arts Inc., Redwood City, CA (US);

Inventors:

Siddharth Gururani, Santa Clara, CA (US);

Kilol Gupta, Redwood City, CA (US);

Dhaval Shah, Redwood City, CA (US);

Zahra Shakeri, Newark, CA (US);

Jervis Pinto, Toronto, CA;

Mohsen Sardari, Burlingame, CA (US);

Navid Aghdaie, San Jose, CA (US);

Kazi Zaman, Foster City, CA (US);

Assignee:

ELECTRONIC ARTS INC., Redwood City, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 13/00 (2006.01); A63F 13/60 (2014.01); G06N 3/044 (2023.01); G06N 3/08 (2023.01); A63F 13/63 (2014.01);
U.S. Cl.
CPC ...
G10L 13/00 (2013.01); A63F 13/60 (2014.09); G06N 3/044 (2023.01); G06N 3/08 (2013.01); A63F 13/63 (2014.09); A63F 2300/6018 (2013.01);
Abstract

A system for use in video game development to generate expressive speech audio comprises a user interface configured to receive user-input text data and a user selection of a speech style. The system includes a machine-learned synthesizer comprising a text encoder, a speech style encoder and a decoder. The machine-learned synthesizer is configured to generate one or more text encodings derived from the user-input text data, using the text encoder of the machine-learned synthesizer; generate a speech style encoding by processing a set of speech style features associated with the selected speech style using the speech style encoder of the machine-learned synthesizer; combine the one or more text encodings and the speech style encoding to generate one or more combined encodings; and decode the one or more combined encodings with the decoder of the machine-learned synthesizer to generate predicted acoustic features. The system includes one or more modules configured to process the predicted acoustic features, the one or more modules comprising a machine-learned vocoder configured to generate a waveform of the expressive speech audio.


Find Patent Forward Citations

Loading…