The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, the date the patent was issued, the date it was filed, the title, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The badge also links to the full patent document in PDF format.

Patent No.:

US 11605388 B1

Date of Patent:

Mar. 14, 2023

Filed:

Nov. 09, 2020

Speaker conversion for video games

Applicant:

Electronic Arts Inc., Redwood City, CA (US);

Inventors:

Kilol Gupta, Redwood City, CA (US);

Dhaval Shah, Redwood City, CA (US);

Zahra Shakeri, Mountain View, CA (US);

Jervis Pinto, Toronto (CA);

Mohsen Sardari, Burlingame, CA (US);

Harold Chaput, Castro Valley, CA (US);

Navid Aghdaie, San Jose, CA (US);

Kazi Zaman, Foster City, CA (US);

Assignee:

Electronic Arts Inc., Redwood City, CA (US);

Attorney:

Gray Ice Higdon

Primary Examiner:

Jakieda R Jackson

Int. Cl.

G10L 15/02 (2006.01); G10L 17/04 (2013.01); A63F 13/424 (2014.01); A63F 13/215 (2014.01); G10L 15/00 (2013.01);

U.S. Cl.

CPC ...

G10L 17/04 (2013.01); A63F 13/215 (2014.09); A63F 13/424 (2014.09); G10L 15/005 (2013.01);

Abstract

This specification describes a computer-implemented method of generating speech audio for use in a video game, wherein the speech audio is generated using a voice convertor that has been trained to convert audio data for a source speaker into audio data for a target speaker. The method comprises receiving: (i) source speech audio, and (ii) a target speaker identifier. The source speech audio comprises speech content in the voice of a source speaker. Source acoustic features are determined for the source speech audio. A target speaker embedding associated with the target speaker identifier is generated as output of a speaker encoder of the voice convertor. The target speaker embedding and the source acoustic features are inputted into an acoustic feature encoder of the voice convertor. One or more acoustic feature encodings are generated as output of the acoustic feature encoder. The one or more acoustic feature encodings are derived from the target speaker embedding and the source acoustic features. Target speech audio is generated for the target speaker. The target speech audio comprises the speech content in the voice of the target speaker. The generating comprises decoding the one or more acoustic feature encodings using an acoustic feature decoder of the voice convertor.
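The data flow in the abstract can be sketched as a toy pipeline. This is only an illustration under loud assumptions: the class and function names (SpeakerEncoder, AcousticFeatureEncoder, AcousticFeatureDecoder, acoustic_features, convert_voice), the frame size, the embedding size, and all of the arithmetic are hypothetical stand-ins. The abstract describes trained components of a voice convertor and does not specify their internals; nothing below is taken from the patent beyond the order of the steps.

```python
import math
import random

FRAME = 4     # samples per frame (toy value, an assumption)
EMB_DIM = 3   # speaker embedding size (toy value, an assumption)


def acoustic_features(audio):
    """Toy source acoustic features: per-frame [mean energy, mean |amplitude|]."""
    frames = [audio[i:i + FRAME] for i in range(0, len(audio) - FRAME + 1, FRAME)]
    return [
        [sum(s * s for s in f) / FRAME, sum(abs(s) for s in f) / FRAME]
        for f in frames
    ]


class SpeakerEncoder:
    """Maps a target speaker identifier to an embedding.

    A fixed random table stands in for a trained speaker encoder.
    """

    def __init__(self, speaker_ids, seed=0):
        rng = random.Random(seed)
        self.table = {
            sid: [rng.uniform(-1.0, 1.0) for _ in range(EMB_DIM)]
            for sid in speaker_ids
        }

    def __call__(self, speaker_id):
        return self.table[speaker_id]


class AcousticFeatureEncoder:
    """Derives one encoding per frame from the embedding and the source features.

    Concatenation stands in for a trained encoder network.
    """

    def __call__(self, embedding, features):
        return [frame + embedding for frame in features]


class AcousticFeatureDecoder:
    """Decodes encodings into audio samples (placeholder arithmetic, not a vocoder)."""

    def __call__(self, encodings):
        audio = []
        for enc in encodings:
            level = math.tanh(sum(enc))
            audio.extend([level] * FRAME)
        return audio


def convert_voice(source_audio, target_speaker_id, speaker_encoder,
                  feature_encoder, decoder):
    """Runs the steps in the order the abstract lists them."""
    features = acoustic_features(source_audio)        # source acoustic features
    embedding = speaker_encoder(target_speaker_id)    # target speaker embedding
    encodings = feature_encoder(embedding, features)  # acoustic feature encodings
    return decoder(encodings)                         # target speech audio


# Usage: the same source audio converted for two hypothetical target speakers.
source = [math.sin(0.3 * n) for n in range(16)]
speakers = SpeakerEncoder(["hero", "villain"])
hero_audio = convert_voice(source, "hero", speakers,
                           AcousticFeatureEncoder(), AcousticFeatureDecoder())
```

The point of the sketch is the wiring: the target speaker identifier only enters through the speaker embedding, and the decoder sees nothing but the encodings, so changing the identifier changes the output while the source audio stays the same.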
