The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 10, 2024

Filed:

Feb. 16, 2022
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Daisy Antonia Stanton, San Francisco, CA (US);

Sean Matthew Shannon, San Francisco, CA (US);

Soroosh Mariooryad, Redwood City, CA (US);

Russell John-Wyatt Skerry-Ryan, Mountain View, CA (US);

Eric Dean Battenberg, Walnut Creek, CA (US);

Thomas Edward Bagby, Monte Rio, CA (US);

David Teh-Hwa Kao, Philadelphia, PA (US);

Assignee:

GOOGLE LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 13/08 (2013.01); G06N 3/08 (2023.01); G10L 13/027 (2013.01);
U.S. Cl.
CPC ...
G10L 13/086 (2013.01); G06N 3/08 (2013.01); G10L 13/027 (2013.01);
Abstract

Systems and methods for text-to-speech with novel speakers can obtain text data and output audio data. The input text data may be input along with one or more speaker preferences. The speaker preferences can include speaker characteristics. The speaker preferences can be processed by a machine-learned model conditioned on a learned prior distribution to determine a speaker embedding. The speaker embedding can then be processed with the text data to generate an output that includes audio data descriptive of the text data spoken by a novel speaker.


Find Patent Forward Citations

Loading…