The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Dec. 03, 2024

Filed:

Jun. 21, 2022

Applicant:

Google LLC, Mountain View, CA (US);

Inventors:

Zhehuai Chen, Jersey City, NJ (US);

Bhuvana Ramabhadran, Mt. Kisco, NY (US);

Andrew M. Rosenberg, Brooklyn, NY (US);

Yu Zhang, Mountain View, CA (US);

Pedro J. Moreno Mengibar, Jersey City, NJ (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2013.01); G10L 13/047 (2013.01); G10L 13/08 (2013.01); G10L 15/16 (2006.01);
U.S. Cl.
CPC ...
G10L 13/047 (2013.01); G10L 13/08 (2013.01);
Abstract

A method includes receiving training data that includes unspoken text utterances and un-transcribed non-synthetic speech utterances. Each unspoken text utterance is not paired with any corresponding spoken utterance of non-synthetic speech. Each un-transcribed non-synthetic speech utterance is not paired with a corresponding transcription. The method also includes generating a corresponding synthetic speech representation for each unspoken textual utterance of the received training data using a text-to-speech model. The method also includes pre-training an audio encoder on the synthetic speech representations generated for the unspoken textual utterances and the un-transcribed non-synthetic speech utterances to teach the audio encoder to jointly learn shared speech and text representations.
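
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: `toy_tts`, `PretrainExample`, and `build_pretraining_set` are hypothetical names introduced here, the text-to-speech model is replaced by a trivial stand-in, and the pre-training step itself is not shown. The sketch only shows how unspoken text is converted to synthetic speech representations and pooled with un-transcribed non-synthetic speech before an audio encoder would be pre-trained on the result.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class PretrainExample:
    """One pre-training input (hypothetical structure, not from the patent)."""
    features: List[float]  # stand-in for a speech representation
    source: str            # "synthetic" or "non_synthetic"


def toy_tts(text: str) -> List[float]:
    """Hypothetical stand-in for a text-to-speech model.

    Emits one number per character so the sketch runs without any
    speech library; a real system would produce acoustic features.
    """
    return [ord(ch) / 255.0 for ch in text]


def build_pretraining_set(
    unspoken_text: Sequence[str],
    untranscribed_speech: Sequence[Sequence[float]],
    tts: Callable[[str], List[float]] = toy_tts,
) -> List[PretrainExample]:
    """Pool synthetic and non-synthetic speech for encoder pre-training."""
    # Each unspoken text utterance has no paired audio, so synthesize one.
    synthetic = [PretrainExample(tts(t), "synthetic") for t in unspoken_text]
    # Each un-transcribed utterance has no transcript; it is used as audio only.
    real = [
        PretrainExample(list(a), "non_synthetic") for a in untranscribed_speech
    ]
    return synthetic + real


examples = build_pretraining_set(["hi"], [[0.1, 0.2, 0.3]])
```

In this sketch the encoder would see both kinds of example in a single pool, which mirrors the abstract's goal of learning shared speech and text representations; how the two sources are weighted or batched is left open here.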

