The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document in PDF format.

Date of Patent:

Aug. 26, 2025

Filed:

Sep. 02, 2021

Applicant:

Google LLC, Mountain View, CA (US);

Inventors:

Byungha Chun, Tokyo, JP;

Mohammad Norouzi, Richmond Hill, CA;

Nanxin Chen, Baltimore, MD (US);

Ron J. Weiss, New York, NY (US);

William Chan, Toronto, CA;

Yu Zhang, Mountain View, CA (US);

Yonghui Wu, Fremont, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G10L 13/08 (2013.01); G10L 21/0208 (2013.01);
U.S. Cl.
CPC ...
G10L 13/08 (2013.01); G10L 21/0208 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating waveforms conditioned on phoneme sequences. In one aspect, a method comprises: obtaining a phoneme sequence; processing the phoneme sequence using an encoder neural network to generate a hidden representation of the phoneme sequence; generating, from the hidden representation, a conditioning input; initializing a current waveform output; and generating a final waveform output that defines an utterance of the phoneme sequence by a speaker by updating the current waveform output at each of a plurality of iterations, wherein each iteration corresponds to a respective noise level, and wherein the updating comprises, at each iteration: processing (i) the current waveform output and (ii) the conditioning input using a noise estimation neural network to generate a noise output; and updating the current waveform output using the noise output and the noise level for the iteration.
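The iterative procedure in the abstract can be illustrated with a minimal Python sketch. Everything here is a hypothetical stand-in rather than the patented method: `encode`, `make_conditioning`, and `estimate_noise` are toy functions in place of the neural networks, the noise-level schedule and `hop` factor are invented for illustration, the random-noise initialization is an assumption, and the update rule `w - level * n` is one simple choice, since the abstract does not specify how the noise output and noise level are combined.

```python
import random


def encode(phonemes):
    # Toy stand-in for the encoder neural network:
    # maps each phoneme ID to one hidden value.
    return [p / 10.0 for p in phonemes]


def make_conditioning(hidden, hop=4):
    # Toy stand-in for deriving the conditioning input: repeat each
    # hidden value `hop` times so it lines up with waveform samples.
    return [h for h in hidden for _ in range(hop)]


def estimate_noise(waveform, conditioning):
    # Toy stand-in for the noise estimation neural network: treats the
    # gap between the current waveform and the conditioning as "noise".
    return [w - c for w, c in zip(waveform, conditioning)]


def synthesize(phonemes, noise_levels=(0.9, 0.7, 0.5, 0.3, 0.1), hop=4, seed=0):
    rng = random.Random(seed)
    conditioning = make_conditioning(encode(phonemes), hop)
    # Initialize the current waveform output (assumed here: Gaussian noise).
    waveform = [rng.gauss(0.0, 1.0) for _ in conditioning]
    # One iteration per noise level.
    for level in noise_levels:
        noise = estimate_noise(waveform, conditioning)
        # Assumed update rule: subtract the noise output scaled by the level.
        waveform = [w - level * n for w, n in zip(waveform, noise)]
    return waveform, conditioning


if __name__ == "__main__":
    final_waveform, cond = synthesize([3, 1, 4])
    print(len(final_waveform), "samples generated")
```

In this toy setup each iteration shrinks the gap between the waveform and the conditioning signal by a factor of `1 - level`, which mirrors the structure of the claim (estimate noise, then update using the noise output and the noise level) without reproducing any real model.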

