The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 09, 2024

Filed:

Oct. 07, 2021
Applicant:

Nvidia Corporation, Santa Clara, CA (US);

Inventors:

Kevin Shih, Santa Clara, CA (US);

Jose Rafael Valle Gomes da Costa, Berkeley, CA (US);

Rohan Badlani, San Jose, CA (US);

Adrian Lancucki, Legnica, PL;

Wei Ping, Sunnyvale, CA (US);

Bryan Catanzaro, Los Altos Hills, CA (US);

Assignee:

Nvidia Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 13/00 (2006.01); G10L 13/08 (2013.01); G10L 13/10 (2013.01); G10L 13/047 (2013.01); G10L 25/90 (2013.01); G06N 3/045 (2023.01); G06N 3/08 (2023.01); G10L 13/033 (2013.01);
U.S. Cl.
CPC ...
G10L 13/047 (2013.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01); G10L 13/0335 (2013.01); G10L 13/08 (2013.01); G10L 25/90 (2013.01);
Abstract

Generation of synthetic speech from an input text sequence may be difficult when durations of individual phonemes forming the input text sequence are unknown. A predominantly parallel process may model speech rhythm as a separate generative distribution such that phoneme duration may be sampled at inference. Additional information such as pitch or energy may also be sampled to provide improved diversity for synthetic speech generation.


Find Patent Forward Citations

Loading…