The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Dec. 03, 2024

Filed:

Sep. 27, 2022

Applicant:

Tencent America LLC, Palo Alto, CA (US);

Inventors:

Chunlei Zhang, Bellevue, WA (US);

Jiachen Lian, Palo Alto, CA (US);

Dong Yu, Palo Alto, CA (US);

Assignee:

TENCENT AMERICA LLC, Palo Alto, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 13/10 (2013.01); G10L 13/00 (2006.01); G10L 13/047 (2013.01); G10L 13/06 (2013.01); G10L 15/14 (2006.01);
U.S. Cl.
CPC ...
G10L 13/10 (2013.01); G10L 13/047 (2013.01); G10L 13/06 (2013.01); G10L 2013/105 (2013.01);
Abstract

An unsupervised text-to-speech system utilizes a lexicon to map input text to a phoneme sequence, which is expanded to a frame-level forced alignment with a speaker-dependent duration model. An alignment mapping module converts the forced alignment to the unsupervised alignment (UA). Afterward, a Conditional Disentangled Sequential Variational Auto-encoder (C-DSVAE), serving as the self-supervised TTS AM, takes the predicted UA and a target speaker embedding to generate the mel spectrogram, which is ultimately converted to a waveform with a neural vocoder.
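
The first two stages named in the abstract (lexicon lookup, then expansion to a frame-level alignment) can be illustrated with a toy sketch. This is not the patented method: the lexicon entries, the per-phoneme frame counts, and the function names `text_to_phonemes` and `expand_to_frames` are all hypothetical, and a fixed duration table stands in for the speaker-dependent duration model. The later stages (alignment mapping, C-DSVAE, vocoder) are not modeled here.

```python
from typing import Dict, List

# Hypothetical toy lexicon; a real system would load a full pronunciation dictionary.
LEXICON: Dict[str, List[str]] = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}

# Hypothetical fixed frame counts per phoneme. This table is only a stand-in
# for the speaker-dependent duration model mentioned in the abstract.
DURATIONS: Dict[str, int] = {
    "HH": 2, "AH": 3, "L": 2, "OW": 4, "W": 2, "ER": 3, "D": 2,
}


def text_to_phonemes(text: str, lexicon: Dict[str, List[str]]) -> List[str]:
    """Map whitespace-separated words to a flat phoneme sequence via the lexicon."""
    phonemes: List[str] = []
    for word in text.lower().split():
        if word not in lexicon:
            raise KeyError(f"word not in lexicon: {word!r}")
        phonemes.extend(lexicon[word])
    return phonemes


def expand_to_frames(phonemes: List[str], durations: Dict[str, int]) -> List[str]:
    """Repeat each phoneme for its frame count to get a frame-level alignment."""
    frames: List[str] = []
    for phoneme in phonemes:
        frames.extend([phoneme] * durations[phoneme])
    return frames


if __name__ == "__main__":
    phones = text_to_phonemes("hello world", LEXICON)
    alignment = expand_to_frames(phones, DURATIONS)
    print(phones)
    print(len(alignment), alignment[:5])
```

In this sketch the frame-level alignment is simply one phoneme label per frame; per the abstract, the system then converts such an alignment to the unsupervised alignment (UA) before acoustic modeling.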

