The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 06, 2020

Filed:

Aug. 08, 2018
Applicant:

Baidu Usa, Llc, Sunnyvale, CA (US);

Inventors:

Sercan O. Arik, San Francisco, CA (US);

Wei Ping, Sunnyvale, CA (US);

Kainan Peng, Sunnyvale, CA (US);

Sharan Narang, Sunnyvale, CA (US);

Ajay Kannan, San Francisco, CA (US);

Andrew Gibiansky, Mountain View, CA (US);

Jonathan Raiman, Palo Alto, CA (US);

John Miller, Berkeley, CA (US);

Assignee:

Baidu USA LLC, Sunnyvale, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 13/027 (2013.01); G10L 13/08 (2013.01); G10L 13/047 (2013.01);
U.S. Cl.
CPC ...
G10L 13/027 (2013.01); G10L 13/08 (2013.01); G10L 13/047 (2013.01);
Abstract

Described herein are embodiments of a fully-convolutional attention-based neural text-to-speech (TTS) system, which various embodiments may generally be referred to as Deep Voice 3. Embodiments of Deep Voice 3 match state-of-the-art neural speech synthesis systems in naturalness while training ten times faster. Deep Voice 3 embodiments were scaled to data set sizes unprecedented for TTS, training on more than eight hundred hours of audio from over two thousand speakers. In addition, common error modes of attention-based speech synthesis networks were identified and mitigated, and several different waveform synthesis methods were compared. Also presented are embodiments that describe how to scale inference to ten million queries per day on one single-GPU server.


Find Patent Forward Citations

Loading…