The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent: Nov. 01, 2022

Filed: Jun. 19, 2020

Applicant: Baidu Online Network Technology (Beijing) Co., Ltd., Beijing, CN

Inventors: Zhipeng Chen, Beijing, CN; Jinfeng Bai, Beijing, CN; Lei Jia, Beijing, CN

Attorney:
Primary Examiner:
Int. Cl.: G10L 13/047 (2013.01); G06N 3/08 (2006.01); G10L 13/06 (2013.01); G10L 13/08 (2013.01)
U.S. Cl. (CPC): G10L 13/047 (2013.01); G06N 3/08 (2013.01); G10L 13/06 (2013.01); G10L 13/08 (2013.01)
Abstract

The present application discloses a training method and apparatus for a speech synthesis model, an electronic device, and a storage medium. The method includes: taking a syllable input sequence, a phoneme input sequence, and a Chinese character input sequence of a current sample as inputs of an encoder of a model to be trained, to obtain encoded representations of these three sequences at an output end of the encoder; fusing the encoded representations of these three sequences, to obtain a weighted combination of these three sequences; taking the weighted combination as an input of an attention module, to obtain a weighted average of the weighted combination at each moment at an output end of the attention module; and taking the weighted average as an input of a decoder of the model to be trained, to obtain a speech Mel spectrum of the current sample at an output end of the decoder.
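
The data flow in the abstract (encode three sequences, fuse them, attend, decode to Mel frames) can be sketched as a toy forward pass in plain Python. This is only an illustration under stated assumptions, not the patented implementation: the three sequences are assumed to have equal length, the encoder is a lookup table, the fusion is a softmax-weighted sum, the attention is dot-product attention with one query per output frame, and the decoder is a single linear projection. All names (`encode`, `fuse`, `attend`, `decode`, `forward`, `params`) and sizes are hypothetical. The loss and parameter update of the training method are not described in the abstract and are left out.

```python
import math
import random

random.seed(0)

DIM, N_MELS = 8, 4  # toy sizes chosen for illustration, not taken from the patent


def rand_matrix(rows, cols):
    """Random stand-in for learned parameters."""
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def encode(ids, table):
    """Assumed encoder: look up one DIM-sized vector per input symbol."""
    return [table[i] for i in ids]


def fuse(encodings, logits):
    """Weighted combination of the encoded sequences, position by position."""
    w = softmax(logits)
    steps = len(encodings[0])
    return [
        [sum(w[k] * enc[t][d] for k, enc in enumerate(encodings)) for d in range(DIM)]
        for t in range(steps)
    ]


def attend(memory, query):
    """Assumed dot-product attention: weighted average of memory for one moment."""
    scores = [sum(q * m for q, m in zip(query, vec)) for vec in memory]
    alpha = softmax(scores)
    return [sum(a * vec[d] for a, vec in zip(alpha, memory)) for d in range(DIM)]


def decode(context, proj):
    """Assumed decoder: linear projection of the context to one Mel frame."""
    return [sum(c * proj[d][m] for d, c in enumerate(context)) for m in range(N_MELS)]


def forward(syllable_ids, phoneme_ids, char_ids, params, n_frames):
    inputs = (syllable_ids, phoneme_ids, char_ids)
    encodings = [encode(ids, table) for ids, table in zip(inputs, params["tables"])]
    memory = fuse(encodings, params["fusion_logits"])
    frames = []
    for t in range(n_frames):
        context = attend(memory, params["queries"][t])
        frames.append(decode(context, params["proj"]))
    return frames


params = {
    "tables": [rand_matrix(16, DIM) for _ in range(3)],  # one table per input type
    "fusion_logits": [0.0, 0.0, 0.0],                    # equal fusion weights
    "queries": rand_matrix(5, DIM),                      # one query per output frame
    "proj": rand_matrix(DIM, N_MELS),
}
mel = forward([1, 2, 3], [4, 5, 6], [7, 8, 9], params, n_frames=5)
print(len(mel), "frames of", len(mel[0]), "Mel bins")
```

In a real model each of these stand-ins would be a trained network, and the predicted frames would be compared against the Mel spectrum of the recorded sample to drive training.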

