The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 16, 2019

Filed:

Sep. 09, 2016
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Jian Luan, Beijing, CN;

Lei He, Beijing, CN;

Max Leung, Kirkland, WA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 13/00 (2006.01); G10L 13/08 (2013.01); G10L 13/033 (2013.01); G10L 13/02 (2013.01); G06F 3/0482 (2013.01); G06F 3/0484 (2013.01);
U.S. Cl.
CPC ...
G10L 13/0335 (2013.01); G06F 3/0482 (2013.01); G06F 3/04847 (2013.01); G10L 13/02 (2013.01); G10L 13/033 (2013.01); G10L 13/08 (2013.01);
Abstract

Multi-voice font interpolation is provided. A multi-voice font interpolation engine allows the production of computer generated speech with a wide variety of speaker characteristics and/or prosody by interpolating speaker characteristics and prosody from existing fonts. Using prediction models from multiple voice fonts, the multi-voice font interpolation engine predicts values for the parameters that influence speaker characteristics and/or prosody for the phoneme sequence obtained from the text to spoken. For each parameter, additional parameter values are generated by a weighted interpolation from the predicted values. Modifying an existing voice font with the interpolated parameters changes the style and/or emotion of the speech while retaining the base sound qualities of the original voice. The multi-voice font interpolation engine allows the speaker characteristics and/or prosody to be transplanted from one voice font to another or entirely new speaker characteristics and/or prosody to be generated for an existing voice font.


Find Patent Forward Citations

Loading…