The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

G10L 25/30 (2013.01); G06F 18/10 (2023.01); G06F 18/21 (2023.01); G06F 18/2135 (2023.01); G06N 3/02 (2006.01); G06N 3/042 (2023.01); G06N 3/08 (2023.01); G06N 5/02 (2023.01); G10L 13/04 (2013.01); G10L 13/08 (2013.01); G10L 19/00 (2013.01);

U.S. Cl.

CPC ...

G10L 13/08 (2013.01); G06F 18/10 (2023.01); G06F 18/2135 (2023.01); G06F 18/217 (2023.01); G06N 3/02 (2013.01); G06N 3/042 (2023.01); G06N 3/08 (2013.01); G06N 5/02 (2013.01); G10L 13/04 (2013.01); G10L 19/00 (2013.01);

Abstract

A technique improves training and speech quality of a text-to-speech (TTS) system having an artificial intelligence, such as a neural network. The TTS system is organized as a front-end subsystem and a back-end subsystem. The front-end subsystem is configured to provide analysis and conversion of text into input vectors, each having at least a base frequency, f, a phenome duration, and a phoneme sequence that is processed by a signal generation unit of the back-end subsystem. The signal generation unit includes the neural network interacting with a pre-existing knowledgebase of phenomes to generate audible speech from the input vectors. The technique applies an error signal from the neural network to correct imperfections of the pre-existing knowledgebase of phenomes to generate audible speech signals. A back-end training system is configured to train the signal generation unit by applying psychoacoustic principles to improve quality of the generated audible speech signal.

Find Patent Forward Citations