The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 22, 2022

Filed:

Nov. 12, 2020
Applicant:

Ubtech Robotics Corp Ltd, Shenzhen, CN;

Inventors:

Ruotong Wang, Shenzhen, CN;

Dongyan Huang, Shenzhen, CN;

Xian Li, Shenzhen, CN;

Jiebin Xie, Shenzhen, CN;

Zhichao Tang, Shenzhen, CN;

Wan Ding, Shenzhen, CN;

Yang Liu, Shenzhen, CN;

Bai Li, Shenzhen, CN;

Youjun Xiong, Shenzhen, CN;

Assignee:

UBTECH ROBOTICS CORP LTD, Shenzhen, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2013.01); G06N 3/08 (2006.01); G10L 15/16 (2006.01); G10L 15/30 (2013.01); G10L 21/01 (2013.01); G10L 25/18 (2013.01); G10L 25/24 (2013.01); G10L 21/003 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G06N 3/08 (2013.01); G10L 15/16 (2013.01); G10L 15/30 (2013.01); G10L 21/003 (2013.01); G10L 21/01 (2013.01); G10L 25/18 (2013.01); G10L 25/24 (2013.01);
Abstract

The present disclosure discloses a voice conversion training method. The method includes: forming a first training data set including a plurality of training voice data groups; selecting two of the training voice data groups from the first training data set to input into a voice conversion neural network for training; forming a second training data set including the first training data set and a first source speaker voice data group; inputting one of the training voice data groups selected from the first training data set and the first source speaker voice data group into the network for training; forming the third training data set including the second source speaker voice data group and the personalized voice data group that are parallel corpus with respect to each other; and inputting the second source speaker voice data group and the personalized voice data group into the network for training.


Find Patent Forward Citations

Loading…