The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 13, 2025

Filed:

Dec. 23, 2024
Applicant:

Nanjing Silicon Intelligence Technology Co., Ltd., Jiangsu, CN;

Inventors:

Huapeng Sima, Jiangsu, CN;

Ran Xu, Jiangsu, CN;

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2013.01); G10L 21/013 (2013.01); G10L 25/18 (2013.01); G10L 25/30 (2013.01); G10L 25/90 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 21/013 (2013.01); G10L 25/18 (2013.01); G10L 25/30 (2013.01); G10L 25/90 (2013.01); G10L 2021/0135 (2013.01);
Abstract

The present disclosure provides a pitch-based speech conversion model training method and a speech conversion system, wherein an audio feature code is output by a priori encoder, and a pitch feature is extracted by a pitch extraction module. A linear spectrum corresponding to the reference speech is input into the posteriori encoder to obtain an audio latent variable. In addition, the audio feature code, a speech concatenation feature obtained by concatenation of the audio feature code and the pitch feature, and the audio latent variable are input into a temporal alignment module to obtain a converted speech code, and the converted speech code is decoded by a decoder to obtain a converted speech. The training loss of the converted speech is then calculated to determine the degree of convergence of the speech conversion model.


Find Patent Forward Citations

Loading…