The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 26, 2023

Filed:

May. 14, 2019
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Yang Zhang, Cambridge, MA (US);

Shiyu Chang, Elmsford, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 21/003 (2013.01); G10L 21/013 (2013.01); G10L 19/00 (2013.01); G06N 20/20 (2019.01); G06N 3/08 (2023.01); G06N 3/045 (2023.01);
U.S. Cl.
CPC ...
G10L 21/013 (2013.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01); G06N 20/20 (2019.01); G10L 19/00 (2013.01); G10L 2021/0135 (2013.01);
Abstract

A method (and structure and computer product) to permit zero-shot voice conversion with non-parallel data includes receiving source speaker speech data as input data into a content encoder of a style transfer autoencoder system, the content encoder providing a source speaker disentanglement of the source speaker speech data by reducing speaker style information of the input source speech data while retaining content information and receiving target speaker input speech as input data into a target speaker encoder. The output of the content encoder and the target speaker encoder are combined in a decoder of the style transfer autoencoder, and the output of the decoder provides the content information of the input source speech data in a style of the target speaker speech information.


Find Patent Forward Citations

Loading…