The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 20, 2022

Filed:

Sep. 17, 2020
Applicant:

Tencent Technology (Shenzhen) Company Limited, Shenzhen, CN;

Inventors:

Lianwu Chen, Shenzhen, CN;

Meng Yu, Bellevue, WA (US);

Yanmin Qian, Shenzhen, CN;

Dan Su, Shenzhen, CN;

Dong Yu, Bothell, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 21/0272 (2013.01); G06N 3/04 (2006.01); G06N 3/08 (2006.01); G10L 25/30 (2013.01); G10L 25/51 (2013.01);
U.S. Cl.
CPC ...
G10L 21/0272 (2013.01); G06N 3/0454 (2013.01); G06N 3/088 (2013.01); G10L 25/30 (2013.01); G10L 25/51 (2013.01);
Abstract

A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.


Find Patent Forward Citations

Loading…