The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 17, 2024

Filed:

Nov. 18, 2021
Applicant:

Tencent America Llc, Palo Alto, CA (US);

Inventors:

Yong Xu, Bellevue, WA (US);

Meng Yu, Bellevue, WA (US);

Shi-Xiong Zhang, Redmond, WA (US);

Dong Yu, Bellevue, WA (US);

Assignee:

TENCENT AMERICA LLC, Palo Alto, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 21/0208 (2013.01); G06N 3/044 (2023.01); G06N 3/08 (2023.01); G10L 21/0216 (2013.01); G10L 21/0264 (2013.01); G10L 25/30 (2013.01);
U.S. Cl.
CPC ...
G10L 21/0208 (2013.01);
Abstract

There is included a method and apparatus comprising computer code for generating enhanced target speech from audio data, performed by a computing device, the method comprising: receiving audio data corresponding to one or more speakers; generating estimated an target speech, an estimated noise, and an estimated echo simultaneously based on the audio data using a jointly trained complex ratio mask; predicting frame-level multi-tap time-frequency (T-F) spatio-temporal-echo filter weights based on the estimated target speech, the estimated noise, and the estimated echo using a trained neural network model; and predicting enhanced target speech based on the frame-level multi-tap T-F spatio-temporal-echo filter weights.


Find Patent Forward Citations

Loading…