The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Feb. 20, 2024
Filed:
Aug. 12, 2021
Inter-channel feature extraction method, audio separation method and apparatus, and computing device
Tencent Technology (Shenzhen) Company Limited, Shenzhen, CN;
Rongzhi Gu, Shenzhen, CN;
Shixiong Zhang, Shenzhen, CN;
Lianwu Chen, Shenzhen, CN;
Yong Xu, Shenzhen, CN;
Meng Yu, Shenzhen, CN;
Dan Su, Shenzhen, CN;
Dong Yu, Shenzhen, CN;
TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen, CN;
Abstract
This application relates to a method of extracting an inter channel feature from a multi-channel multi-sound source mixed audio signal performed at a computing device. The method includes: transforming one channel component of a multi-channel multi-sound source mixed audio signal into a single-channel multi-sound source mixed audio representation in a feature space; performing a two-dimensional dilated convolution on the multi-channel multi-sound source mixed audio signal to extract inter-channel features; performing a feature fusion on the single-channel multi-sound source mixed audio representation and the inter-channel features; estimating respective weights of sound sources in the single-channel multi-sound source mixed audio representation based on a fused multi-channel multi-sound source mixed audio feature; obtaining respective representations of the plurality of sound sources according to the single-channel multi-sound source mixed audio representation and the respective weights; and transforming the respective representations of the sound sources into respective audio signals of the plurality of sound sources.