The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 29, 2020

Filed:

Sep. 10, 2019
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Guillaume Jean Victor Marie Le Moing, Montbonnot Saint Martin, FR;

Phongtharin Vinayavekhin, Tokyo, JP;

Don Joven R. Agravante, Tokyo, JP;

Tadanobu Inoue, Kanagawa, JP;

Asim Munawar, Ichikawa, JP;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
H04R 3/00 (2006.01); H04R 1/40 (2006.01); G06N 3/04 (2006.01); G06N 3/08 (2006.01);
U.S. Cl.
CPC ...
H04R 3/005 (2013.01); G06N 3/04 (2013.01); H04R 1/406 (2013.01); G06N 3/08 (2013.01);
Abstract

A computer-implemented method is provided for multi-source sound localization. The method includes extracting, by a hardware processor, spectral features from respective pluralities of microphones comprised in each of two or more microphone arrays. The method further includes forming, by the hardware processor, respective sets of pairs of the spectral features from the respective pluralities of microphones within each of the two or more microphone arrays by rearranging and duplicating the spectral features from the respective pluralities of microphones included in each of the two or more microphone arrays. The method also includes inputting, by the hardware processor, the respective sets of pairs of the spectral features into a neural network to encode the spectral features into deep features and decode the deep features to output from the neural network at least one location representation of one or more sound sources.


Find Patent Forward Citations

Loading…