The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 22, 2022

Filed:

Dec. 10, 2019
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Bo Li, Fremont, CA (US);

Ron J. Weiss, New York, NY (US);

Michiel A. U. Bacchiani, Summit, NJ (US);

Tara N. Sainath, Jersey City, NJ (US);

Kevin William Wilson, Cambridge, MA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/00 (2013.01); G10L 15/16 (2006.01); G10L 15/20 (2006.01); G10L 21/0224 (2013.01); G10L 15/26 (2006.01); G10L 21/0216 (2013.01);
U.S. Cl.
CPC ...
G10L 15/16 (2013.01); G10L 15/20 (2013.01); G10L 21/0224 (2013.01); G10L 15/26 (2013.01); G10L 2021/02166 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for neural network adaptive beamforming for multichannel speech recognition are disclosed. In one aspect, a method includes the actions of receiving a first channel of audio data corresponding to an utterance and a second channel of audio data corresponding to the utterance. The actions further include generating a first set of filter parameters for a first filter based on the first channel of audio data and the second channel of audio data and a second set of filter parameters for a second filter based on the first channel of audio data and the second channel of audio data. The actions further include generating a single combined channel of audio data. The actions further include inputting the audio data to a neural network. The actions further include providing a transcription for the utterance.


Find Patent Forward Citations

Loading…