The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 10, 2023

Filed:

Jun. 08, 2021
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Ehsan Variani, Mountain View, CA (US);

Kevin William Wilson, Cambridge, MA (US);

Ron J. Weiss, New York, NY (US);

Tara N. Sainath, Jersey City, NJ (US);

Arun Narayanan, Santa Clara, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/16 (2006.01); G10L 25/30 (2013.01); G10L 21/028 (2013.01); G10L 21/0388 (2013.01); G10L 19/008 (2013.01); G10L 15/20 (2006.01); G10L 21/0208 (2013.01); G10L 21/0216 (2013.01);
U.S. Cl.
CPC ...
G10L 25/30 (2013.01); G10L 15/16 (2013.01); G10L 15/20 (2013.01); G10L 19/008 (2013.01); G10L 21/028 (2013.01); G10L 21/0388 (2013.01); G10L 2021/02087 (2013.01); G10L 2021/02166 (2013.01);
Abstract

This specification describes computer-implemented methods and systems. One method includes receiving, by a neural network of a speech recognition system, first data representing a first raw audio signal and second data representing a second raw audio signal. The first raw audio signal and the second raw audio signal describe audio occurring at a same period of time. The method further includes generating, by a spatial filtering layer of the neural network, a spatial filtered output using the first data and the second data, and generating, by a spectral filtering layer of the neural network, a spectral filtered output using the spatial filtered output. Generating the spectral filtered output comprises processing frequency-domain data representing the spatial filtered output. The method still further includes processing, by one or more additional layers of the neural network, the spectral filtered output to predict sub-word units encoded in both the first raw audio signal and the second raw audio signal.


Find Patent Forward Citations

Loading…