The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 24, 2024

Filed:

Oct. 07, 2021
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

William Chan, Mountain View, CA (US);

Navdeep Jaitly, Mountain View, CA (US);

Quoc V. Le, Sunnyvale, CA (US);

Oriol Vinyals, Berkeley, CA (US);

Noam M. Shazeer, Palo Alto, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/16 (2006.01); G06F 40/12 (2020.01); G06F 40/197 (2020.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G10L 15/183 (2013.01); G10L 15/26 (2006.01); G10L 25/30 (2013.01);
U.S. Cl.
CPC ...
G10L 15/16 (2013.01); G06F 40/12 (2020.01); G06F 40/197 (2020.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G10L 15/183 (2013.01); G10L 15/26 (2013.01); G10L 25/30 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media for speech recognition. One method includes obtaining an input acoustic sequence, the input acoustic sequence representing an utterance, and the input acoustic sequence comprising a respective acoustic feature representation at each of a first number of time steps; processing the input acoustic sequence using a first neural network to convert the input acoustic sequence into an alternative representation for the input acoustic sequence; processing the alternative representation for the input acoustic sequence using an attention-based Recurrent Neural Network (RNN) to generate, for each position in an output sequence order, a set of substring scores that includes a respective substring score for each substring in a set of substrings; and generating a sequence of substrings that represent a transcription of the utterance.


Find Patent Forward Citations

Loading…