The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

G06N 3/02 (2006.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/084 (2023.01); G06N 7/01 (2023.01); G10L 15/02 (2006.01); G10L 15/06 (2013.01); G10L 15/14 (2006.01); G10L 15/16 (2006.01); G10L 15/22 (2006.01); G10L 15/183 (2013.01);

U.S. Cl.

CPC ...

G10L 15/16 (2013.01); G06N 3/02 (2013.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/084 (2013.01); G10L 15/14 (2013.01); G10L 15/22 (2013.01); G06N 7/01 (2023.01); G10L 15/02 (2013.01); G10L 15/063 (2013.01); G10L 15/183 (2013.01); G10L 2015/025 (2013.01);

Abstract

Methods, systems, and apparatus for performing speech recognition. In some implementations, acoustic data representing an utterance is obtained. The acoustic data corresponds to time steps in a series of time steps. One or more computers process scores indicative of the acoustic data using a recurrent neural network to generate a sequence of outputs. The sequence of outputs indicates a likely output label from among a predetermined set of output labels. The predetermined set of output labels includes output labels that respectively correspond to different linguistic units and to a placeholder label that does not represent a classification of acoustic data. The recurrent neural network is configured to use an output label indicated for a previous time step to determine an output label for the current time step. The generated sequence of outputs is processed to generate a transcription of the utterance, and the transcription of the utterance is provided.

Find Patent Forward Citations