The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 17, 2024

Filed:

Sep. 09, 2021
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Jiahui Yu, Mountain View, CA (US);

Chung-cheng Chiu, Sunnyvale, CA (US);

Bo Li, Fremont, CA (US);

Shuo-yiin Chang, Sunnyvale, CA (US);

Tara Sainath, Jersey City, NJ (US);

Wei Han, Mountain View, CA (US);

Anmol Gulati, Mountain View, CA (US);

Yanzhang He, Mountain View, CA (US);

Arun Narayanan, Santa Clara, CA (US);

Yonghui Wu, Fremont, CA (US);

Ruoming Pang, New York, NY (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2013.01); G10L 15/16 (2006.01); G10L 15/187 (2013.01); G10L 15/22 (2006.01); G10L 15/30 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/16 (2013.01); G10L 15/22 (2013.01); G10L 15/30 (2013.01); G10L 15/187 (2013.01);
Abstract

A computer-implemented method of training a streaming speech recognition model that includes receiving, as input to the streaming speech recognition model, a sequence of acoustic frames. The streaming speech recognition model is configured to learn an alignment probability between the sequence of acoustic frames and an output sequence of vocabulary tokens. The vocabulary tokens include a plurality of label tokens and a blank token. At each output step, the method includes determining a first probability of emitting one of the label tokens and determining a second probability of emitting the blank token. The method also includes generating the alignment probability at a sequence level based on the first probability and the second probability. The method also includes applying a tuning parameter to the alignment probability at the sequence level to maximize the first probability of emitting one of the label tokens.


Find Patent Forward Citations

Loading…