The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, issue date, filing date, title, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The badge also links to the full patent document in PDF format.

Date of Patent:

Mar. 28, 2023

Filed:

Jan. 19, 2021

Applicant:

Google LLC, Mountain View, CA (US);

Inventors:

Arindrima Datta, New York, NY (US);

Bhuvana Ramabhadran, Mt. Kisco, NY (US);

Jesse Emond, Mountain View, CA (US);

Brian Roark, Mountain View, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G10L 15/00 (2013.01); G06F 40/58 (2020.01); G06N 3/04 (2006.01); G10L 15/06 (2013.01); G10L 15/16 (2006.01); G10L 15/26 (2006.01); G06N 3/049 (2023.01);
U.S. Cl.
CPC ...
G10L 15/005 (2013.01); G06F 40/58 (2020.01); G06N 3/049 (2013.01); G10L 15/063 (2013.01); G10L 15/16 (2013.01); G10L 15/26 (2013.01);
Abstract

A method includes obtaining a plurality of training data sets, each associated with a respective native language and including a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script, and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample. The method also includes training, using the normalized training data samples, a multilingual end-to-end speech recognition model to predict speech recognition results in the target script for corresponding speech utterances spoken in any of the different native languages associated with the plurality of training data sets.
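
The data-normalization step described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `TrainingSample` type, the `transliterate` and `normalize` functions, and the tiny character tables are hypothetical. A per-character lookup merely stands in for whatever transliteration method the patent actually uses, and the model-training step is omitted entirely.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class TrainingSample:
    """One audio clip paired with its transcription (hypothetical type)."""
    audio: bytes          # placeholder for the audio of the utterance
    transcription: str    # text in the native script, or in the target script once normalized
    language: str         # code of the native language spoken in the audio


# Toy per-character tables mapping a native script to Latin, the assumed target script.
# These are illustrative only; they cover just the letters used in the example below.
TOY_TABLES: Dict[str, Dict[str, str]] = {
    "ru": {"д": "d", "а": "a", "н": "n", "е": "e", "т": "t"},
    "el": {"ν": "n", "α": "a", "ι": "i"},
}


def transliterate(text: str, language: str) -> str:
    """Map each character to the target script; unknown characters pass through."""
    table = TOY_TABLES[language]
    return "".join(table.get(ch, ch) for ch in text)


def normalize(data_sets: Dict[str, List[TrainingSample]]) -> List[TrainingSample]:
    """Pair each sample's original audio with its transliterated transcription."""
    normalized: List[TrainingSample] = []
    for language, samples in data_sets.items():
        for sample in samples:
            normalized.append(
                TrainingSample(
                    audio=sample.audio,  # the audio stays in the native language
                    transcription=transliterate(sample.transcription, language),
                    language=language,
                )
            )
    return normalized


data_sets = {
    "ru": [TrainingSample(b"\x00", "да", "ru")],
    "el": [TrainingSample(b"\x01", "ναι", "el")],
}
normalized_samples = normalize(data_sets)
```

In this sketch, all normalized samples share one script, so a single multilingual model could in principle be trained on their combined output vocabulary; how the patented method performs that training is not shown here.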

