The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 16, 2025

Filed:

Aug. 18, 2022
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Andrew Rosenberg, Brooklyn, NY (US);

Bhuvana Ramabhadran, Mt. Kisco, NY (US);

Yu Zhang, Mountain View, CA (US);

Murali Karthick Baskar, Mountain View, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2013.01); G06N 3/0455 (2023.01); G10L 13/00 (2006.01); G10L 15/02 (2006.01); G10L 15/08 (2006.01); G10L 15/16 (2006.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/02 (2013.01); G10L 15/08 (2013.01); G06N 3/0455 (2023.01);
Abstract

A method of guided data selection for masked speech modeling includes obtaining a sequence of encoded representations corresponding to an utterance. For each respective encoded representation, the method includes processing the respective encoded representation to generate a corresponding probability distribution over possible speech recognition hypotheses and assigning, to the respective encode representation, a confidence score as a highest probability from the corresponding probability distribution over possible speech recognition hypotheses. The method also includes selecting a set of unmasked encoded representations to mask based on the confidence scores assigned to the sequence of encoded representations. The method also includes generating a set of masked encoded representations by masking the selected set of unmasked encoded representations. Here, each masked encoded representation in the set of masked encoded representations corresponds to a respective one of the unmasked encoded representations in the selected set of unmasked encoded representations.


Find Patent Forward Citations

Loading…