The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The badge also links to the full patent document (in PDF format).

Date of Patent:
Sep. 10, 2024

Filed:

Mar. 23, 2022

Applicant:

Google LLC, Mountain View, CA (US);

Inventors:

Bhuvana Ramabhadran, Mt. Kisco, NY (US);

Hainan Xu, Mountain View, CA (US);

Kartik Audhkhasi, Mountain View, CA (US);

Yinghui Huang, Mountain View, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G10L 15/02 (2006.01); G06F 40/284 (2020.01); G06N 3/04 (2023.01); G10L 15/04 (2013.01); G10L 15/06 (2013.01); G10L 15/16 (2006.01); G10L 25/30 (2013.01);
U.S. Cl.
CPC ...
G10L 15/04 (2013.01); G06F 40/284 (2020.01); G06N 3/04 (2013.01); G10L 15/063 (2013.01); G10L 15/16 (2013.01); G10L 25/30 (2013.01); G10L 15/02 (2013.01);
Abstract

A method for subword segmentation includes receiving an input word to be segmented into a plurality of subword units. The method also includes executing a subword segmentation routine to segment the input word into a plurality of subword units by accessing a trained vocabulary set of subword units and selecting the plurality of subword units from the input word by greedily finding a longest subword unit from the input word that is present in the trained vocabulary set until an end of the input word is reached.
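
The greedy longest-match selection described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the function name `segment_word`, the `unk` placeholder, and the fallback for characters with no matching unit are all assumptions made for illustration, and the abstract does not say how out-of-vocabulary characters are handled.

```python
def segment_word(word, vocab, unk="<unk>"):
    """Greedily split `word` into the longest subword units found in `vocab`.

    `vocab` is a set of subword strings standing in for the trained
    vocabulary set. `unk` is a hypothetical placeholder, not from the patent.
    """
    # Longest unit in the vocabulary bounds how far ahead we need to look.
    max_len = max((len(unit) for unit in vocab), default=0)
    pieces = []
    start = 0
    while start < len(word):
        # Try the longest candidate first and shrink until it is in the vocabulary.
        end = min(len(word), start + max_len)
        while end > start and word[start:end] not in vocab:
            end -= 1
        if end == start:
            # Assumption: no unit starts here, so emit a placeholder
            # for this one character and move on.
            pieces.append(unk)
            start += 1
        else:
            pieces.append(word[start:end])
            start = end
    return pieces


# Toy vocabulary, invented for this example.
toy_vocab = {"un", "believ", "able", "a", "b", "e", "l", "u"}
print(segment_word("unbelievable", toy_vocab))  # ['un', 'believ', 'able']
```

Because each step commits to the longest match at the current position, the loop makes a single left-to-right pass and stops when the end of the input word is reached, as the abstract describes.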