The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 14, 2025

Filed:

Mar. 15, 2023
Applicant:

Paypal, Inc., San Jose, CA (US);

Inventors:

Sandro Cavallari, Tanjong Pagar, SG;

Yuzhen Zhuo, Tiong Bahru, SG;

Van Hoang Nguyen, Clementi New Town, SG;

Quan Jin Ferdinand Tang, Tanglin, SG;

Gautam Vasappanavara, Fremont, CA (US);

Assignee:

PAYPAL, INC., San Jose, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 40/00 (2020.01); G06F 40/205 (2020.01); G06F 40/253 (2020.01); G06F 40/284 (2020.01); G10L 15/19 (2013.01); G10L 15/26 (2006.01);
U.S. Cl.
CPC ...
G10L 15/19 (2013.01); G06F 40/205 (2020.01); G06F 40/253 (2020.01); G06F 40/284 (2020.01); G10L 15/26 (2013.01);
Abstract

Methods and systems are presented for translating informal utterances into formal texts. Informal utterances may include words in abbreviation forms or typographical errors. The informal utterances may be processed by mapping each word in an utterance into a well-defined token. The mapping from the words to the tokens may be based on a context associated with the utterance derived by analyzing the utterance in a character-by-character basis. The token that is mapped for each word can be one of a vocabulary token that corresponds to a formal word in a pre-defined word corpus, an unknown token that corresponds to an unknown word, or a masked token. Formal text may then be generated based on the mapped tokens. Through the processing of informal utterances using the techniques disclosed herein, the informal utterances are both normalized and sanitized.


Find Patent Forward Citations

Loading…