The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 27, 2022

Filed:

Sep. 09, 2020
Applicant:

Oracle International Corporation, Redwood Shores, CA (US);

Inventors:

Elias Luqman Jalaluddin, Seattle, WA (US);

Vishal Vishnoi, Redwood City, CA (US);

Mark Edward Johnson, Castle Cove, AU;

Thanh Long Duong, Seabrook, AU;

Yu-Heng Hong, Carlton, AU;

Balakota Srinivas Vinnakota, Sunnyvale, CA (US);

Assignee:

ORACLE INTERNATIONAL CORPORATION, Redwood Shores, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/22 (2006.01); G10L 15/06 (2013.01); G10L 15/05 (2013.01); G10L 15/18 (2013.01); G10L 15/26 (2006.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/05 (2013.01); G10L 15/18 (2013.01); G10L 15/22 (2013.01); G10L 15/26 (2013.01); G10L 2015/0633 (2013.01); G10L 2015/0638 (2013.01); G10L 2015/227 (2013.01);
Abstract

Techniques for noise data augmentation for training chatbot systems in natural language processing. In one particular aspect, a method is provided that includes receiving a training set of utterances for training an intent classifier to identify one or more intents for one or more utterances; augmenting the training set of utterances with noise text to generate an augmented training set of utterances; and training the intent classifier using the augmented training set of utterances. The augmenting includes: obtaining the noise text from a list of words, a text corpus, a publication, a dictionary, or any combination thereof irrelevant of original text within the utterances of the training set of utterances, and incorporating the noise text within the utterances relative to the original text in the utterances of the training set of utterances at a predefined augmentation ratio to generate augmented utterances.


Find Patent Forward Citations

Loading…