The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 26, 2024

Filed:

Oct. 28, 2021
Applicant:

Oracle International Corporation, Redwood Shores, CA (US);

Inventors:

Elias Luqman Jalaluddin, Seattle, WA (US);

Vishal Vishnoi, Redwood Shores, CA (US);

Thanh Long Duong, Seabrook, AU;

Mark Edward Johnson, Castle Cove, AU;

Poorya Zaremoodi, Melbourne, AU;

Gautam Singaraju, Dublin, CA (US);

Ying Xu, Albion, AU;

Vladislav Blinov, Melbourne, AU;

Assignee:

Oracle International Corporation, Redwood Shores, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 40/279 (2020.01); G06F 40/35 (2020.01); G06N 20/00 (2019.01); H04L 51/02 (2022.01); G06F 40/205 (2020.01); G06F 40/284 (2020.01); G06F 40/289 (2020.01);
U.S. Cl.
CPC ...
G06F 40/279 (2020.01); G06F 40/35 (2020.01); G06N 20/00 (2019.01); H04L 51/02 (2013.01); G06F 40/205 (2020.01); G06F 40/284 (2020.01); G06F 40/289 (2020.01);
Abstract

Techniques for keyword data augmentation for training chatbot systems in natural language processing. In one particular aspect, a method is provided that includes receiving a training set of utterances for training a machine-learning model to identify one or more intents for one or more utterances, augmenting the training set of utterances with out-of-domain (OOD) examples. The augmenting includes: identifying keywords within utterances of the training set of utterances, generating a set of OOD examples with the identified keywords, filtering out OOD examples from the set of OOD examples that have a context substantially similar to context of the utterances of the training set of utterances, and incorporating the set of OOD examples without the filtered OOD examples into the training set of utterances to generate an augmented training set of utterances. Thereafter, the machine-learning model is trained using the augmented training set of utterances.


Find Patent Forward Citations

Loading…