The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
May. 16, 2023
Filed:
Sep. 09, 2020
Oracle International Corporation, Redwood Shores, CA (US);
Vishal Vishnoi, Redwood City, CA (US);
Mark Edward Johnson, Sydney, AU;
Elias Luqman Jalaluddin, Seattle, WA (US);
Balakota Srinivas Vinnakota, Sunnyvale, CA (US);
Thanh Long Duong, Seabrook, AU;
Gautam Singaraju, Dublin, CA (US);
ORACLE INTERNATIONAL CORPORATION, Redwood Shores, CA (US);
Abstract
Techniques for stop word data augmentation for training chatbot systems in natural language processing. In one particular aspect, a computer-implemented method includes receiving a training set of utterances for training an intent classifier to identify one or more intents for one or more utterances; augmenting the training set of utterances with stop words to generate an augmented training set of out-of-domain utterances for an unresolved intent category corresponding to an unresolved intent; and training the intent classifier using the training set of utterances and the augmented training set of out-of-domain utterances. The augmenting includes: selecting one or more utterances from the training set of utterances, and for each selected utterance, preserving existing stop words within the utterance and replacing at least one non-stop word within the utterance with a stop word or stop word phrase selected from a list of stop words to generate an out-of-domain utterance.