The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Jun. 09, 2020
Filed:
Nov. 20, 2019
Applicant:
Clinc, Inc., Ann Arbor, MI (US);
Inventors:
Stefan Larson, Ann Arbor, MI (US);
Anish Mahendran, Ann Arbor, MI (US);
Andrew Lee, Ann Arbor, MI (US);
Jonathan K. Kummerfeld, Ann Arbor, MI (US);
Parker Hill, Ann Arbor, MI (US);
Michael A. Laurenzano, Ann Arbor, MI (US);
Johann Hauswald, Ann Arbor, MI (US);
Lingjia Tang, Ann Arbor, MI (US);
Jason Mars, Ann Arbor, MI (US);
Assignee:
Clinc, Inc., Ann Arbor, MI (US);
Abstract
A system and method for improving a machine learning-based dialogue system includes: sourcing a corpus of raw machine learning training data from sources of training data based on a plurality of seed training samples, wherein the corpus of raw machine learning training data comprises a plurality of distinct instances of training data; generating a vector representation for each distinct instance of training data; identifying statistical characteristics of the corpus of raw machine learning training data based on a mapping of the vector representation for each distinct instance of training data; identifying anomalous instances of the plurality of distinct instances of training data of the corpus of raw machine learning training data based on the identified statistical characteristics of the corpus; and curating the corpus of raw machine learning training data based on each of the instances of training data identified as anomalous instances.
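The vectorize, characterize, flag, and curate steps named in the abstract can be illustrated with a minimal sketch. The abstract does not say which vector representation, statistic, or anomaly criterion the patent uses, so everything concrete here is an assumption chosen for illustration: a bag-of-words count vector, distance from the corpus centroid as the statistical characteristic, and a z-score cutoff. The names `vectorize`, `centroid`, `curate`, and `z_threshold` are hypothetical. The sourcing step (collecting raw data from seed samples) is omitted, and the corpus is assumed to be non-empty.

```python
import math
from collections import Counter


def vectorize(text, vocabulary):
    """Assumed embedding: bag-of-words counts over a fixed vocabulary."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocabulary]


def centroid(vectors):
    """Mean vector of the corpus, used here as its statistical summary."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def curate(corpus, z_threshold=2.0):
    """Split a corpus into (kept, anomalous) lists.

    An instance is treated as anomalous when its distance from the
    centroid lies more than z_threshold population standard deviations
    above the mean distance. This criterion is an assumption, not the
    patent's stated method.
    """
    vocabulary = sorted({w for text in corpus for w in text.lower().split()})
    vectors = [vectorize(text, vocabulary) for text in corpus]
    center = centroid(vectors)
    distances = [euclidean(v, center) for v in vectors]

    mean = sum(distances) / len(distances)
    variance = sum((d - mean) ** 2 for d in distances) / len(distances)
    std = math.sqrt(variance)

    kept, anomalous = [], []
    for text, distance in zip(corpus, distances):
        z = (distance - mean) / std if std > 0 else 0.0
        (anomalous if z > z_threshold else kept).append(text)
    return kept, anomalous
```

Calling `curate(list_of_utterances)` returns the retained instances and the flagged ones as two lists. With a population standard deviation, a single outlier among n instances can reach a z-score of at most the square root of n - 1, so a cutoff of 2.0 can only flag anything in corpora of at least six instances; a production system would more likely use learned sentence embeddings and a more robust outlier test.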