The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 09, 2023

Filed:

Jan. 28, 2020
Applicant:

Microsoft Technology Licensing, Llc;

Inventors:

Nishant Velagapudi, Seattle, WA (US);

Zhengwen Zhu, Redmond, WA (US);

Venkatasatya Premnath Ayyalasomayajula, Issaquah, WA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 40/20 (2020.01); G06N 20/00 (2019.01); G06N 3/08 (2006.01); G06N 3/04 (2023.01);
U.S. Cl.
CPC ...
G06F 40/20 (2020.01); G06N 3/0445 (2013.01);
Abstract

Techniques performed by a data processing system for analyzing training data for a machine learning model and identifying outliers in the training data herein include obtaining training data for the model from a memory of the data processing system; analyzing the training data using a Siamese Neural Network to determine within-label similarities and cross-label similarities associated with a plurality of data elements within the training data, the within-label representing similarities between a respective data element and a first set of data elements similarly labeled in the training data, the cross-label similarities representing similarities between the respective data element and a second set of data elements dissimilarly labeled in the training data; identifying outlier data elements in the plurality of data elements based on the within-label and cross-label similarities; and processing the training data comprising the outlier data elements. Processing may include deleting the outlier data elements or generating a report.


Find Patent Forward Citations

Loading…