The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jul. 05, 2022
Filed:
Dec. 10, 2019
International Business Machines Corporation, Armonk, NY (US);
Michael Desmond, White Plains, NY (US);
Matthew Arnold, Ridgefield Park, NJ (US);
Jeffrey Scott Boston, Wappingers Falls, NY (US);
INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US);
Abstract
Methods, systems and computer program products for improving ground truth quality for modeling are provided. Aspects include receiving a plurality of data inputs, wherein each of the plurality of data inputs has an associated label. Aspects also include training a model based on the plurality of data inputs. Aspects also include generating a plurality of vector representations corresponding to the plurality of data inputs based on the model. Aspects also include clustering the plurality of vector representations into one or more clusters. Aspects also include identifying at least one anomalous data input based on the one or more clusters. The at least one anomalous data input can be a data input of the plurality of data inputs that is mislabeled, contributes to an ambiguous class structure or is an outlier. Aspects also include outputting a notification that provides an indication of the at least one anomalous data input.