The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 26, 2023

Filed:

May. 13, 2020
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Shaikh Shahriar Quader, Scarborough, CA;

Mona Nashaat Ali Elmowafy, Edmonton, CA;

Darrell Christopher Reimer, Tarrytown, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06N 5/04 (2023.01); G06N 20/20 (2019.01); G06N 7/01 (2023.01);
U.S. Cl.
CPC ...
G06N 5/04 (2013.01); G06N 20/20 (2019.01); G06N 7/01 (2023.01);
Abstract

Noisy labeled and unlabeled datapoint detection and rectification in a training dataset for machine-learning is facilitated by a processor(s) obtaining a training dataset for use in training a machine-learning model. The processor(s) applies ensemble machine-learning and a generative model to the training dataset to detect noisy labeled datapoints in the training dataset, and create a clean dataset with preliminary labels added for any unlabeled datapoints in the training dataset. Data-driven active learning and the clean dataset are used by the processor(s) to facilitate generating an active-learned dataset with true labels added for one or more selected datapoints of a datapoint pool including the detected noisy labeled datapoints and the unlabeled datapoints of the training dataset. The machine-learning model is trained by the processor(s) using, at least in part, the clean dataset and the active-learned dataset.


Find Patent Forward Citations

Loading…