The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 05, 2021

Filed:

Jul. 27, 2016
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Md Faisal M. Chowdhury, Corona, NY (US);

Sarthak Dash, Jersey City, NJ (US);

Alfio M. Gliozzo, Brooklyn, NY (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 20/10 (2019.01); G06N 20/00 (2019.01); G06F 16/951 (2019.01); G06F 16/2452 (2019.01); G06N 5/04 (2006.01); G06F 40/30 (2020.01);
U.S. Cl.
CPC ...
G06N 20/10 (2019.01); G06F 16/24522 (2019.01); G06F 16/951 (2019.01); G06F 40/30 (2020.01); G06N 5/04 (2013.01); G06N 20/00 (2019.01);
Abstract

A method, system and computer-usable medium are disclosed for reducing labeled data imbalances when training an active learning system. The ratio of instances having positive labels or negative labels in a collection of labeled instances associated with an input category used for learning is determined. A first instance for annotation is selected from a collection of unlabeled instances if a first threshold for negative instances, and a first threshold confidence level of being a positive instance of the input category, have been met. A second instance for annotation is selected if a second threshold for positive instances, and a second threshold confidence level of being a negative instance of the input category, have been met. The first and second instances are respectively annotated with a positive and negative label and added to the collection of labeled instances, which are then used for training.


Find Patent Forward Citations

Loading…