The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Dec. 29, 2015

Filed:
Sep. 24, 2013

Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Ching-Yung Lin, Scarsdale, NY (US);

Wan-Yi Lin, White Plains, NY (US);

Yinglong Xia, Rye Brook, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 15/18 (2006.01); G06N 99/00 (2010.01);
U.S. Cl.
CPC ...
G06N 99/005 (2013.01);
Abstract

Injecting generated data samples into a minority data class of an imbalanced training data set is provided. In response to receiving an input to balance the imbalanced training data set that includes a majority data class and the minority data class, a set of data samples is generated for the minority data class. A distance is calculated from each data sample in the set of generated data samples to a center of a kernel that includes a set of data samples of the majority data class. Each data sample in the set of generated data samples is stored within a corresponding distance score bucket based on the calculated distance of a data sample. Generated data samples are selected from a number of highest ranking distance score buckets. The generated data samples selected from the number of highest ranking distance score buckets are injected into the minority data class.
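
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions that the abstract does not specify: candidates are generated by interpolating between random pairs of minority samples, the "center of a kernel" is taken to be the plain centroid of the majority samples, buckets are equal-width ranges of Euclidean distance, and "highest ranking" is read as farthest from the majority center. All function and parameter names (`balance`, `bucket_by_distance`, `top_buckets`, `oversample`, and so on) are hypothetical.

```python
import math
import random


def centroid(points):
    """Mean point of a non-empty list of equal-length coordinate tuples."""
    dims = len(points[0])
    return tuple(sum(p[d] for p in points) / len(points) for d in range(dims))


def generate_candidates(minority, count, rng):
    """Assumed generator: interpolate between two random minority samples."""
    candidates = []
    for _ in range(count):
        a, b = rng.choice(minority), rng.choice(minority)
        t = rng.random()
        candidates.append(tuple(x + t * (y - x) for x, y in zip(a, b)))
    return candidates


def bucket_by_distance(candidates, center, num_buckets):
    """Place each candidate in an equal-width bucket by distance to center.

    The last bucket holds the candidates farthest from the center.
    """
    distances = [math.dist(c, center) for c in candidates]
    lo, hi = min(distances), max(distances)
    span = hi - lo
    buckets = [[] for _ in range(num_buckets)]
    for cand, dist in zip(candidates, distances):
        # If every distance is equal, treat all candidates as top ranked.
        frac = (dist - lo) / span if span else 1.0
        index = min(int(frac * num_buckets), num_buckets - 1)
        buckets[index].append(cand)
    return buckets


def balance(majority, minority, num_buckets=5, top_buckets=2,
            oversample=3, seed=0):
    """Return the minority class with selected generated samples injected.

    May return fewer samples than the majority class if the top buckets
    do not hold enough candidates.
    """
    rng = random.Random(seed)
    needed = len(majority) - len(minority)
    if needed <= 0:
        return list(minority)
    candidates = generate_candidates(minority, needed * oversample, rng)
    buckets = bucket_by_distance(candidates, centroid(majority), num_buckets)
    # Assumption: "highest ranking" means farthest from the majority center.
    selected = [c for bucket in reversed(buckets[-top_buckets:]) for c in bucket]
    return list(minority) + selected[:needed]
```

Calling `balance(majority, minority)` with two lists of coordinate tuples returns the original minority samples followed by the injected ones. Generating more candidates than needed (`oversample`) gives the bucket selection something to filter; a different ranking direction or kernel definition would change only `bucket_by_distance` and the selection line.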

