The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 04, 2017

Filed:

Dec. 27, 2013
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Andrey Balmin, San Jose, CA (US);

Vuk Ercegovac, Campbell, CA (US);

Peter J. Haas, San Jose, CA (US);

Liping Peng, Amherst, MA (US);

John Sismanis, San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/30 (2006.01);
U.S. Cl.
CPC ...
G06F 17/30598 (2013.01); G06F 17/3053 (2013.01); G06F 17/30324 (2013.01); G06F 17/30486 (2013.01); G06F 17/30536 (2013.01); G06F 17/30867 (2013.01);
Abstract

Stratified sampling of a plurality of records is performed. A plurality of records are partitioned into a plurality of splits, wherein each split includes at least a portion of the plurality of records. The split of the plurality of splits is provided to a mapper. The mapper assigns at least a portion the records of the at least one split to a group based on a strata of the assigned records, and filters the records of the group based on a comparison of the weights of the records to a local threshold of the mapper. The mapper updates the local threshold of the mapper by communicating with a coordinator. The mapper shuffles the group to a reducer, where the reducer filters the records of the group based on the weights of the records. The reducer provides a stratified sampling of the plurality of records based on the group.


Find Patent Forward Citations

Loading…