The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 28, 2017

Filed:

Aug. 02, 2016
Applicant:

Google Inc., Mountain View, CA (US);

Inventors:

Robert C. Pike, Menlo Park, CA (US);

Sean Quinlan, Menlo Park, CA (US);

Sean M. Dorward, Martinsville, NJ (US);

Jeffrey Dean, Palo Alto, CA (US);

Sanjay Ghemawat, Mountain View, CA (US);

Assignee:

GOOGLE INC., Mountain View, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 17/30 (2006.01); G06F 11/14 (2006.01);
U.S. Cl.
CPC ...
G06F 17/30501 (2013.01); G06F 11/1482 (2013.01); G06F 17/30545 (2013.01); G06F 17/30598 (2013.01); Y10S 707/99933 (2013.01); Y10S 707/99937 (2013.01);
Abstract

A method processes data records. The method partitions the data records into groups and assigns each group to a respective process of a first plurality of processes, which execute in parallel. For each group, the assigned process extracts information from the data records, applies a script with information processing commands applied sequentially to produce intermediate values, stores the intermediate values in a respective intermediate data structure, and updates the status of the group to indicate completion. When the predefined threshold percentage of the data records are completed, the process assigns each group to a respective second process as a backup. When each of the groups has been completed by at least one process (either the original or the backup), the method executes a second plurality of processes to aggregate intermediate values from the intermediate data structures to produce output data. The aggregation includes intermediate values only once for each group.


Find Patent Forward Citations

Loading…