The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 01, 2022

Filed:

Jul. 23, 2018
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Seyed Vahab Mirrokni Banadaki, Hoboken, NJ (US);

Hossein Esfandiari, Adelphi, MD (US);

MohammadHossein Bateni, South Orange, NJ (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 7/00 (2006.01); G06F 16/34 (2019.01); G06F 16/23 (2019.01); G06F 16/24 (2019.01); G06N 7/00 (2006.01); H04L 9/06 (2006.01); G06F 17/10 (2006.01); G06F 9/448 (2018.01); G06N 20/00 (2019.01); G06F 8/30 (2018.01); H04L 9/32 (2006.01); G06N 5/02 (2006.01); G06N 5/00 (2006.01);
U.S. Cl.
CPC ...
G06N 7/00 (2013.01); G06F 8/31 (2013.01); G06F 9/448 (2018.02); G06F 17/10 (2013.01); G06N 5/003 (2013.01); G06N 5/022 (2013.01); G06N 20/00 (2019.01); H04L 9/0643 (2013.01); H04L 9/3239 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing large datasets using a computationally-efficient representation are disclosed. A request to apply a coverage algorithm to a large input dataset is received. The large dataset includes sets of elements. A computationally-efficient representation of the large dataset is generated by generating a reduced set of elements that contains fewer elements based on a defined probability. For each element in the reduced set, a determination is made regarding whether the element appears in more than a threshold number of sets. When the element appears in more than the threshold number, the element is removed from sets until the element appears in only the threshold number. The coverage algorithm is then applied to the computationally-efficient representation to identify a subset of the sets. The system provides data identifying the subset of the sets in response to the received request.


Find Patent Forward Citations

Loading…