The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 20, 2021

Filed:

Jan. 24, 2017
Applicant:

Oath (Americas) Inc., Dulles, VA (US);

Inventor:

Jason Jinshui Qin, Great Falls, VA (US);

Assignee:

Verizon Media Inc., New York, NY (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/22 (2019.01);
U.S. Cl.
CPC ...
G06F 16/2255 (2019.01);
Abstract

Systems and methods are disclosed for optimizing full-spectrum cardinality approximations on big data by exploiting an underlying relationship between LogLog counting estimation techniques and order statistics-based estimation techniques. To accomplish the foregoing, a multiset of objects that each corresponds to one of a plurality of objects associated with a resource are obtained by a computing device. A compound data object is populated by the computing device with data that is derived based on generated hash values that correspond to each object in the obtained multiset. The populated compound data object is processed utilizing a processor with a full-spectrum unified estimation operation that can accurately determine a cardinality estimate for the obtained multiset, utilizing considerably less resources when compared to traditional and state of the art techniques. The determination is made by the computing device without the need to employ linear counting for low cardinalities, bias correction operations, or angular correction terms, all while offering decreased memory usage, simpler implementation, improved performance, and comparable or improved accuracy. An estimated number of unique objects in the obtained multiset can be determined by the computing device, and subsequently provided for display, communication to another computing device, or further manipulation.


Find Patent Forward Citations

Loading…