The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

May 8, 2012

Filed:

May 19, 2006

Applicants:

Jeffrey A. Dean, Palo Alto, CA (US);

Sanjay Ghemawat, Mountain View, CA (US);

Gautham Thambidorai, Sunnyvale, CA (US);

Inventors:

Jeffrey A. Dean, Palo Alto, CA (US);

Sanjay Ghemawat, Mountain View, CA (US);

Gautham Thambidorai, Sunnyvale, CA (US);

Assignee:

Google Inc., Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2006.01);
U.S. Cl.
CPC ...
Abstract

A set of documents may be stored and indexed as a compressed sequence of tokens. The documents are grouped into clusters. Sequences of tokens representing the clusters of documents are encoded to elide some repeating instances of tokens. A compressed sequence of tokens is generated from the compressed cluster sequences of tokens. Queries on the compressed sequence are performed by identifying cluster sequences within the compressed sequence that are likely to have documents that satisfy the query and then identifying, within these identified clusters, the documents that actually satisfy the query.
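
The two-stage query described in the abstract can be illustrated with a minimal Python sketch. It is not the patent's actual encoding: it assumes whitespace tokenization, assumes each cluster stores every distinct token only once in a shared vocabulary (a simple stand-in for eliding repeated tokens), and treats a query as a set of terms that must all appear in a document. The names `tokenize`, `build_index`, and `query` are hypothetical.

```python
def tokenize(text):
    # Assumption: lowercase, whitespace-separated tokens.
    return text.lower().split()


def build_index(clusters):
    """Build a per-cluster index.

    clusters: list of dicts mapping document name -> document text.
    Returns a list of (vocab, doc_ids) pairs, one per cluster, where
    vocab maps each distinct token to a small integer id (stored once
    per cluster) and doc_ids maps each document name to the set of
    token ids it contains.
    """
    index = []
    for docs in clusters:
        vocab = {}
        doc_ids = {}
        for name, text in docs.items():
            doc_ids[name] = {
                vocab.setdefault(tok, len(vocab)) for tok in tokenize(text)
            }
        index.append((vocab, doc_ids))
    return index


def query(index, terms):
    """Return the sorted names of documents containing every query term."""
    terms = [t.lower() for t in terms]
    hits = []
    for vocab, doc_ids in index:
        # Stage 1: a cluster is a candidate only if its vocabulary
        # contains every query term.
        if not all(t in vocab for t in terms):
            continue
        wanted = {vocab[t] for t in terms}
        # Stage 2: within a candidate cluster, keep only the documents
        # that actually contain all of the terms.
        hits.extend(name for name, ids in doc_ids.items() if wanted <= ids)
    return sorted(hits)


clusters = [
    {"a": "red apple pie", "b": "green apple tart"},
    {"c": "blue whale song", "d": "red whale"},
]
index = build_index(clusters)
print(query(index, ["red", "apple"]))  # ['a']
print(query(index, ["green", "pie"]))  # []
```

The second query shows why both stages are needed in this sketch: the first cluster passes the vocabulary check because "green" and "pie" both occur somewhere in it, yet no single document contains both terms, so the per-document check returns nothing.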

