The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, issue date, filing date, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Aug. 21, 2012

Filed:
Apr. 01, 2010

Applicants:

Jon Mark Holdman, Wheat Ridge, CO (US);

Robert Michael Raymond, Boulder, CO (US);

Atiq Ahamad, Superior, CO (US);

John Richard Kostraba, Jr., Broomfield, CO (US);

Carl T. Madison, Jr., Windsor, CO (US);

Inventors:

Jon Mark Holdman, Wheat Ridge, CO (US);

Robert Michael Raymond, Boulder, CO (US);

Atiq Ahamad, Superior, CO (US);

John Richard Kostraba, Jr., Broomfield, CO (US);

Carl T. Madison, Jr., Windsor, CO (US);

Assignee:

Oracle International Corporation, Redwood City, CA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 13/00 (2006.01);
U.S. Cl.
CPC ...
Abstract

A data deduplication method using a small hash digest dictionary in fast-access memory. The method includes receiving customer data, dividing the data into smaller chunks, and assigning a hash value to each chunk. For each chunk, the method includes performing a lookup for a duplicate chunk by accessing a small dictionary in memory with the chunk's hash value. When no entry is found, the small dictionary is updated to include the hash value, so that the dictionary fills with the earliest received data. When an entry is found, the entry's hash value is compared with the lookup value; if they match, reference data is returned and an entry counter is incremented. If they do not match, additional accesses are attempted, such as with additional indexes calculated using the hash value. Collisions may trigger an entry replacement, so that some initially entered entries are replaced when they are determined not to be the most repeated values, for example based on their counter value.
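The lookup flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation; it only follows the steps the abstract names. The class and function names (`SmallDictionary`, `deduplicate`), the fixed-size chunking, the use of SHA-256, the way extra indexes are taken from slices of the digest, and the eviction threshold are all assumptions made for illustration.

```python
import hashlib
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Entry:
    digest: bytes   # full hash value of the chunk
    ref: int        # reference to the stored chunk (hypothetical: index into a list)
    count: int = 0  # number of duplicate hits recorded for this entry


class SmallDictionary:
    """Fixed-size hash digest dictionary (illustrative assumption, not the patent's layout)."""

    def __init__(self, slots: int = 8, probes: int = 3, evict_below: int = 1):
        self.table: List[Optional[Entry]] = [None] * slots
        self.probes = probes            # how many indexes to try per lookup
        self.evict_below = evict_below  # assumed threshold for replacing an entry

    def _indexes(self, digest: bytes) -> List[int]:
        # Assumption: derive each candidate index from a different 4-byte slice of the digest.
        n = len(self.table)
        return [int.from_bytes(digest[4 * i:4 * i + 4], "big") % n
                for i in range(self.probes)]

    def lookup_or_insert(self, digest: bytes, ref: int) -> Optional[int]:
        """Return the stored reference for a duplicate digest, else None."""
        candidates = self._indexes(digest)
        for i in candidates:
            entry = self.table[i]
            if entry is None:
                # No entry: record the hash so the table fills with the earliest data.
                self.table[i] = Entry(digest, ref)
                return None
            if entry.digest == digest:
                # Match: count the repeat and return the reference data.
                entry.count += 1
                return entry.ref
        # Every probed slot holds a different digest (collision). Replace the
        # least-repeated probed entry, but only if it has rarely repeated.
        victim = min(candidates, key=lambda i: self.table[i].count)
        if self.table[victim].count <= self.evict_below:
            self.table[victim] = Entry(digest, ref)
        return None


def deduplicate(data: bytes, chunk_size: int = 4,
                dictionary: Optional[SmallDictionary] = None) -> Tuple[List[bytes], List[int]]:
    """Split data into fixed-size chunks; return (stored chunks, list of references)."""
    if dictionary is None:
        dictionary = SmallDictionary()
    store: List[bytes] = []
    recipe: List[int] = []
    for off in range(0, len(data), chunk_size):
        chunk = data[off:off + chunk_size]
        digest = hashlib.sha256(chunk).digest()
        ref = dictionary.lookup_or_insert(digest, len(store))
        if ref is None:
            # Not recognized as a duplicate: store the chunk itself.
            ref = len(store)
            store.append(chunk)
        recipe.append(ref)
    return store, recipe
```

In this sketch, calling `deduplicate(b"ABCD" * 5 + b"WXYZ")` stores the repeated chunk once and references it five times. Because the dictionary is deliberately small, a chunk that loses a collision may be stored again later; the original data can still be rebuilt by joining `store[r]` for each `r` in the returned reference list.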

