The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent: Jun. 26, 2018

Filed: Nov. 30, 2017

Applicant: International Business Machines Corporation, Armonk, NY (US)

Inventors: Shay H. Akirav, Petach-Tikva, IL; Lior Aronovich, Thornhill, CA

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl. CPC: G06F 12/08 (2016.01); G06F 3/06 (2006.01); G06F 17/30 (2006.01); G06F 12/0875 (2016.01); G06F 12/0846 (2016.01)

U.S. Cl. CPC: G06F 17/30159 (2013.01); G06F 3/067 (2013.01); G06F 3/0619 (2013.01); G06F 3/0641 (2013.01); G06F 12/0848 (2013.01); G06F 12/0875 (2013.01)
Abstract

For utilizing a global digests cache in deduplication processing in a data deduplication system using a processor device in a computing environment, input data is partitioned into data chunks and digest values are calculated for each of the data chunks. The positions of similar repository data are found in a repository of data for each of the data chunks. The input digests of the input data are matched with the repository digests contained in the global digests cache for locating data matches. The positions of the similar repository data are used to locate and linearly load into the global digests cache, digests and digest block boundaries of the similar repository data in a sequence corresponding to a placement order of calculated values of the digests of the similar repository data.
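
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions: fixed-size chunks, SHA-256 digests, a small cache with oldest-first eviction, and a similarity search that is stubbed out as a caller-supplied index (similar_index). All names here (Repository, GlobalDigestsCache, deduplicate, CHUNK_SIZE) are hypothetical and chosen only for illustration.

```python
import hashlib
from collections import OrderedDict

CHUNK_SIZE = 8  # hypothetical fixed chunk size; the abstract does not specify one


def chunk(data: bytes, size: int = CHUNK_SIZE):
    """Partition data into fixed-size chunks (an assumption of this sketch)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def digest(block: bytes) -> str:
    """Digest value of one chunk; SHA-256 is an illustrative choice."""
    return hashlib.sha256(block).hexdigest()


class Repository:
    """Toy repository: (position, digest) pairs kept in placement order."""

    def __init__(self, data: bytes):
        positions = range(0, len(data), CHUNK_SIZE)
        self.digests = [(pos, digest(c)) for pos, c in zip(positions, chunk(data))]

    def load_digests(self, start: int, count: int):
        """Return `count` digest entries from entry index `start`, in order."""
        return self.digests[start:start + count]


class GlobalDigestsCache:
    """Bounded digest -> repository-position map, evicting the oldest entry."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.entries = OrderedDict()

    def load(self, entries):
        # Entries are loaded linearly, in the order the repository stores them.
        for pos, d in entries:
            self.entries[d] = pos
            self.entries.move_to_end(d)
            while len(self.entries) > self.capacity:
                self.entries.popitem(last=False)

    def lookup(self, d: str):
        return self.entries.get(d)


def deduplicate(input_data: bytes, repo, cache, similar_index: int, window: int = 4):
    """Match input digests against the cache.

    `similar_index` stands in for the result of a similarity search, which
    this sketch does not implement.
    Returns (input_offset, repository_position_or_None) pairs.
    """
    cache.load(repo.load_digests(similar_index, window))
    offsets = range(0, len(input_data), CHUNK_SIZE)
    return [(off, cache.lookup(digest(c))) for off, c in zip(offsets, chunk(input_data))]


repo = Repository(b"AAAAAAAABBBBBBBBCCCCCCCC")
cache = GlobalDigestsCache(capacity=16)
matches = deduplicate(b"BBBBBBBBXXXXXXXXCCCCCCCC", repo, cache, similar_index=0, window=3)
print(matches)  # [(0, 8), (8, None), (16, 16)]
```

In this toy run, the first and third input chunks are found at repository positions 8 and 16, while the middle chunk has no match and would be stored as new data.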

