The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jan. 26, 2016
Filed:
Mar. 15, 2013
Efficient calculation of similarity search values and digest block boundaries for data deduplication
Applicant:
International Business Machines Corporation, Armonk, NY (US);
Inventors:
Shay H. Akirav, Petach-Tikva, IL;
Lior Aronovich, Toronto, CA;
Shira Ben-Dor, Givat Shmuel, IL;
Michael Hirsch, Mazkeret Batya, IL;
Ofer Leneman, Kfar Saba, IL;
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION, Armonk, NY (US);
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/30 (2006.01);
U.S. Cl.
CPC ...
G06F 17/30156 (2013.01); G06F 17/30159 (2013.01);
Abstract
For efficient calculation of both similarity search values and boundaries of digest blocks in data deduplication, input data is partitioned into chunks, and for each chunk a set of rolling hash values is calculated. A single linear scan of the rolling hash values is used to produce both similarity search values and boundaries of the digest blocks of the chunk.