The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 21, 2025

Filed:

Aug. 17, 2021
Applicant:

Emc Ip Holding Company Llc, Hopkinton, MA (US);

Inventors:

Tony Tzeming Wong, Milpitas, CA (US);

Smriti Thakkar, San Jose, CA (US);

Assignee:

EMC IP Holding Company LLC, Hopkinton, MA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 16/11 (2018.12); G06F 16/13 (2018.12); G06F 16/906 (2018.12);
U.S. Cl.
CPC ...
G06F 16/122 (2018.12); G06F 16/137 (2018.12);
Abstract

A system partitions files, including segments identified by fingerprints. into clusters. The system counts common fingerprints by counting fingerprints which correspond to both a file cluster and another file cluster. The system counts unique fingerprints by counting fingerprints which correspond to the file cluster and/or the other file cluster. The system uses the common and unique fingerprint counts to approximate the distance between the file cluster and the other file cluster. The system identifies the smallest of distances which are approximated between all file clusters. The system merges the file cluster and the other file cluster into a merged file cluster if the approximated distance is the smallest of distances. The system determines an index corresponding to the smallest and next smallest of distances. The system determines indexes which correspond to merges of all file clusters. The system uses the maximum of indexes to identify the optimal file clustering.


Find Patent Forward Citations

Loading…