The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 18, 2022

Filed:

Jan. 18, 2019
Applicant:

Emc Ip Holding Company Llc, Hopkinton, MA (US);

Inventors:

Tony Wong, Milpitas, CA (US);

Hemanth Satyanarayana, Santa Clara, CA (US);

Abhinav Duggal, Jersey City, NJ (US);

Assignee:

EMC IP Holding Company LLC, Hopkinton, MA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 16/17 (2019.01); G06F 11/14 (2006.01); G06F 16/174 (2019.01);
U.S. Cl.
CPC ...
G06F 11/1453 (2013.01); G06F 16/1748 (2019.01); G06F 2201/84 (2013.01);
Abstract

Embodiments for a mostly unique file selection process for a deduplication backup system are described. The process assigns tags to files. A tag serves as a hint about the similarity of files in a deduplication file system. It is expected that files from the same client machine will be assigned the same tag. The tag is the smallest unit of migration and serves as a hint of the similarity of the files. The MUFS process measures the uniqueness using a u-index that is a function of the total unique size of a tag relative to the total size of the tag. A load balancer then selects the most unique tags for migration to free the maximum space. It uses the u-index to measure the uniqueness percentage of a tag, so that tags with the highest u-index are selected for migration to free up maximum space on the source node.


Find Patent Forward Citations

Loading…