The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Apr. 10, 2018

Filed:

May 31, 2015

Applicant:

VMware, Inc., Palo Alto, CA (US);

Inventors:

Wenguang Wang, Santa Clara, CA (US);

Tian Luo, Fremont, CA (US);

Assignee:

VMware, Inc., Palo Alto, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/30 (2006.01); G06N 7/00 (2006.01); G06F 3/06 (2006.01); G06F 11/14 (2006.01);
U.S. Cl.
CPC ...
G06F 17/30156 (2013.01); G06F 3/0608 (2013.01); G06F 3/0641 (2013.01); G06F 11/1453 (2013.01); G06F 17/3033 (2013.01); G06F 17/30097 (2013.01); G06F 17/30159 (2013.01); G06N 7/005 (2013.01);
Abstract

Examples perform predictive probabilistic deduplication of storage, such as virtualized or physical disks. Incoming input/output (I/O) commands include data, which is written to storage and tracked in a key-value store. The key-value store includes a hash of the data as the key, and a reference counter and the address of the data as the value. When a certain percentage of sampled incoming data is found to be duplicate, it is predicted that the I/O commands have become not unique (e.g., duplicate). Based on the prediction, subsequent incoming data is not written to storage, and instead the reference counter associated with the hash of the data is incremented. In this manner, predictions on the uniqueness of future data are made based on previous data, and extraneous writes and deletions from the chunk store are avoided.
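
The flow described in the abstract can be illustrated with a small Python sketch. This is a toy model under stated assumptions, not the patented implementation: the class name PredictiveDedup, the parameters sample_every, window, and dup_threshold, the choice of SHA-256, and the in-memory list standing in for the chunk store are all hypothetical. The abstract does not say how samples are chosen, what percentage triggers the prediction, what happens to data with no matching hash once duplicates are predicted, or how the prediction is reversed, so the sketch picks simple placeholders for the first three and leaves the last one out.

```python
import hashlib


class PredictiveDedup:
    """Toy model of the flow in the abstract; all tunables are hypothetical."""

    def __init__(self, sample_every=2, window=4, dup_threshold=0.5):
        self.sample_every = sample_every    # inspect every Nth write (assumed)
        self.window = window                # samples per prediction (assumed)
        self.dup_threshold = dup_threshold  # duplicate fraction that flips the prediction (assumed)
        self.store = []                     # simulated chunk store; address = list index
        self.index = {}                     # hash -> [reference counter, address]
        self.predict_duplicate = False
        self.writes_skipped = 0
        self.extraneous_writes = 0
        self._seen = self._sampled = self._sampled_dups = 0

    def write(self, data: bytes) -> int:
        key = hashlib.sha256(data).hexdigest()
        entry = self.index.get(key)

        if self.predict_duplicate and entry is not None:
            # Duplicates are predicted: bump the counter and skip the write.
            entry[0] += 1
            self.writes_skipped += 1
            return entry[1]

        # Otherwise write first, then track the block in the key-value store.
        # (Assumed: data with no matching hash is still written in predicted mode.)
        self.store.append(data)
        address = len(self.store) - 1
        is_dup = entry is not None
        if is_dup:
            entry[0] += 1
            self.extraneous_writes += 1     # the copy just written was redundant
            address = entry[1]
        else:
            self.index[key] = [1, address]

        if not self.predict_duplicate:
            self._sample(is_dup)
        return address

    def _sample(self, is_dup: bool) -> None:
        # Deterministic sampling: only every `sample_every`-th write is counted.
        self._seen += 1
        if self._seen % self.sample_every:
            return
        self._sampled += 1
        self._sampled_dups += is_dup
        if self._sampled == self.window:
            if self._sampled_dups / self.window >= self.dup_threshold:
                self.predict_duplicate = True
            self._sampled = self._sampled_dups = 0
```

With sample_every=1 and window=4, writing the blocks a, b, a, b yields two duplicates out of four samples, which meets the 0.5 threshold and flips predict_duplicate. A further write of a then increments its reference counter without appending to store. For brevity the sketch updates the index inline on every write; the abstract does not specify when that bookkeeping happens.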

