The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 17, 2020

Filed:

Jul. 26, 2017
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Giovanni Cherubini, Rueschlikon, CH;

Mark A. Lantz, Thalwil, CH;

Taras Lehinevych, Kyiv, UA;

Vinodh Venkatesan, Waedenswil, CH;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/30 (2019.01); G06F 3/06 (2006.01);
U.S. Cl.
CPC ...
G06F 3/064 (2013.01); G06F 3/0604 (2013.01); G06F 3/067 (2013.01); G06F 3/0611 (2013.01); G06F 3/0631 (2013.01); G06F 3/0649 (2013.01); G06F 3/0685 (2013.01);
Abstract

The invention is notably directed to computer-implemented methods and systems for managing datasets in a storage system. In such systems, it is assumed that a (typically small) subset of datasets are labeled with respect to their relevance, so as to be associated with respective relevance values. Essentially, the present methods determine, for each unlabeled dataset of the datasets, a respective probability distribution over a set of relevance values. From this probability distribution, a corresponding relevance value can be obtained. This probability distribution is computed based on distances (or similarities), in terms of metadata values, between said each unlabeled dataset and the labeled datasets. Based on their associated relevance values, datasets can then be efficiently managed in a storage system.


Find Patent Forward Citations

Loading…