The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 17, 2022

Filed:

Jan. 10, 2020
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Paul R. Bastide, Ashland, MA (US);

Aris Gkoulalas-Divanis, Waltham, MA (US);

Rohit Ranchal, Austin, TX (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/20 (2019.01); G06F 3/06 (2006.01); G06F 16/242 (2019.01); G06F 16/21 (2019.01); G06F 16/2457 (2019.01);
U.S. Cl.
CPC ...
G06F 3/0644 (2013.01); G06F 3/064 (2013.01); G06F 3/0622 (2013.01); G06F 3/0673 (2013.01); G06F 16/219 (2019.01); G06F 16/244 (2019.01); G06F 16/24578 (2019.01);
Abstract

One embodiment of the invention provides a method for data lineage and data provenance enhancement. The method comprises arranging a data set into a logical ordering, and partitioning the data set into at least one set of partitions based on the logical ordering. The method further comprises, for each partition of the at least one set of partitions, determining a corresponding score for the partition, and determining a data similarity between the partition and each other partition of each other data set based on the corresponding score for the partition and another score corresponding to the other partition. The method further comprises determining data lineage of the data set based on each data similarity determined.


Find Patent Forward Citations

Loading…