The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 18, 2023

Filed:

Feb. 21, 2020
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Sudheesh S. Kairali, Bangalore, IN;

Ankur Tagra, Bangalore, IN;

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 16/00 (2019.01); G06F 16/28 (2019.01); G06F 16/2457 (2019.01); G06F 16/26 (2019.01);
U.S. Cl.
CPC ...
G06F 16/283 (2019.01); G06F 16/24578 (2019.01); G06F 16/26 (2019.01); G06F 2216/03 (2013.01);
Abstract

Embodiments relate to a system, computer program product, and method to merge two or more heterogeneous datasets. Seed attributes of each dataset that is the subject of the merge are identified. The seed attributes are derived from candidate attributes of the respective datasets. A correlation is assessed to create a set of mergeable attributes and a set of non-mergeable attributes. A cohesiveness characteristic is leveraged to iteratively identify one or more attributes from the set of non-mergeable attributes, and to amend the set of mergeable attributes with the one or more attributes identified in the set of non-mergeable attributes. A merged dataset based on the amended set of mergeable attributes and representing non-trivial similarities between the first and second dataset is formed as output.


Find Patent Forward Citations

Loading…