The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Sep. 05, 2023

Filed:
May 18, 2020

Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Yannick Saillet, Stuttgart, DE;

Namit Kabra, Hyderabad, IN;

Mike W. Grasselt, Leinfelden-Echterdingen, DE;

Krishna Kishore Bonagiri, Ambajipet, IN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/28 (2019.01); G06F 16/2457 (2019.01); G06F 16/22 (2019.01); G06N 20/00 (2019.01); G06F 16/248 (2019.01); G06F 18/214 (2023.01); G06N 7/01 (2023.01);
U.S. Cl.
CPC ...
G06F 16/285 (2019.01); G06F 16/221 (2019.01); G06F 16/248 (2019.01); G06F 16/24573 (2019.01); G06F 18/214 (2023.01); G06N 7/01 (2023.01); G06N 20/00 (2019.01);
Abstract

A method provides for classifying data fields of a dataset. A classifier configured for determining confidence values for a plurality of data classes for the data fields may be applied. Using the confidence values, data class candidates may be identified. Data fields may be determined for which a plurality of data class candidates is identifiable. Using previous user-selected data class assignments, a probability may be determined for the data class candidates that the respective data class candidate is a data class to which the respective data field is to be assigned. The data fields may be classified using the probabilities to select for the data fields a data class from the data class candidates. The dataset may be provided with metadata identifying for the data fields the data classes to which the respective data fields are assigned.
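
The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how candidates are identified or how the probability is computed, so the confidence threshold, the smoothed frequency count over earlier user choices, and all function and variable names below are assumptions made for illustration only.

```python
from collections import Counter


def select_candidates(confidences, threshold=0.5):
    """Return the data classes whose confidence meets the (assumed) threshold."""
    return [cls for cls, conf in confidences.items() if conf >= threshold]


def candidate_probabilities(candidates, past_assignments):
    """Estimate a probability per candidate from previous user-selected classes.

    Assumption: a simple frequency count with add-one smoothing, so that a
    candidate never chosen before still gets a non-zero probability.
    """
    counts = Counter(c for c in past_assignments if c in candidates)
    total = sum(counts.values()) + len(candidates)
    return {c: (counts[c] + 1) / total for c in candidates}


def classify_fields(field_confidences, past_assignments, threshold=0.5):
    """Build metadata mapping each data field to its selected data class."""
    metadata = {}
    for field, confidences in field_confidences.items():
        candidates = select_candidates(confidences, threshold)
        if not candidates:
            metadata[field] = None  # no class is confident enough
        elif len(candidates) == 1:
            metadata[field] = candidates[0]  # unambiguous field
        else:
            # Several candidates: break the ambiguity with past user choices.
            probs = candidate_probabilities(candidates, past_assignments)
            metadata[field] = max(probs, key=probs.get)
    return metadata


if __name__ == "__main__":
    # Hypothetical classifier output: confidence per data class for each field.
    field_confidences = {
        "col_a": {"US ZIP code": 0.9, "German postal code": 0.8, "Email": 0.05},
        "col_b": {"Email": 0.95, "US ZIP code": 0.1},
    }
    # Hypothetical history of classes users picked for similar fields.
    past_assignments = ["German postal code", "German postal code", "US ZIP code"]
    print(classify_fields(field_confidences, past_assignments))
```

In this sketch only fields with more than one candidate consult the assignment history; fields with a single candidate keep the classifier's choice, and fields with no candidate are left unclassified.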

