The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 18, 2025

Filed:

Sep. 24, 2021
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Bhavna Agrawal, Armonk, NY (US);

Elham Khabiri, Briarcliff Manor, NY (US);

Yingjie Li, Chappaqua, NY (US);

Pranav Girish Sankhe, Buffalo, NY (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 40/18 (2020.01); G06F 40/284 (2020.01); G06N 3/047 (2023.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06F 40/18 (2020.01); G06F 40/284 (2020.01); G06N 3/047 (2023.01); G06N 3/08 (2013.01);
Abstract

Tabular data is accessed that contains multiple entries of alphanumeric data. Multiple tokens are generated of the multiple entries of alphanumeric data using a tokenization process. The tokenization process maintains jargon-specific features of the alphanumeric data. Multiple embeddings of the multiple entries of alphanumeric data are generated using the tokens. The embeddings capture similarity of the multiple entries considering all of global features, column features, and row features in the tokens of the tabular data. A neural network is used to predict probabilities for pre-defined classes for the tabular data using the generated embeddings.


Find Patent Forward Citations

Loading…