The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 15, 2020

Filed:

Nov. 09, 2018
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

HongLei Guo, Beijing, CN;

Li Zhang, Beijing, CN;

Changhua Sun, Beijing, CN;

Birgit M. Pfitzmann, Zürich, CH;

Shiwan Zhao, Beijing, CN;

Zhong Su, Beijing, CN;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); G06F 40/30 (2020.01); G06F 40/279 (2020.01); G06F 40/183 (2020.01); G06F 40/174 (2020.01); G06F 40/177 (2020.01); G10L 15/22 (2006.01); G06F 3/0484 (2013.01); G06F 16/21 (2019.01);
U.S. Cl.
CPC ...
G06F 40/30 (2020.01); G06F 40/174 (2020.01); G06F 40/183 (2020.01); G06F 40/279 (2020.01); G06F 3/04842 (2013.01); G06F 16/21 (2019.01); G06F 40/177 (2020.01); G10L 15/22 (2013.01);
Abstract

A method is presented for error correction of tabular data in document conversion. The method includes identifying errors from tabular data transformation by employing an error/invalidation checking module and correcting the identified errors from the tabular data transformation by employing an error correction module. The error correction module includes identifying a main structure pattern from common row structures, concatenating separate keywords according to natural language processing models employing training data obtained from a plurality of candidate tabular data, adjusting cells in the tabular data based on a domain-specific knowledge database including the training data in combination with linguistic and semantic knowledge, merging partial tabular data pieces, and generating an adjusted table as output on a display of a computing device.


Find Patent Forward Citations

Loading…