The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 03, 2021

Filed:

Jun. 19, 2019
Applicant:

Infosys Limited, Bangalore, IN;

Inventors:
Assignee:

INFOSYS LIMITED, Bangalore, IN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/62 (2006.01); G06N 20/00 (2019.01);
U.S. Cl.
CPC ...
G06K 9/6262 (2013.01); G06N 20/00 (2019.01); G06K 2209/01 (2013.01);
Abstract

A computer implemented a method and system for enrichment of OCR extracted data is disclosed comprising of accepting a set of extraction criteria and a set of configuration parameters by a data extraction engine. The data extraction engine captures data satisfying an extraction criteria using the configuration parameters and adapts the captured data using a set of domain specific rules and a set of OCR error patterns. A learning engine generates learning data models using the adapted data and the configuration parameters and the system dynamically updates the extraction criteria using the generated learning data models. The extraction criteria comprise one or more extraction templates wherein an extraction template includes one of a regular expression, geometric markers, anchor text markers and a combination thereof.


Find Patent Forward Citations

Loading…