The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 30, 2021

Filed:

Dec. 21, 2018
Applicant:

Automation Anywhere Inc., San Jose, CA (US);

Inventors:

Thomas Corcoran, San Jose, CA (US);

Vibhas Gejji, Fremont, CA (US);

Stephen Van Lare, San Jose, CA (US);

Assignee:

Automation Anywhere, Inc., San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/03 (2006.01); G06K 9/62 (2006.01); G06K 9/00 (2006.01);
U.S. Cl.
CPC ...
G06K 9/03 (2013.01); G06K 9/00442 (2013.01); G06K 9/6203 (2013.01); G06K 9/6262 (2013.01); G06K 2209/01 (2013.01);
Abstract

A computer implemented method and system for correcting error produced by Optical Character Recognition (OCR) of text contained in an image encoded document. An error model representing frequency and type of errors produced by Optical Character Recognition Engine is generated. An OCR character string generated by OCR is retrieved. A user-defined pattern of a plurality of character strings is retrieved, where each character string represents a possible correct representation of characters in the OCR character string. The OCR character string is compared to each of the above generated character strings and a 'likelihood score' is calculated based on the information from the error model. The character string with the highest 'likelihood score' is presumed to be the corrected version of the OCR character string.


Find Patent Forward Citations

Loading…