The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Jun. 12, 2018

Filed:

May 31, 2016

Applicant:

ABBYY Development LLC, Moscow, RU;

Inventor:

Aleksey Kalyuzhny, Moscow Oblast, RU;

Assignee:

ABBYY DEVELOPMENT LLC, Moscow, RU;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/00 (2006.01); G06K 9/32 (2006.01); G06K 9/62 (2006.01); G06T 3/40 (2006.01); G06K 9/46 (2006.01); G06T 7/00 (2017.01);
U.S. Cl.
CPC ...
G06K 9/325 (2013.01); G06K 9/00449 (2013.01); G06K 9/00483 (2013.01); G06K 9/4604 (2013.01); G06K 9/6218 (2013.01); G06K 9/6267 (2013.01); G06T 3/40 (2013.01); G06T 7/0085 (2013.01);
Abstract

Systems and methods are described for receiving a current image that partially overlaps with a previous image of a series of images of an original document; performing optical character recognition (OCR) of the current image, producing an OCR text and a corresponding text layout; identifying textual artifacts in the current and previous images, each represented by a sequence of symbols having a frequency of occurrence within the OCR text below a threshold frequency; identifying corresponding base points associated with textual artifacts; identifying parameters of a coordinate transformation converting coordinates of the previous image into coordinates of the current image; associating part of the OCR text with a cluster of symbol sequences, wherein the symbol sequences are produced by processing previously received images; identifying an order of clusters of symbol sequences reflecting a layout of the original document; and producing a resulting OCR text representing a portion of the original document.
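As a rough illustration of the frame-alignment idea in the abstract, the sketch below picks rare words in two overlapping OCR results, treats their positions as base points, and estimates how the previous image maps onto the current one. It is not the patented method: the abstract does not specify the transformation model, so a pure translation estimated by the median offset is assumed here, and the names `rare_words`, `estimate_translation`, and `max_count`, along with the `(word, (x, y))` input format, are hypothetical.

```python
from collections import Counter
from statistics import median


def rare_words(words, max_count=1):
    """Map each rare word to the position of its first occurrence.

    `words` is a list of (text, (x, y)) pairs from one OCR pass.
    A word counts as rare if it occurs at most `max_count` times
    (an assumed stand-in for the abstract's frequency threshold).
    """
    counts = Counter(text for text, _ in words)
    rare = {}
    for text, pos in words:
        if counts[text] <= max_count:
            rare.setdefault(text, pos)  # keep the first position only
    return rare


def estimate_translation(prev_words, curr_words, max_count=1):
    """Estimate (dx, dy) taking previous-image coordinates to current ones.

    Assumes a translation-only model; returns None when the two frames
    share no rare word to use as a base point.
    """
    prev_rare = rare_words(prev_words, max_count)
    curr_rare = rare_words(curr_words, max_count)
    shared = sorted(prev_rare.keys() & curr_rare.keys())
    if not shared:
        return None
    # The median offset tolerates a few mismatched base points.
    dx = median(curr_rare[w][0] - prev_rare[w][0] for w in shared)
    dy = median(curr_rare[w][1] - prev_rare[w][1] for w in shared)
    return dx, dy


if __name__ == "__main__":
    prev = [("the", (0, 0)), ("the", (50, 0)), ("invoice", (10, 20)),
            ("ABBYY", (30, 40)), ("total", (5, 60))]
    curr = [("the", (3, -10)), ("invoice", (13, 10)), ("ABBYY", (33, 30)),
            ("the", (70, 30)), ("zebra", (1, 1))]
    print(estimate_translation(prev, curr))
```

Here the common word "the" is ignored because it occurs twice in each frame, while "invoice" and "ABBYY" serve as base points. A fuller treatment would likely fit a richer transformation (for example affine or projective) and then merge the aligned text into clusters, as the abstract outlines.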

