The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 12, 2021

Filed:

Jan. 22, 2019
Applicant:

Groupon, Inc., Chicago, IL (US);

Inventors:

Stephen Clark Mitchell, Chicago, IL (US);

Pavel Melnichuk, Chicago, IL (US);

Assignee:

GROUPON, INC., Chicago, IL (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/34 (2006.01); G06K 9/00 (2006.01); G06K 9/46 (2006.01); G06Q 20/04 (2012.01); G06K 9/62 (2006.01);
U.S. Cl.
CPC ...
G06K 9/00442 (2013.01); G06K 9/4671 (2013.01); G06Q 20/047 (2020.05); G06K 9/6227 (2013.01); G06K 2209/01 (2013.01);
Abstract

Techniques for providing improved optical character recognition (OCR) for receipts are discussed herein. Some embodiments may provide for a system including one or more servers configured to perform receipt image cleanup, logo identification, and text extraction. The image cleanup may include transforming image data of the receipt by using image parameters values that optimize the logo identification, and performing logo identification using a comparison of the image data with training logos associated with merchants. When a merchant is identified, a second image clean up may be performed by using image parameter values optimized for text extraction. A receipt structure may be used to categorize the extracted text. Improved OCR accuracy is also achieved by applying on format rules of the receipt structure to the extracted text.


Find Patent Forward Citations

Loading…