The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 10, 2015

Filed:

Aug. 06, 2010
Applicants:

David Petrou, Brooklyn, NY (US);

Ashok C. Popat, Menlo Park, CA (US);

Matthew R. Casey, San Francisco, CA (US);

Inventors:

David Petrou, Brooklyn, NY (US);

Ashok C. Popat, Menlo Park, CA (US);

Matthew R. Casey, San Francisco, CA (US);

Assignee:

Google Inc., Mountain View, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06K 9/18 (2006.01); G06F 17/30 (2006.01); G06K 9/03 (2006.01); G06K 9/00 (2006.01); G06K 9/72 (2006.01);
U.S. Cl.
CPC ...
G06F 17/30244 (2013.01); G06F 17/30864 (2013.01); G06K 9/00483 (2013.01); G06K 9/036 (2013.01); G06K 9/72 (2013.01);
Abstract

A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical document containing the one or more high quality textual strings is retrieved. At least a portion of the canonical document is sent to the client system.


Find Patent Forward Citations

Loading…