The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 06, 2019

Filed:

Sep. 22, 2016
Applicant:

Evernote Corporation, Redwood City, CA (US);

Inventors:

Alexander Pashintsev, Cupertino, CA (US);

Boris Gorbatov, Sunnyvale, CA (US);

Eugene Livshitz, San Mateo, CA (US);

Vitaly Glazkov, Moscow, RU;

Assignee:

EVERNOTE CORPORATION, Redwood City, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/00 (2006.01); G06K 9/52 (2006.01); G06T 7/60 (2017.01); G06T 3/40 (2006.01);
U.S. Cl.
CPC ...
G06K 9/00456 (2013.01); G06K 9/00463 (2013.01); G06K 9/52 (2013.01); G06T 3/40 (2013.01); G06T 7/60 (2013.01);
Abstract

Determining if a document is a text page includes partitioning the document into a plurality of cells, scaling each of the cells to a standardized number of pixels to provide a corresponding snippet for each of the cells, using a classifier to examine the snippets to determine which of the cells are classified as text and which of the cells are not classified as text, determining a volume of text for the document based on a total amount of text in the document corresponding to a sum of an amount of text in each of the cells classified as text, and determining that the document is a text page in response to the total amount exceeding a pre-determined threshold. In response to the total amount being less than the pre-determined threshold, cells not classified as text may be examined further. The classifier may be provided by training a neural net.


Find Patent Forward Citations

Loading…