The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 08, 2020

Filed:

Apr. 25, 2019
Applicant:

Zorroa Corporation, Berkeley, CA (US);

Inventors:

Juan Jose Buhler, Woodside, CA (US);

David DeBry, Salt Lake City, UT (US);

Daniel Wexler, Soda Springs, CA (US);

Assignee:

Zorroa Corporation, San Francisco, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/72 (2006.01); G06K 9/00 (2006.01); G06K 9/62 (2006.01); G06K 9/68 (2006.01); H04N 1/00 (2006.01);
U.S. Cl.
CPC ...
G06K 9/726 (2013.01); G06K 9/00469 (2013.01); G06K 9/6256 (2013.01); G06K 9/685 (2013.01); H04N 1/00018 (2013.01); H04N 1/00034 (2013.01); H04N 1/00082 (2013.01); G06K 2009/6864 (2013.01); G06K 2209/01 (2013.01);
Abstract

A method of analyzing and organizing printed documents is performed at a computing system having one or more processors and memory. The method includes receiving one or more printed documents, each including one or more pages. The method includes processing each page of each printed document. The method includes scanning the respective page to obtain an image file. The method also includes determining a document class for the respective page by inputting the image file to one or more trained classifier models, and generating a semantic analyzer pipeline including at least an optical character recognition (OCR)-based semantic analyzer. The method also includes applying the OCR-based semantic analyzer to the preprocessed output page to generate a preprocessed output page and to extract semantic information corresponding to the respective page. The method includes determining a digital organization for the respective printed document based on the extracted semantic information and the document class.


Find Patent Forward Citations

Loading…