The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, the date the patent was issued, the date it was filed, the title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document (in Adobe Acrobat format, i.e., PDF).
Patent No.:
Date of Patent:
Nov. 25, 2014
Filed:
Oct. 28, 2013
Applicants:
Girish Welling, Nashua, NH (US);
Nirupam Sarkar, Westford, MA (US);
Tushar Mahata, Jersey City, NJ (US);
Vartika Singh, Lawrence, MA (US);
Depankar Neogi, Wilmington, MA (US);
Steven K. Ladd, North Andover, MA (US);
Inventors:
Girish Welling, Nashua, NH (US);
Nirupam Sarkar, Westford, MA (US);
Tushar Mahata, Jersey City, NJ (US);
Vartika Singh, Lawrence, MA (US);
Depankar Neogi, Wilmington, MA (US);
Steven K. Ladd, North Andover, MA (US);
Assignee:
Gruntworx, LLC, Franklin, NC (US);
Abstract
In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically pre-processing each received electronic document using a plurality of image transformation algorithms to improve subsequent data extraction from said document is provided. The method includes: electronically partitioning each received electronic document page into pieces; automatically processing each piece of the received electronic document page using each of a plurality of image pre-processing algorithms to produce a plurality of image variations of each piece; and analyzing the outputs of subsequent processing and data extraction, on each of the image variations of the pieces to determine which output is best, from the plurality of outputs for each piece.
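The three steps named in the abstract (partition each page into pieces, run every pre-processing algorithm on every piece, keep the best extraction output per piece) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the page is a small grid of grayscale values, the pieces are horizontal strips, the three transforms are generic examples, and `extract` is a hypothetical stand-in for real data extraction that scores an image by the fraction of pure black or white pixels. The abstract does not say how the "best" output is determined, so the confidence score here is only a placeholder.

```python
from typing import Callable, Dict, List, Tuple

Image = List[List[int]]  # rows of grayscale pixel values, 0..255


def partition(page: Image, rows_per_piece: int) -> List[Image]:
    """Split a page into horizontal strips (one possible partitioning scheme)."""
    return [page[i:i + rows_per_piece] for i in range(0, len(page), rows_per_piece)]


def identity(img: Image) -> Image:
    """Return an unmodified copy of the piece."""
    return [row[:] for row in img]


def binarize(img: Image, threshold: int = 128) -> Image:
    """Map every pixel to pure black (0) or pure white (255)."""
    return [[255 if px >= threshold else 0 for px in row] for row in img]


def stretch_contrast(img: Image) -> Image:
    """Linearly rescale pixel values so they span the full 0..255 range."""
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    if hi == lo:
        return identity(img)
    return [[(px - lo) * 255 // (hi - lo) for px in row] for row in img]


# Hypothetical set of image pre-processing algorithms applied to every piece.
TRANSFORMS: Dict[str, Callable[[Image], Image]] = {
    "identity": identity,
    "binarize": binarize,
    "stretch_contrast": stretch_contrast,
}


def extract(img: Image) -> Tuple[str, float]:
    """Toy stand-in for data extraction: returns (output, confidence).

    The output renders dark pixels as '#' and light pixels as '.'.
    The confidence is the fraction of pixels that are exactly 0 or 255.
    """
    pixels = [px for row in img for px in row]
    crisp = sum(1 for px in pixels if px in (0, 255))
    text = "".join("#" if px < 128 else "." for px in pixels)
    return text, crisp / len(pixels)


def best_output_per_piece(page: Image, rows_per_piece: int = 2) -> List[Tuple[str, str]]:
    """For each piece, try every transform and keep the highest-confidence output."""
    results = []
    for piece in partition(page, rows_per_piece):
        candidates = {name: extract(fn(piece)) for name, fn in TRANSFORMS.items()}
        best_name = max(candidates, key=lambda name: candidates[name][1])
        results.append((best_name, candidates[best_name][0]))
    return results


if __name__ == "__main__":
    page = [
        [10, 200, 30, 220],
        [15, 210, 25, 215],
        [0, 255, 0, 255],
        [255, 0, 255, 0],
    ]
    for transform_name, output in best_output_per_piece(page):
        print(transform_name, output)
```

Selecting the winner per piece rather than per page means different regions of the same page can end up using different transforms, which appears to be the point of partitioning before pre-processing.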