The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jun. 11, 2019

Filed:

Aug. 21, 2017
Applicant:

Accenture Global Solutions Limited, Dublin, IE;

Inventors:

Prakash Ghatage, Bangalore, IN;

Nirav Sampat, Mumbai, IN;

Kumar Viswanathan, Bangalore, IN;

Suvendu Kumar Mahapatra, Chennai, IN;

Srikanth Narayanan, Chennai, IN;

Rekha Mani, Chennai, IN;

Aravind Krishnan, Chennai, IN;

Rahul Kotnala, Dehradun, IN;

Kameshkumar Lakshminarayanan, Cuddalore, IN;

Ashish Jain, Chennai, IN;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/93 (2019.01); G06F 16/174 (2019.01); G06F 16/13 (2019.01); G06K 9/00 (2006.01); H04N 1/04 (2006.01); G06F 17/30 (2006.01); G06N 99/00 (2019.01); G06K 9/66 (2006.01); H04N 1/21 (2006.01); G06K 9/20 (2006.01); G06N 20/00 (2019.01);
U.S. Cl.
CPC ...
G06F 16/93 (2019.01); G06F 16/13 (2019.01); G06F 16/1748 (2019.01); G06F 17/30011 (2013.01); G06F 17/30091 (2013.01); G06F 17/30156 (2013.01); G06K 9/00449 (2013.01); G06K 9/00456 (2013.01); G06K 9/00483 (2013.01); G06K 9/2081 (2013.01); G06K 9/66 (2013.01); G06N 20/00 (2019.01); G06N 99/005 (2013.01); H04N 1/04 (2013.01); H04N 1/2166 (2013.01); H04N 2201/0081 (2013.01); H04N 2201/0087 (2013.01); H04N 2201/218 (2013.01);
Abstract

Data extraction and automatic validation from digitized documents in non-editable formats is disclosed. Paper documents are digitized or converted into formats suitable for storage on computers or other digital devices. The digitized documents are classified into one of a plurality of document types and based on the document type, document processing rules are selected for analyzing the digitized documents to enable data extraction and automatic validation. The positions and values of the data fields in the digitized documents are obtained using machine learning techniques. The data field values are automatically validated and assigned confidence scores. Data fields with low confidence scores are flagged for manual review.


Find Patent Forward Citations

Loading…