The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Sep. 20, 2022
Filed:
Mar. 30, 2020
Coupa Software Incorporated, San Mateo, CA (US);
COUPA SOFTWARE INCORPORATED, San Mateo, CA (US);
Abstract
Described herein is a computer-implemented method for automatic extraction of canonical data from an electronic document. The method comprises classifying a first text rectangle in an electronic document as a label and a second text rectangle as a value using a first machine learning algorithm. A first probability score of a likelihood of the first text rectangle corresponding to a first canonical category is determined using a second machine learning algorithm. A second probability score of a likelihood of the second text rectangle corresponding to a first canonical category is determined using a third machine learning algorithm. A relative spatial position of the second text rectangle relative to the first text rectangle is calculated. Based on the relative spatial position, the first probability score, and the second probability score, the first text rectangle, and the second text rectangle are classified into the first canonical category.
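The final step of the abstract, combining two probability scores with a relative spatial position, can be illustrated with a minimal Python sketch. This is not the patented implementation: the abstract does not name the machine learning algorithms or say how the three signals are combined, so the `TextRect` type, the exponential distance heuristic in `spatial_score`, the `scale` parameter, and the multiplicative combination in `pair_score` are all hypothetical. The two probabilities are passed in as plain numbers, standing in for the outputs of the second and third models.

```python
from dataclasses import dataclass
import math


@dataclass(frozen=True)
class TextRect:
    """A text rectangle on a page (hypothetical type; y grows downward)."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height

    @property
    def center(self):
        return (self.x + self.w / 2, self.y + self.h / 2)


def relative_position(label, value):
    """Offset (dx, dy) of the value's center from the label's center."""
    lx, ly = label.center
    vx, vy = value.center
    return (vx - lx, vy - ly)


def spatial_score(dx, dy, scale=200.0):
    """Assumed heuristic: a value sits to the right of or below its label,
    and plausibility decays with distance. Not taken from the patent."""
    if dx <= 0 and dy <= 0:
        return 0.0  # value is neither right of nor below the label
    return math.exp(-math.hypot(dx, dy) / scale)


def pair_score(label, value, p_label, p_value):
    """Combine both category probabilities with the spatial score.

    p_label and p_value stand in for the outputs of the second and third
    models; multiplying the three terms is an assumption for illustration.
    """
    dx, dy = relative_position(label, value)
    return p_label * p_value * spatial_score(dx, dy)


label = TextRect("Invoice No.", x=10, y=10, w=80, h=12)
near_value = TextRect("INV-001", x=100, y=10, w=60, h=12)
far_value = TextRect("INV-001", x=100, y=400, w=60, h=12)

near = pair_score(label, near_value, p_label=0.9, p_value=0.8)
far = pair_score(label, far_value, p_label=0.9, p_value=0.8)
print(f"near={near:.3f} far={far:.3f}")
```

With identical probability scores, the value on the same line as the label scores higher than the one far down the page. Under these assumptions, a label/value pair would be assigned to the canonical category when its combined score clears a chosen threshold.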