The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Patent No.:

US 9858258 B1

Date of Patent:

Jan. 02, 2018

Filed:

Sep. 30, 2016

Automatic locale determination for electronic documents

Applicant:

Coupa Software Incorporated, San Mateo, CA (US);

Inventor:

Matthew Pasquini, San Mateo, CA (US);

Assignee:

Coupa Software Incorporated, San Mateo, CA (US);

Attorney:

Hickman Palermo Becker Bingham LLP

Primary Examiner:

Brian Albertalli

Int. Cl.

CPC ...

G06F 17/27 (2006.01); G06F 17/22 (2006.01);

U.S. Cl.

CPC ...

G06F 17/2765 (2013.01); G06F 17/2247 (2013.01); G06F 17/2252 (2013.01); G06F 17/275 (2013.01);

Abstract

Automatic locale determination for documents is described. In an embodiment, a computer server receives an electronic document comprising a plurality of unknown-language data elements each associated with one or more types. Based on a document schema of the document, the computer system selects one or more unknown-language data elements from the plurality of unknown-language data elements and assigning to each of the one or more unknown-language data elements a corresponding weight value based on a respective type of the unknown-language data element. The computer system compares the one or more unknown-language data elements with a plurality of known-language data elements that are associated with the document schema and based on the comparing, determines a number of unknown-language data elements in the one or more unknown-language data elements that matched any in a subset of the plurality of known-language data elements, wherein the subset of known-language data elements corresponds to a particular language. Based on the number of data elements that matched to the subset of known-language data elements and based on the corresponding weight assigned to each unknown-language data element in the number of unknown-language data elements, the computer system determines a language confidence level value specifying a level of machine confidence that the document is expressed in the particular language and based on the language confidence value for the particular language exceeding a language threshold value, automatically processes the document using the particular language.

Find Patent Forward Citations