The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also links to the full patent document (in Adobe Acrobat PDF format).

Date of Patent:

Dec. 5, 2017

Filed:

Aug. 27, 2015

Applicant:

Conduent Business Services, LLC, Dallas, TX (US);

Inventors:

William Radford, Grenoble, FR;

Xavier Carreras, Saint Ismier, FR;

James Brinton Henderson, Vessy, CH;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
G06F 17/27 (2006.01); G06F 17/20 (2006.01); G06F 17/30 (2006.01); G06K 9/00 (2006.01);
U.S. Cl. (CPC)
G06F 17/278 (2013.01); G06F 17/277 (2013.01); G06F 17/30011 (2013.01); G06F 17/30604 (2013.01); G06F 17/30705 (2013.01); G06K 9/00463 (2013.01);
Abstract

A method for entity recognition employs document-level entity tags which correspond to mentions appearing in the document, without specifying their locations. A named entity recognition model is trained on features extracted from text samples tagged with document-level entity tags. A text document to be labeled is received, the text document being tagged with at least one document-level entity tag. A document-specific gazetteer is generated, based on the at least one document-level entity tag. The gazetteer includes a set of entries, one entry for each of a set of entity names. For a text sequence of the document, features for tokens of the text sequence are extracted. The features include document-specific features for tokens matching at least a part of the entity name of one of the gazetteer entries. Entity labels are predicted for the tokens in the text sequence with the named entity recognition model, based on the extracted features.
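
The gazetteer and feature-extraction steps described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The tag format (name, type), the `B`/`I` position features, the function names, and the sample data are all assumptions made for illustration. The abstract does not say how features are encoded or which model consumes them.

```python
from typing import Dict, List, Tuple

# Assumed representation: a document-level tag is (entity name, entity type),
# with no information about where the entity is mentioned in the text.
DocTag = Tuple[str, str]


def build_gazetteer(doc_tags: List[DocTag]) -> List[Dict]:
    """Create one gazetteer entry per tagged entity name."""
    return [
        {"name": name, "type": etype, "tokens": name.lower().split()}
        for name, etype in doc_tags
    ]


def extract_features(tokens: List[str], gazetteer: List[Dict]) -> List[Dict[str, bool]]:
    """Return one feature dict per token, including document-specific features."""
    features = []
    for tok in tokens:
        lowered = tok.lower()
        # Generic features that do not depend on the document's tags.
        feats = {"lower=" + lowered: True, "is_title": tok.istitle()}
        for entry in gazetteer:
            # A token matches if it equals any part of the entity name.
            if lowered in entry["tokens"]:
                # Hypothetical encoding: B if the token starts the name, else I.
                pos = "B" if entry["tokens"][0] == lowered else "I"
                feats[f"gaz_{pos}-{entry['type']}"] = True
        features.append(feats)
    return features


# Illustrative data only.
doc_tags = [("Ada Lovelace", "PER"), ("London", "LOC")]
gazetteer = build_gazetteer(doc_tags)
sentence = "Lovelace moved to London".split()
sentence_features = extract_features(sentence, gazetteer)
```

In this sketch, the per-token feature dictionaries would then be passed to a trained sequence labeler, which predicts an entity label for each token. The choice of labeler is left open here because the abstract refers only to a named entity recognition model.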

