The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent: Jun. 26, 2018

Filed: Jun. 23, 2016

Applicant: ABBYY Production LLC, Moscow, RU;

Assignee: ABBYY PRODUCTION LLC, Moscow, RU;

Attorney:
Primary Examiner:
Int. Cl.: CPC ... G06F 17/27 (2006.01);
U.S. Cl.: CPC ... G06F 17/278 (2013.01); G06F 17/271 (2013.01); G06F 17/277 (2013.01); G06F 17/2755 (2013.01); G06F 17/2785 (2013.01);
Abstract

Systems and methods for multi-stage recognition of named entities based on morphological and semantic features of natural language texts. An example method comprises: performing a lexico-morphological analysis of a natural language text comprising a plurality of tokens, each token comprising at least one natural language word; determining, based on the lexico-morphological analysis, one or more lexical meanings and grammatical meanings associated with each token of the plurality of tokens; for each token of the plurality of tokens, evaluating one or more classifier functions using the lexical and grammatical meanings associated with the tokens, wherein a value of each classifier function is indicative of a degree of association of the token with a category of named entities; performing a syntactico-semantic analysis of at least part of the natural language text to produce a plurality of semantic structures representing the part of the natural language text; and interpreting the semantic structures using a set of production rules to determine, for one or more tokens comprised by the part of the natural language text, a degree of association of the token with a category of named entities.
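
The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy lexicon, the hand-written classifier functions, the (predicate, role, token index) triples standing in for semantic structures, the rule table, and the choice to combine the two stages by taking the larger score are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field

# Hypothetical toy lexicon: surface form -> (lexical meaning, grammatical meaning).
LEXICON = {
    "john": ("GIVEN_NAME", "noun"),
    "works": ("TO_WORK", "verb"),
    "at": ("LOCATIVE", "preposition"),
    "abbyy": ("UNKNOWN", "noun"),
}


@dataclass
class Token:
    text: str
    lexical: str = "UNKNOWN"
    grammatical: str = "unknown"
    scores: dict = field(default_factory=dict)  # category -> degree of association


def lexico_morphological_analysis(text):
    """Split the text into tokens and attach lexical and grammatical meanings."""
    tokens = []
    for word in text.split():
        surface = word.strip(".,")
        lexical, grammatical = LEXICON.get(surface.lower(), ("UNKNOWN", "unknown"))
        tokens.append(Token(surface, lexical, grammatical))
    return tokens


# Stage 1: one classifier function per category (hand-written stand-ins
# for whatever trained classifiers a real system would use).
def person_classifier(tok):
    if tok.lexical == "GIVEN_NAME":
        return 0.9
    return 0.1 if tok.text[:1].isupper() else 0.0


def organization_classifier(tok):
    capitalized_noun = tok.grammatical == "noun" and tok.text[:1].isupper()
    return 0.4 if capitalized_noun and tok.lexical == "UNKNOWN" else 0.0


CLASSIFIERS = {"PERSON": person_classifier, "ORGANIZATION": organization_classifier}


def syntactico_semantic_analysis(tokens):
    """Toy stand-in: emit (predicate, role, token_index) triples for 'X works at Y'."""
    structures = []
    for i, tok in enumerate(tokens):
        if tok.lexical == "TO_WORK":
            if i > 0:
                structures.append(("TO_WORK", "agent", i - 1))
            if i + 2 < len(tokens) and tokens[i + 1].lexical == "LOCATIVE":
                structures.append(("TO_WORK", "employer", i + 2))
    return structures


# Stage 2: production rules mapping (predicate, role) -> (category, degree).
PRODUCTION_RULES = {
    ("TO_WORK", "agent"): ("PERSON", 0.95),
    ("TO_WORK", "employer"): ("ORGANIZATION", 0.95),
}


def recognize(text):
    tokens = lexico_morphological_analysis(text)
    # Stage 1: score every token with every classifier function.
    for tok in tokens:
        tok.scores = {cat: fn(tok) for cat, fn in CLASSIFIERS.items()}
    # Stage 2: interpret the semantic structures with the production rules.
    for predicate, role, index in syntactico_semantic_analysis(tokens):
        rule = PRODUCTION_RULES.get((predicate, role))
        if rule:
            category, degree = rule
            tok = tokens[index]
            # Assumption: keep the larger of the stage-1 and stage-2 scores.
            tok.scores[category] = max(tok.scores.get(category, 0.0), degree)
    return tokens


for tok in recognize("John works at ABBYY."):
    best = max(tok.scores, key=tok.scores.get)
    print(tok.text, best if tok.scores[best] >= 0.5 else "O", tok.scores)
```

In this sketch the first stage alone gives "ABBYY" only a weak organization score from its surface features, and the second stage raises that score once the token is found in the employer role of the toy semantic structure. The 0.5 reporting threshold is likewise an arbitrary illustrative choice.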

