The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 24, 2012

Filed:

Mar. 30, 2009
Applicants:

Falk Brauer, Dresden, DE;

Wojciech Barczynski, Dresden, DE;

Hong-hai DO, London, GB;

Alexander Löser, Berlin, DE;

Marcus Schramm, Dresden, DE;

Inventors:

Falk Brauer, Dresden, DE;

Wojciech Barczynski, Dresden, DE;

Hong-Hai Do, London, GB;

Alexander Löser, Berlin, DE;

Marcus Schramm, Dresden, DE;

Assignee:

SAP AG, Walldorf, DE;

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 17/20 (2006.01); G06F 17/30 (2006.01);
U.S. Cl.
CPC ...
Abstract

Methods and systems are described that involve recognizing complex entities from text documents with the help of structured data and Natural Language Processing (NLP) techniques. In one embodiment, the method includes receiving a document as input from a set of documents, wherein the document contains text or unstructured data. The method also includes identifying a plurality of text segments from the document via a set of tagging techniques. Further, the method includes matching the identified plurality of text segments against attributes of a set of predefined entities. Lastly, a best matching predefined entity is selected for each text segment from the plurality of text segments. In one embodiment, the system includes a set of documents, each document containing text or unstructured data. The system also includes a database storage unit that stores a set of predefined entities, wherein each entity contains a set of attributes. Further, the system includes a processor to identify a plurality of text segments from a document via a set of tagging techniques and to match the identified plurality of text segments against the set of attributes.


Find Patent Forward Citations

Loading…