The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 22, 2016

Filed:

Jun. 03, 2014
Applicant:

Microsoft Corporation, Redmond, WA (US);

Inventors:

Sanjay Agrawal, Sammamish, WA (US);

Kaushik Chakrabarti, Redmond, WA (US);

Surajit Chaudhuri, Redmond, WA (US);

Venkatesh Ganti, Redmond, WA (US);

Assignee:
Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2006.01); G06F 7/00 (2006.01); G06F 17/30 (2006.01); G06F 17/27 (2006.01);
U.S. Cl.
CPC ...
G06F 17/30011 (2013.01); G06F 17/278 (2013.01);
Abstract

A set of documents is filtered for entity extraction. A list of entity strings is received. A set of token sets that covers the entity strings in the list is determined. An inverted index generated on a first set of documents is queried using the set of token sets to determine a set of document identifiers for a subset of the documents in the first set. A second set of documents identified by the set of document identifiers is retrieved from the first set of documents. The second set of documents is filtered to include one or more documents of the second set that each includes a match with at least one entity string of the list of entity strings. Entity recognition may be performed on the filtered second set of documents.


Find Patent Forward Citations

Loading…