The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jul. 08, 2008
Filed:
Feb. 19, 2003
Giovanni B. Marchisio, Kirkland, WA (US);
Krzysztof Koperski, Seattle, WA (US);
Jisheng Liang, Bellevue, WA (US);
Alejandro Murua, Seattle, WA (US);
Thien Nguyen, Edmonds, WA (US);
Carsten Tusk, Seattle, WA (US);
Navdeep S. Dhillon, Seattle, WA (US);
Lubos Pochman, Breckenridge, CO (US);
Giovanni B. Marchisio, Kirkland, WA (US);
Krzysztof Koperski, Seattle, WA (US);
Jisheng Liang, Bellevue, WA (US);
Alejandro Murua, Seattle, WA (US);
Thien Nguyen, Edmonds, WA (US);
Carsten Tusk, Seattle, WA (US);
Navdeep S. Dhillon, Seattle, WA (US);
Lubos Pochman, Breckenridge, CO (US);
Evri Inc., Seattle, WA (US);
Abstract
Methods and systems for syntactically indexing and searching data sets to achieve more accurate search results and for indexing and searching data sets using entity tags alone or in combination therewith are provided. Example embodiments provide a Syntactic Query Engine ('SQE') that parses, indexes, and stores a data set, as well as processes natural language queries subsequently submitted against the data set. The SQE comprises a Query Preprocessor, a Data Set Preprocessor, a Query Builder, a Data Set Indexer, an Enhanced Natural Language Parser ('ENLP'), a data set repository, and, in some embodiments, a user interface. After preprocessing the data set, the SQE parses the data set according to a variety of levels of parsing and determines as appropriate the entity tags and syntactic and grammatical roles of each term to generate enhanced data representations for each object in the data set. The SQE indexes and stores these enhanced data representations in the data set repository. Upon subsequently receiving a query, the SQE parses the query also using a variety of parsing levels and searches the indexed stored data set to locate data that contains similar terms used in similar grammatical roles and/or with similar entity tag types as indicated by the query. In this manner, the SQE is able to achieve more contextually accurate search results more frequently than using traditional search engines.