The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent: Aug. 18, 2020

Filed: Dec. 06, 2016

Applicant: International Business Machines Corporation, Armonk, NY (US)

Inventor: Craig A. Statchuk, Kars, CA

Attorneys:

Primary Examiner:

Int. Cl.: G06F 7/00 (2006.01); G06F 16/951 (2019.01); G06F 16/35 (2019.01); G06F 16/31 (2019.01); G06F 16/332 (2019.01); G06F 17/00 (2019.01); G06N 20/00 (2019.01)

U.S. Cl. (CPC): G06F 16/951 (2019.01); G06F 16/316 (2019.01); G06F 16/3322 (2019.01); G06F 16/35 (2019.01); G06N 20/00 (2019.01)

Abstract

A method, system and computer program product for building a data query engine. Initial taxonomies that describe and categorize data are built by expert users (e.g., data scientists) employing machine learning algorithms. The data is also indexed and stored in an index. Queries are then received from non-expert users to query the data based on data categorization from built taxonomies and the indexing. After the queries are executed using the machine learning algorithms in an environment (e.g., Hadoop®), the results of the queries are rated for relevance, precision and accuracy. The machine learning algorithms are also rated based on the number of successful queries. Those machine learning algorithms with a rating above a threshold are identified to be utilized to scan new data to be stored in the index to provide a new environment that replaces the initial environment.
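
The rating and selection step described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The names `QueryResult`, `rate_algorithms`, and `select_algorithms`, the `min_score` cutoff that decides when a query counts as successful, and the sample algorithm names and scores are all assumptions made for illustration. The abstract states only that results are rated for relevance, precision and accuracy, that algorithms are rated by their number of successful queries, and that algorithms rated above a threshold are selected.

```python
from dataclasses import dataclass


@dataclass
class QueryResult:
    """Ratings for one executed query (hypothetical structure)."""
    algorithm: str    # name of the machine learning algorithm used
    relevance: float  # each rating is assumed to lie in [0, 1]
    precision: float
    accuracy: float


def is_successful(result: QueryResult, min_score: float = 0.5) -> bool:
    # Assumption: a query is "successful" when all three ratings
    # reach min_score. The abstract does not define success.
    return min(result.relevance, result.precision, result.accuracy) >= min_score


def rate_algorithms(results: list[QueryResult], min_score: float = 0.5) -> dict[str, int]:
    # Rate each algorithm by its number of successful queries.
    ratings: dict[str, int] = {}
    for result in results:
        ratings.setdefault(result.algorithm, 0)
        if is_successful(result, min_score):
            ratings[result.algorithm] += 1
    return ratings


def select_algorithms(ratings: dict[str, int], threshold: int) -> list[str]:
    # Keep only the algorithms whose rating is above the threshold.
    return sorted(name for name, rating in ratings.items() if rating > threshold)


# Made-up sample data for illustration only.
results = [
    QueryResult("kmeans", 0.9, 0.8, 0.7),
    QueryResult("kmeans", 0.6, 0.9, 0.8),
    QueryResult("naive_bayes", 0.4, 0.9, 0.9),
    QueryResult("naive_bayes", 0.8, 0.7, 0.6),
]
ratings = rate_algorithms(results)
selected = select_algorithms(ratings, threshold=1)
print(ratings)
print(selected)
```

In this sketch the selected algorithms would then be the ones used to scan new data for the index. How that scan and the replacement environment are built is outside what the abstract specifies.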

