The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:

Jan. 06, 2026

Filed:

Mar. 28, 2023

Applicant:

Cloudera, Inc., Santa Clara, CA (US);

Inventors:

Rituparna Agrawal, Palo Alto, CA (US);

Anupam Singh, Palo Alto, CA (US);

Prithviraj Pandian, Palo Alto, CA (US);

Assignee:

CLOUDERA, INC., Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/248 (2019.01); G06F 16/21 (2019.01); G06F 16/2455 (2019.01); G06F 16/28 (2019.01); G06F 16/84 (2019.01);
U.S. Cl.
CPC ...
G06F 16/248 (2019.01); G06F 16/211 (2019.01); G06F 16/2455 (2019.01); G06F 16/285 (2019.01); G06F 16/86 (2019.01);
Abstract

Systems and methods for very fast grouping of "similar" SQL queries according to user-supplied similarity criteria. The user-supplied similarity criteria include a threshold quantifying the degree of similarity between SQL queries and common artifacts included in the queries. A similarity-characterizing data structure allows for the very fast grouping of "similar" SQL queries. Because the computation is distributed among multiple compute nodes, a small cluster of compute nodes takes a short time to compute the similarity-characterizing data on a workload of tens of millions of queries. The user can supply the similarity criteria through a UI or a command line tool. Furthermore, the user can adjust the degree of similarity by supplying new similarity criteria. Accordingly, the system can display, in real time or near real time, updated SQL groupings corresponding to the newly supplied similarity criteria using the originally computed similarity-characterizing data structure.
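
The idea in the abstract of computing similarity data once and then regrouping cheaply under a new threshold can be illustrated with a small sketch. This is not the patented method: the abstract does not disclose the data structure or the similarity measure, so everything here is an assumption made for illustration. In particular, the helper names (`artifacts`, `build_similarity_index`, `group_queries`), the token-based artifact extraction, the Jaccard score, and the union-find grouping are all hypothetical, and the all-pairs loop runs on a single machine rather than being distributed across compute nodes.

```python
import re
from itertools import combinations

# Assumption: a tiny keyword list; a real SQL parser would be needed in practice.
SQL_KEYWORDS = {"select", "from", "where", "and", "or", "join", "on",
                "group", "by", "order", "as"}


def artifacts(sql):
    """Hypothetical artifact extractor: identifiers (tables, columns) in a query."""
    tokens = re.findall(r"[A-Za-z_][A-Za-z0-9_.]*", sql.lower())
    return frozenset(t for t in tokens if t not in SQL_KEYWORDS)


def build_similarity_index(queries):
    """Computed once: Jaccard score for every query pair, highest score first."""
    sets = [artifacts(q) for q in queries]
    pairs = []
    for i, j in combinations(range(len(queries)), 2):
        union = sets[i] | sets[j]
        score = len(sets[i] & sets[j]) / len(union) if union else 1.0
        pairs.append((score, i, j))
    pairs.sort(reverse=True)
    return pairs


def group_queries(n, index, threshold):
    """Regroup n queries for a new threshold by reusing the precomputed index."""
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for score, i, j in index:
        if score < threshold:
            break  # index is sorted, so no later pair can qualify
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[max(ri, rj)] = min(ri, rj)

    groups = {}
    for k in range(n):
        groups.setdefault(find(k), []).append(k)
    return sorted(groups.values())


QUERIES = [
    "SELECT a, b FROM t1 WHERE a > 1",
    "SELECT a, b FROM t1 WHERE b < 5",
    "SELECT a, c FROM t1",
    "SELECT x FROM t2",
]
INDEX = build_similarity_index(QUERIES)        # expensive step, done once
STRICT = group_queries(len(QUERIES), INDEX, 0.9)  # cheap regrouping
LOOSE = group_queries(len(QUERIES), INDEX, 0.5)   # cheap regrouping
```

In this sketch, changing the threshold only reruns `group_queries` over the stored pair scores, which mirrors the abstract's point that new similarity criteria reuse the originally computed data. The quadratic pair enumeration would not scale to tens of millions of queries; it stands in for whatever distributed computation the patent actually describes.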

