The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 22, 2022

Filed:

Apr. 11, 2019
Applicant:

Oracle International Corporation, Redwood Shores, CA (US);

Inventors:

Sam Idicula, Santa Clara, CA (US);

Tomas Karnagel, Zurich, CH;

Jian Wen, Hollis, NH (US);

Seema Sundara, Nashua, NH (US);

Nipun Agarwal, Saratoga, CA (US);

Mayur Bency, Foster City, CA (US);

Assignee:

Oracle International Corporation, Redwood Shores, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/2453 (2019.01); G06F 16/21 (2019.01); G06N 20/00 (2019.01); G06N 20/20 (2019.01);
U.S. Cl.
CPC ...
G06F 16/24545 (2019.01); G06F 16/217 (2019.01); G06N 20/00 (2019.01); G06N 20/20 (2019.01);
Abstract

Embodiments utilize trained query performance machine learning (QP-ML) models to predict an optimal compute node cluster size for a given in-memory workload. The QP-ML models include models that predict query task runtimes at various compute node cardinalities, and models that predict network communication time between nodes of the cluster. Embodiments also utilize an analytical model to predict overlap between predicted task runtimes and predicted network communication times. Based on this data, an optimal cluster size is selected for the workload. Embodiments further utilize trained data capacity machine learning (DC-ML) models to predict a minimum number of compute nodes needed to run a workload. The DC-ML models include models that predict the size of the workload dataset in a target data encoding, models that predict the amount of memory needed to run the queries in the workload, and models that predict the memory needed to accommodate changes to the dataset.


Find Patent Forward Citations

Loading…