The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 28, 2023

Filed:

Aug. 21, 2019
Applicant:

Oracle International Corporation, Redwood Shores, CA (US);

Inventors:

Tomas Karnagel, Zurich, CH;

Sam Idicula, Santa Clara, CA (US);

Hesam Fathi Moghadam, Sunnyvale, CA (US);

Nipun Agarwal, Saratoga, CA (US);

Assignee:

Oracle International Corporation, Redwood Shores, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/00 (2019.01); G06K 9/62 (2022.01); G06N 20/00 (2019.01);
U.S. Cl.
CPC ...
G06K 9/623 (2013.01); G06K 9/6257 (2013.01); G06N 20/00 (2019.01);
Abstract

The present invention relates to dimensionality reduction for machine learning (ML) models. Herein are techniques that individually rank features and combine features based on their rank to achieve an optimal combination of features that may accelerate training and/or inferencing, prevent overfitting, and/or provide insights into somewhat mysterious datasets. In an embodiment, a computer ranks features of datasets of a training corpus. For each dataset and for each landmark percentage, a target ML model is configured to receive only a highest ranking landmark percentage of features, and a landmark accuracy achieved by training the ML model with the dataset is measured. Based on the landmark accuracies and meta-features values of the dataset, a respective training tuple is generated for each dataset. Based on all of the training tuples, a regressor is trained to predict an optimal amount of features for training the target ML model.


Find Patent Forward Citations

Loading…