The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Jan. 26, 2021

Filed:

Feb. 26, 2020

Applicant:

SAS Institute Inc., Cary, NC (US);

Inventors:

Bruce Monroe Mills, Cary, NC (US);

Vinicius Rabbi Vivaldi, Cary, NC (US);

Assignee:

SAS Institute Inc., Cary, NC (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 20/00 (2019.01); G06F 16/31 (2019.01); G06N 5/02 (2006.01); G06F 40/279 (2020.01); G06F 40/253 (2020.01); G06F 3/0482 (2013.01);
U.S. Cl.
CPC ...
G06N 5/025 (2013.01); G06F 16/31 (2019.01); G06F 40/253 (2020.01); G06F 40/279 (2020.01); G06N 20/00 (2019.01); G06F 3/0482 (2013.01);
Abstract

A computing device receives training data representing different observations where each observation is categorized into one of options for a target variable. The device obtains computer command(s) for categorizing into one of the options for the target variable. The device generates a sampling scheme for sampling terms of the training data. The device generates sampling models by, for N iterations of the sampling scheme: determining a subset of the training data based on a training data index; sampling, based on a term index, the subset of the training data for a subset of terms; and generating, based on the subset of terms, a sampling model for categorizing, according to the computer command(s). Each sampling model is generated from a different subset of terms such that the sampling models are randomized. The device computes an aggregated model for categorizing test data into one of the options for the target variable.
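
The loop described in the abstract (subset the training data, sample a subset of terms, build one model per iteration, then aggregate) can be illustrated with a small sketch. This is not the patented implementation: the per-term majority-label "model", the majority-vote aggregation, and every name and parameter here (`build_sampling_model`, `train_ensemble`, `predict`, `doc_fraction`, `term_fraction`) are assumptions chosen only to make the idea concrete.

```python
import random
from collections import Counter, defaultdict


def build_sampling_model(docs, terms):
    """Hypothetical 'sampling model': map each sampled term to the label
    it most often co-occurs with in the given documents."""
    counts = defaultdict(Counter)
    for text, label in docs:
        for term in set(text.split()) & terms:
            counts[term][label] += 1
    return {term: c.most_common(1)[0][0] for term, c in counts.items()}


def train_ensemble(training_data, n_iterations=25,
                   doc_fraction=0.8, term_fraction=0.5, seed=0):
    """Run N iterations of an assumed sampling scheme and return the models."""
    rng = random.Random(seed)
    vocabulary = sorted({t for text, _ in training_data for t in text.split()})
    n_docs = max(1, int(doc_fraction * len(training_data)))
    n_terms = max(1, int(term_fraction * len(vocabulary)))
    models = []
    for _ in range(n_iterations):
        # Training data index: which observations this iteration uses.
        doc_index = rng.sample(range(len(training_data)), n_docs)
        subset = [training_data[i] for i in doc_index]
        # Term index: which terms of that subset this iteration may use.
        subset_vocab = sorted({t for text, _ in subset for t in text.split()})
        term_index = rng.sample(range(len(subset_vocab)),
                                min(n_terms, len(subset_vocab)))
        terms = {subset_vocab[i] for i in term_index}
        models.append(build_sampling_model(subset, terms))
    return models


def predict(models, text, default=None):
    """Aggregate the models by majority vote over the terms they know."""
    votes = Counter()
    for model in models:
        for term in set(text.split()):
            if term in model:
                votes[model[term]] += 1
    return votes.most_common(1)[0][0] if votes else default


# Toy observations, each categorized into one option of the target variable.
training_data = [
    ("refund money broken", "complaint"),
    ("broken late refund", "complaint"),
    ("great love fast", "praise"),
    ("love great service", "praise"),
]
models = train_ensemble(training_data)
print(predict(models, "refund broken"))
```

Because each iteration draws its own term subset, the individual models differ from one another, which is the randomization the abstract refers to; the vote in `predict` stands in for the aggregated model.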

