The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Oct. 02, 2012
Filed:
Feb. 01, 2006
Mahesh Jagannath, Shrewsbury, MA (US);
Chitra Bhagwat, Wobum, MA (US);
Joseph Yarmus, Groton, MA (US);
Ari W. Mozes, Lexington, MA (US);
Mahesh Jagannath, Shrewsbury, MA (US);
Chitra Bhagwat, Wobum, MA (US);
Joseph Yarmus, Groton, MA (US);
Ari W. Mozes, Lexington, MA (US);
Oracle International Corporation, Redwood Shores, CA (US);
Abstract
Binning of predictor values used for generating a data mining model provides useful reduction in memory footprint and computation during the computationally dominant decision tree build phase, but reduces the information loss of the model and reduces the introduction of false information artifacts. A method of binning data in a database for data mining modeling in a database system, the data stored in a database table in the database system, the data mining modeling having selected at least one predictor and one target for the data, the data including a plurality of values of the predictor and a plurality of values of the target, the method comprises constructing a binary tree for the predictor that splits the values of the predictor into a plurality of portions, pruning the binary tree, and defining as bins of the predictor leaves of the tree that remain after pruning, each leaf of the tree representing a portion of the values of the predictor.