The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 23, 2010

Filed:

Aug. 21, 2007
Applicants:

Paul Geoffrey Brown, San Jose, CA (US);

Peter Jay Haas, San Jose, CA (US);

Inventors:

Paul Geoffrey Brown, San Jose, CA (US);

Peter Jay Haas, San Jose, CA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2006.01); G06F 5/00 (2006.01); G06N 5/02 (2006.01); G06N 7/00 (2006.01); G06N 7/08 (2006.01);
U.S. Cl.
CPC ...
Abstract

A scheme is used to automatically discover algebraic constraints between pairs of columns in relational data. The constraints may be 'fuzzy' in that they hold for most, but not all, of the records, and the columns may be in the same table or different tables. The scheme first identifies candidate sets of column value pairs that are likely to satisfy an algebraic constraint. For each candidate, the scheme constructs algebraic constraints by applying statistical histogramming, segmentation, or clustering techniques to samples of column values. In query-optimization mode, the scheme automatically partitions the data into normal and exception records. During subsequent query processing, queries can be modified to incorporate the constraints; the optimizer uses the constraints to identify new, more efficient access paths. The results are then combined with the results of executing the original query against the (small) set of exception records.


Find Patent Forward Citations

Loading…