The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jul. 18, 2006
Filed:
Apr. 29, 2003
Robert T. Berks, Auckland, NZ;
Miroslaw A. Flasza, Pickering, CA;
Leo Tat Man Lau, Richmond Hill, CA;
Mark D. Leitch, Ajax, CA;
Sam S. Lightstone, Toronto, CA;
David Tremaine, Thornhill, CA;
Robert T. Berks, Auckland, NZ;
Miroslaw A. Flasza, Pickering, CA;
Leo Tat Man Lau, Richmond Hill, CA;
Mark D. Leitch, Ajax, CA;
Sam S. Lightstone, Toronto, CA;
David Tremaine, Thornhill, CA;
International Business Machines Corporation, Armonk, NY (US);
Abstract
A system and associated method load an input data stream into a multi-dimensional clustering (MDC) table or other structure containing data clustered along one or more dimensions, by assembling blocks of data in a partial block cache in which each partial block is associated with a distinct logical cell. A minimum threshold number of partial blocks may be maintained. Partial blocks may be spilled from the partial block cache to make room for new logical cells. Last partial pages of spilled partial blocks may be stored in a partial page cache to limit I/O if the cell associated with a spilled block is encountered later in the input data stream. Buffers may be reassigned from the partial block cache to the partial page cache if the latter is filled. Parallelism may be employed for efficiency during sorting of input data subsets and during storage of blocks to secondary storage.