The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Oct. 11, 2011
Filed:
Jul. 23, 2007
Inventors:
Piotr M. Plachta, Toronto (CA);
Wolfram Sauer, Austin, TX (US);
Balakrishna Raghavendra Iyer, San Jose, CA (US);
Steven Wayne White, Austin, TX (US);
Assignee:
International Business Machines Corporation, Armonk, NY (US);
Abstract
Some aspects of the invention provide methods, systems, and computer program products for creating a static dictionary in which longer byte-strings are preferred. To that end, in accordance with aspects of the present invention, a new heuristic is defined to replace the aforementioned frequency count metric used to record the number of times a particular node in a data tree is visited. The new heuristic is based on counting the number of times an end-node of a particular byte-string is visited, while not incrementing a count for nodes storing characters in the middle of the byte-string as often as each time such nodes are visited. The result is an occurrence count metric that favors longer byte-strings, by being biased towards not incrementing the respective occurrence count values for nodes storing characters in the middle of a byte-string.
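The difference between the two metrics in the abstract can be illustrated with a small sketch. It is not the patented implementation: the LZW-style tree growth, the names `Node`, `build_tree`, `collect`, `visits`, and `end_hits`, and the choice to never increment the occurrence count for middle nodes (the abstract only says "not as often as each time") are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One byte in the data tree (hypothetical structure)."""
    children: dict = field(default_factory=dict)
    visits: int = 0    # frequency count: bumped every time the node is visited
    end_hits: int = 0  # occurrence count: bumped only when a match ends here


def build_tree(data: bytes) -> Node:
    """Grow a tree over `data`, recording both metrics side by side."""
    root = Node()
    i, n = 0, len(data)
    while i < n:
        node = root
        # Follow the longest byte-string already present in the tree.
        while i < n and data[i] in node.children:
            node = node.children[data[i]]
            node.visits += 1
            i += 1
        # Only the end-node of the matched byte-string gets an occurrence.
        if node is not root:
            node.end_hits += 1
        # Assumed LZW-style growth: add the next byte as a child without
        # consuming it, so the next match starts at that byte.
        if i < n:
            node.children[data[i]] = Node()
    return root


def collect(node: Node, prefix: bytes = b""):
    """Yield (byte_string, visits, end_hits) for every node below `node`."""
    for byte, child in node.children.items():
        s = prefix + bytes([byte])
        yield s, child.visits, child.end_hits
        yield from collect(child, s)


if __name__ == "__main__":
    root = build_tree(b"abababab")
    for s, visits, end_hits in sorted(collect(root)):
        print(s, visits, end_hits)
```

On the input `b"abababab"`, the single byte `a` is visited three times but ends a match only once, because it is usually passed through on the way to `ab` or `aba`. Ranking dictionary candidates by `end_hits` instead of `visits` therefore stops short prefixes from crowding out the longer byte-strings that contain them.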