The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document in PDF (Adobe Acrobat) format.

Date of Patent:

Sep. 26, 2017

Filed:

Mar. 29, 2006

Applicants:

Roger F. Osmond, Littleton, MA (US);

Gil Goren, Ashland, MA (US);

Inventors:

Roger F. Osmond, Littleton, MA (US);

Gil Goren, Ashland, MA (US);

Assignee:

EMC IP HOLDING COMPANY LLC, Hopkinton, MA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 7/00 (2006.01); G06F 17/00 (2006.01); G06F 15/16 (2006.01); G06F 17/22 (2006.01); G06F 17/30 (2006.01); H03M 7/30 (2006.01);
U.S. Cl.
CPC ...
G06F 17/2264 (2013.01); G06F 17/30321 (2013.01); G06F 17/30619 (2013.01); H03M 7/30 (2013.01);
Abstract

Data storage is improved by combining content indexing and data reduction in text-containing files by using common word elimination. Raw data is processed by finding words in selected files, creating an index of found words, and replacing the words in the raw data with pointers to the corresponding words in the index. Each word appears only once in the index. Consequently, the index is relatively small and the procedure is completely reversible. In particular, the index is small relative to other methods because the data is transformed in place, and the transformed data and index are used together to capture the total information about the data.
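
The procedure in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that a "word" is any run of letters, digits, or underscores, that a pointer is simply an integer position in the index, and that text between words is kept verbatim so the transformation stays reversible. The names `encode`, `decode`, and `WORD_RE` are hypothetical and chosen only for this illustration.

```python
import re

# Assumption: a "word" is a run of letters, digits, or underscores.
# The abstract does not define what counts as a word.
WORD_RE = re.compile(r"\w+")


def encode(raw, index=None):
    """Replace each word in `raw` with a pointer into a word index.

    Returns (tokens, index). A token is either an int (a pointer into
    `index`) or a str (the text between two words, kept verbatim).
    Passing an existing `index` lets several files share one index.
    """
    index = [] if index is None else index
    positions = {word: i for i, word in enumerate(index)}
    tokens = []
    cursor = 0
    for match in WORD_RE.finditer(raw):
        if match.start() > cursor:
            # Keep separators (spaces, punctuation) exactly as they were.
            tokens.append(raw[cursor:match.start()])
        word = match.group()
        if word not in positions:
            # Each distinct word is stored only once in the index.
            positions[word] = len(index)
            index.append(word)
        tokens.append(positions[word])
        cursor = match.end()
    if cursor < len(raw):
        tokens.append(raw[cursor:])
    return tokens, index


def decode(tokens, index):
    """Rebuild the original text from the tokens and the index."""
    return "".join(index[t] if isinstance(t, int) else t for t in tokens)


raw = "the cat saw the dog; the dog saw the cat"
tokens, index = encode(raw)
assert decode(tokens, index) == raw   # the round trip is lossless
assert len(index) == len(set(index))  # no word is indexed twice
```

In this sketch the index also serves as a simple content index, since checking whether a word occurs in the encoded data is a lookup in `index`. Neither the token list nor the index alone is enough to recover the text, which matches the abstract's point that the transformed data and the index are used together.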

