The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 07, 1997

Filed:

May. 13, 1994
Applicant:
Inventors:

Jeffrey Owen Kephart, Yorktown Heights, NY (US);

Gregory Bret Sorkin, New York, NY (US);

Gerald James Tesauro, Croton-on-Hudson, NY (US);

Steven Richard White, New York, NY (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06E / ; G06F / ;
U.S. Cl.
CPC ...
395 22 ; 395 20 ; 395 21 ;
Abstract

A data string is a sequence of atomic units of data that represent information. In the context of computer data, examples of data strings include executable programs, data files, and boot records consisting of sequences of bytes, or text files consisting of sequences of bytes or characters. The invention solves the problem of automatically constructing a classifier of data strings, i.e., constructing a classifier which, given a string, determines which of two or more class labels should be assigned to it. From a set of (string, class-label) pairs, this invention provides an automated technique for extracting features of data strings that are relevant to the classification decision, and an automated technique for developing a classifier which uses those features to classify correctly the data strings in the original examples and, with high accuracy, classify correctly novel data strings not contained in the example set. The classifier is developed using 'adaptive' or 'learning' techniques from the domain of statistical regression and classification, such as, e.g., multi-layer neural networks. As an example, the technique can be applied to the task of distinguishing files or boot records that are infected by computer viruses from files or boot records that are not infected.


Find Patent Forward Citations

Loading…