The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, issue date, filing date, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Sep. 17, 2002

Filed:

Feb. 22, 1999
Applicant:
Inventors:

Robert E. Schapire, Maplewood, NJ (US);

Yoram Singer, New Providence, NJ (US);

Assignee:

AT&T Corp., New York, NY (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 1/518 ;
U.S. Cl.
CPC ...
G06F 1/518 ;
Abstract

A method and apparatus are provided for multi-class, multi-label information categorization. A weight is assigned to each information sample in a training set, the training set containing a plurality of information samples, such as text documents, and associated labels. A base hypothesis is determined to predict which labels are associated with a given information sample. The base hypothesis predicts whether or not each label is associated with the information sample, or predicts the likelihood that each label is associated with the information sample. In the case of a document, the base hypothesis evaluates words in each document to determine one or more words that predict the associated labels. When a base hypothesis is determined, the weight assigned to each information sample in the training set is modified based on the base hypothesis predictions.
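The loop the abstract describes (weight the training samples, pick a word-based base hypothesis, then reweight according to its predictions) can be illustrated with a minimal Python sketch of one boosting round. This is not taken from the patent text. It assumes a standard exponential reweighting rule, keeps one weight per (sample, label) pair rather than per sample, and uses a single-word presence test as the base hypothesis. The name `train_round` and its parameters are hypothetical.

```python
import math


def train_round(docs, labels, all_labels, weights):
    """Run one boosting round (illustrative sketch, not the patented method).

    docs:       list of sets of words, one set per document
    labels:     list of sets of labels, one set per document
    all_labels: list of every possible label
    weights:    dict mapping (doc_index, label) -> weight, summing to 1
    """
    vocab = set().union(*docs)
    best = None
    for word in sorted(vocab):
        pred = {}   # (label, word_present) -> predicted sign, +1 or -1
        err = 0.0   # total weight of pairs this word would mispredict
        for label in all_labels:
            for present in (True, False):
                pos = neg = 0.0
                for i, doc in enumerate(docs):
                    if (word in doc) == present:
                        if label in labels[i]:
                            pos += weights[(i, label)]
                        else:
                            neg += weights[(i, label)]
                # Predict whichever sign carries more weight in this bucket.
                pred[(label, present)] = 1 if pos >= neg else -1
                err += min(pos, neg)
        if best is None or err < best[0]:
            best = (err, word, pred)

    err, word, pred = best
    err = min(max(err, 1e-10), 1 - 1e-10)  # keep the logarithm finite
    alpha = 0.5 * math.log((1 - err) / err)

    # Reweight: mispredicted pairs gain weight, correct pairs lose weight.
    new_weights = {}
    for (i, label), w in weights.items():
        y = 1 if label in labels[i] else -1
        h = pred[(label, word in docs[i])]
        new_weights[(i, label)] = w * math.exp(-alpha * y * h)
    total = sum(new_weights.values())
    new_weights = {k: v / total for k, v in new_weights.items()}
    return word, pred, alpha, new_weights


# Toy usage with made-up documents and labels.
docs = [{"goal", "match"}, {"vote", "match"}, {"goal", "vote"}]
labels = [{"sports"}, {"politics"}, {"sports", "politics"}]
all_labels = ["sports", "politics"]
weights = {(i, l): 1 / 6 for i in range(len(docs)) for l in all_labels}
word, pred, alpha, weights = train_round(docs, labels, all_labels, weights)
```

Repeating the round with the returned weights makes later base hypotheses concentrate on the (document, label) pairs that earlier ones got wrong.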

