The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 26, 1993

Filed:

Sep. 24, 1991
Applicant:
Inventor:

Koichi Ejiri, Santa Clara, CA (US);

Assignees:

Ricoh Corporation, Menlo Park, CA (US);

Ricoh Co. Ltd., Tokyo, JP;

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F / ;
U.S. Cl.
CPC ...
364419 ;
Abstract

The present invention provides a method and apparatus for classifying text by using two constants determined by analyzing the text. The first constant, G, classifies text in the order of constraint. It is defined by the equation G=log (N/L)/ {log(N)-1}, where N is the number of words and L is the number of different words in the text being classified. The second constant, R, is the correlation coefficient between the word length and the logarithm scaled rank order of word frequency. The values of the two constants can be used to determine how to classify text. In the case of English text, the text may be classified as computer language, text from a technical manual, English text written by foreigners or English text written by native English speakers.


Find Patent Forward Citations

Loading…