The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Jan. 23, 2001
Filed:
Aug. 10, 1998
Inventor:
Manabu Sassano, Kawasaki, JP;
Assignee:
Fujitsu Limited, Kawasaki, JP;
Abstract
The present invention is intended to allow a user to extract related terms easily and precisely through the use of mutual information, without requiring morphological analysis or syntax analysis. To this end, a related term extraction apparatus is constituted from: preceding-and-subsequent term extraction means for extracting, from text data, a preceding term occurring prior to a specified term or a subsequent term occurring subsequent to it; frequency calculation means for calculating the occurrence frequencies of the specified term, the preceding terms, and the subsequent terms; probability-of-occurrence calculation means for calculating the occurrence probabilities of the preceding and subsequent terms together with the occurrence probability of the specified term; probability-of-concurrence calculation means for calculating the probabilities of the preceding and subsequent terms co-occurring with the specified term; order-dependent degree-of-association calculation means for calculating order-dependent degrees of association of the preceding and subsequent terms co-occurring with the specified term; order-independent degree-of-association calculation means for calculating order-independent degrees of association of the preceding and subsequent terms with the specified term; and term group extraction means for extracting from the text data a group of terms related to the specified term, on the basis of the degree-of-association information calculated by the order-independent degree-of-association calculation means.
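The pipeline described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the actual formulas, so this example makes several assumptions: it uses pointwise mutual information, log2(P(x, y) / (P(x) P(y))), as the degree of association; it treats the input as a pre-split list of tokens; and it obtains the order-independent score by pooling the counts from both orders. The function name `related_terms` and its parameters are hypothetical and do not come from the patent.

```python
import math
from collections import Counter


def related_terms(tokens, target, top_n=5):
    """Rank terms adjacent to `target` by an order-independent PMI score.

    Illustrative only: the scoring formulas are assumptions, not the
    patent's claimed method.
    """
    total = len(tokens)
    unigram = Counter(tokens)                  # occurrence frequencies
    bigram = Counter(zip(tokens, tokens[1:]))  # ordered adjacent pairs
    pair_total = max(total - 1, 1)

    def pmi(joint_count, x, y):
        # Pointwise mutual information: log2(P(x, y) / (P(x) * P(y))).
        if joint_count == 0:
            return float("-inf")
        p_xy = joint_count / pair_total
        return math.log2(p_xy / ((unigram[x] / total) * (unigram[y] / total)))

    # Terms seen immediately before or immediately after the target.
    preceding = {a for (a, b) in bigram if b == target}
    subsequent = {b for (a, b) in bigram if a == target}

    scores = {}
    for term in preceding | subsequent:
        before = bigram[(term, target)]  # term occurs just before target
        after = bigram[(target, term)]   # term occurs just after target
        scores[term] = {
            # Order-dependent degrees of association.
            "preceding": pmi(before, term, target),
            "subsequent": pmi(after, target, term),
            # Order-independent degree: pool the counts of both orders.
            "order_independent": pmi(before + after, term, target),
        }

    ranked = sorted(
        scores.items(),
        key=lambda kv: kv[1]["order_independent"],
        reverse=True,
    )
    return ranked[:top_n]


if __name__ == "__main__":
    text = ("information retrieval uses mutual information "
            "and mutual information helps retrieval")
    for term, s in related_terms(text.split(), "information"):
        print(term, round(s["order_independent"], 3))
```

The final ranking uses only the order-independent score, mirroring the abstract's statement that the term group is extracted on the basis of the order-independent degree of association. The order-dependent scores are kept in the result for inspection.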