The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 01, 2019

Filed:

Jan. 29, 2017
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Kamila Baron-Palucka, Cracow, PL;

Lukasz G. Cmielowski, Cracow, PL;

Marek J. Oszajec, Debica, PL;

Pawel Slowikowski, Cracow, PL;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); H04L 12/58 (2006.01); H04L 29/06 (2006.01);
U.S. Cl.
CPC ...
G06F 17/2785 (2013.01); G06F 17/277 (2013.01); G06F 17/278 (2013.01); G06F 17/2795 (2013.01); H04L 51/32 (2013.01); H04L 67/42 (2013.01);
Abstract

Described herein is an approach for automatically determining the semantic relatedness of documents to semantic concepts. A first text mining analysis extracts a set of reference concepts from reference documents. A second text mining analysis extracts a set of test concepts from test documents that include a mixture of new concepts and reference concepts. An extended co-occurrence matrix is computed that indicates a frequency of co-occurrence (RCCF) of each new and each reference concept in the test documents with all other new and reference concepts. The extended co-occurrence matrix is used for computing a new concept relatedness score (NCRS) for the new concepts. A document similarity score (DSS) is computed for each of the test documents by aggregating, inter alia, the NCRS of each new concept with the RCCF of each reference concept. The DSS represents the semantic relatedness of the test document to the totality of the reference concepts.


Find Patent Forward Citations

Loading…