The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 06, 2022

Filed:

Apr. 09, 2021
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Rachel Tzoref-Brill, Haifa, IL;

Lucas Liu, Leawood, KS (US);

Brian Midei, Monkton, MD (US);

Dagmawi Sraj, Arlington, VA (US);

Thomas North Adams, Round Rock, TX (US);

Tianqiong Wang, Charlotte, NC (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 10/00 (2022.01); G06V 30/418 (2022.01); G06K 9/62 (2022.01); G06V 10/75 (2022.01);
U.S. Cl.
CPC ...
G06V 30/418 (2022.01); G06K 9/623 (2013.01); G06K 9/6215 (2013.01); G06K 9/6218 (2013.01); G06V 10/757 (2022.01);
Abstract

An approach for determining similar text documents. The approach can calculate a first set of vectors for a first cluster of text documents and a first comparison vector for a text document of interest. The approach can select a subset of text documents from the cluster of text documents based on comparing the vectors from the first set of vectors to the first comparison vector and picking a predetermined number of closest comparison text documents. The approach can calculate a second set of vectors for the subset of documents and second comparison vector for the document of interest. The approach can generate similarity ratings for the subset of documents based on pairwise comparisons of the second comparison vector and the second set of vectors. The approach can generate a ranked list of the second cluster of text documents based on the similarity ratings.


Find Patent Forward Citations

Loading…