The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Aug. 11, 2020

Filed:
May 24, 2018

Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Ricardo Balduino, San Jose, CA (US);

Avijit Chatterjee, White Plains, NY (US);

Vinay R. Dandin, White Plains, NY (US);

Aleksandr E. Petrov, Acton, MA (US);

John Thomas, Fishkill, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); G06F 16/36 (2019.01); G06N 20/00 (2019.01); G06F 40/30 (2020.01); G06F 40/289 (2020.01);
U.S. Cl.
CPC ...
G06F 16/36 (2019.01); G06F 40/289 (2020.01); G06F 40/30 (2020.01); G06N 20/00 (2019.01);
Abstract

In a general purpose computer, a method of extracting snippets includes receiving textual content and a plurality of available topics, dividing the textual content into a plurality of snippets, converting each of the snippets to a vector, determining a distance between coadjacent snippets of the plurality of snippets in the textual content, determining an update to the plurality of snippets by merging each of the pairs of coadjacent snippets having a respective distance less than a second threshold, wherein an updated plurality of snippets includes merged snippets, generating a plurality of clusters from the updated plurality of snippets, each cluster associated with one topic selected from the plurality of available topics, and generating, for each of the snippets of the updated plurality of snippets, an affinity score for each of the clusters, each affinity score measuring an assignment strength of a given snippet to a given cluster, and a dominant topic among the at least one identified topic.
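The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it (vectorize, cosine_distance, merge_adjacent, affinity_scores, extract_snippets) is hypothetical. It assumes that snippets start out as sentences, that vectors are simple bag-of-words counts, that distance is cosine distance, and that each cluster is seeded by a keyword string for one available topic rather than learned by a clustering algorithm. The threshold value of 0.8 is likewise only an illustrative choice.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Bag-of-words term counts (assumed stand-in for the real vectorization)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_distance(a, b):
    """1 - cosine similarity of two count vectors; 1.0 if either is empty."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return 1.0 - dot / norm if norm else 1.0


def merge_adjacent(snippets, threshold):
    """Merge each snippet into the previous one when the distance between
    the two original adjacent snippets is below the threshold."""
    vectors = [vectorize(s) for s in snippets]
    merged = []
    for i, snippet in enumerate(snippets):
        if i > 0 and cosine_distance(vectors[i - 1], vectors[i]) < threshold:
            merged[-1] += " " + snippet
        else:
            merged.append(snippet)
    return merged


def affinity_scores(snippet, topics):
    """Cosine similarity of a snippet to each topic's keyword vector
    (assumption: one keyword-seeded cluster per available topic)."""
    v = vectorize(snippet)
    return {name: 1.0 - cosine_distance(v, vectorize(keywords))
            for name, keywords in topics.items()}


def extract_snippets(text, topics, threshold=0.8):
    """Return (snippet, affinity scores, dominant topic) for each merged snippet."""
    # Initial division: one snippet per sentence (an assumption of this sketch).
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    results = []
    for snippet in merge_adjacent(sentences, threshold):
        scores = affinity_scores(snippet, topics)
        dominant = max(scores, key=scores.get)  # highest-affinity topic
        results.append((snippet, scores, dominant))
    return results


if __name__ == "__main__":
    sample = "The battery lasts all day. The battery charges fast. Shipping was slow."
    sample_topics = {"battery": "battery charge power",
                     "delivery": "shipping delivery package"}
    for snippet, scores, dominant in extract_snippets(sample, sample_topics):
        print(dominant, "|", snippet)
```

In this sketch the affinity score doubles as the assignment strength of a snippet to a topic cluster, and the dominant topic is simply the topic with the highest score. A fuller implementation would likely replace the bag-of-words vectors with learned embeddings and the keyword seeding with an actual clustering step.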

