The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent: Sep. 07, 2021

Filed: Jan. 26, 2018

Applicant: International Business Machines Corporation, Armonk, NY (US)

Inventors:
Md Faisal M. Chowdhury, Corona, NY (US)
Sharon M. Trewin, Croton-on-Hudson, NY (US)

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 16/33 (2019.01); G06F 16/335 (2019.01); G06F 16/35 (2019.01); G06F 40/30 (2020.01); G06F 40/242 (2020.01); G06F 40/284 (2020.01);
U.S. Cl.
CPC ...
G06F 16/3344 (2019.01); G06F 16/335 (2019.01); G06F 16/358 (2019.01); G06F 40/242 (2020.01); G06F 40/284 (2020.01); G06F 40/30 (2020.01);
Abstract

A method of extracting jargon from a document corpus stored in a database using a processor and a user interface is described herein. A sub-domain input is entered through the user interface to initiate a review of the document corpus stored in the database. The processor separates the document corpus into at least one sub-corpus and a remainder corpus. The at least one sub-corpus is defined by the sub-domain input. A first topic model and a second topic model are built to generate respective topic similarity scores for at least one term extracted from the at least one sub-corpus and at least one corresponding term extracted from the remainder corpus. The respective topic similarity scores are compared by the processor to identify jargon terms and thereby provide a list of jargon terms through the user interface.
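The steps in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the abstract does not say which topic model or similarity measure is used, so the sketch substitutes per-term co-occurrence counts for a topic model and cosine similarity to each corpus's overall word profile for the topic similarity score. The function names, the `(label, text)` document format, and the `margin` threshold are all hypothetical.

```python
from collections import Counter
from math import sqrt


def split_corpus(docs, sub_domain):
    """Separate (label, text) docs into a sub-corpus and a remainder corpus."""
    sub = [text for label, text in docs if label == sub_domain]
    rest = [text for label, text in docs if label != sub_domain]
    return sub, rest


def context_vectors(texts):
    """Assumed stand-in for a topic model: for each term, count the other
    words that appear in the same document."""
    vectors = {}
    for text in texts:
        words = text.lower().split()
        for term in set(words):
            ctx = vectors.setdefault(term, Counter())
            ctx.update(w for w in words if w != term)
    return vectors


def cosine(a, b):
    """Cosine similarity of two sparse count vectors (0.0 if either is empty)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(vectors):
    """Sum of all term vectors: a crude profile of the corpus as a whole."""
    total = Counter()
    for vec in vectors.values():
        total.update(vec)
    return total


def find_jargon(docs, sub_domain, margin=0.2):
    """Return terms whose similarity score in the sub-corpus exceeds their
    score in the remainder corpus by more than `margin` (hypothetical rule)."""
    sub, rest = split_corpus(docs, sub_domain)
    sub_model, rest_model = context_vectors(sub), context_vectors(rest)
    sub_center, rest_center = centroid(sub_model), centroid(rest_model)
    jargon = []
    for term, vec in sub_model.items():
        if term not in rest_model:
            continue  # only compare terms that occur in both corpora
        score_sub = cosine(vec, sub_center)
        score_rest = cosine(rest_model[term], rest_center)
        if score_sub - score_rest > margin:
            jargon.append(term)
    return sorted(jargon)
```

A call such as `find_jargon(docs, "networking")` would play the role of the sub-domain input, and the returned list corresponds to the list of jargon terms shown to the user. A real system would presumably replace `context_vectors` with an actual topic model and tune the comparison rule.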

