The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 13, 2022

Filed:

Apr. 16, 2020
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Nandana Mihindukulasooriya, Cambridge, MA (US);

Ruchi Mahindru, Elmsford, NY (US);

Md Faisal Mahbub Chowdhury, Woodside, NY (US);

Yu Deng, Yorktown Heights, NY (US);

Alfio Massimiliano Gliozzo, Brooklyn, NY (US);

Sarthak Dash, Jersey City, NJ (US);

Nicolas Rodolfo Fauceglia, Brooklyn, NY (US);

Gaetano Rossiello, Brooklyn, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/62 (2022.01); G06F 40/205 (2020.01); G06N 20/00 (2019.01); G06F 40/40 (2020.01); G06F 40/30 (2020.01); G06N 5/02 (2006.01);
U.S. Cl.
CPC ...
G06K 9/623 (2013.01); G06F 40/205 (2020.01); G06F 40/30 (2020.01); G06F 40/40 (2020.01); G06K 9/6215 (2013.01); G06K 9/6218 (2013.01); G06K 9/6232 (2013.01); G06N 5/02 (2013.01); G06N 20/00 (2019.01);
Abstract

One embodiment of the invention provides a method for terminology ranking for use in natural language processing. The method comprises receiving a list of terms extracted from a corpus, where the list comprises a ranking of the terms based on frequencies of the terms across the corpus. The method further comprises accessing a domain ontology associated with the corpus, and re-ranking the list based on the domain ontology. The resulting re-ranked list comprises a different ranking of the terms based on relevance of the terms using knowledge from the domain ontology. The method further comprises generating clusters of terms via a trained model adapted to the corpus, and boosting a rank of at least one term of the re-ranked list based on the clusters to increase a relevance of the at least one term using knowledge from the trained model.


Find Patent Forward Citations

Loading…