The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 11, 2020

Filed:

Nov. 03, 2016
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Sheng Hua Bao, San Jose, CA (US);

Su Yan, San Jose, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 17/27 (2006.01); G06F 16/33 (2019.01); G06F 16/36 (2019.01); G06F 16/332 (2019.01);
U.S. Cl.
CPC ...
G06F 17/2785 (2013.01); G06F 16/3322 (2019.01); G06F 16/3338 (2019.01); G06F 16/3344 (2019.01); G06F 16/3347 (2019.01); G06F 16/36 (2019.01); G06F 17/271 (2013.01); G06F 17/2735 (2013.01);
Abstract

A data handling system enables the unsupervised creation of an information extraction dictionary by expanding upon a word or phrase included within an expansion query. Prior to receiving the expansion query, the data handling system performs an unsupervised learning of an information corpus which includes text to assign a corpus vector to each word and phrase of the text. After the expansion query, the data handling system compares the expansion query to the corpus vectors. The data handling system ranks the corpus vectors by similarity to the expansion query and provides a ranked list of words or phrases associated with the ranked corpus vectors. The ranked list may be subsequently utilized as the information extraction dictionary.


Find Patent Forward Citations

Loading…