The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 19, 2021

Filed:

Sep. 24, 2018
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Ismini Lourentzou, Urbana, IL (US);

Anna Lisa Gentile, San Jose, CA (US);

Daniel Gruhl, San Jose, CA (US);

Alfredo Alba, Morgan Hill, CA (US);

Chris Kau, Mountain View, CA (US);

Chad DeLuca, Morgan Hill, CA (US);

Linda Kato, San Jose, CA (US);

Petar Ristoski, San Jose, CA (US);

Steven R. Welch, Gilroy, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2019.01); G06F 16/31 (2019.01); G06N 3/04 (2006.01); G06N 3/08 (2006.01); G06F 16/33 (2019.01); G06F 40/211 (2020.01);
U.S. Cl.
CPC ...
G06F 16/313 (2019.01); G06F 16/3347 (2019.01); G06F 40/211 (2020.01); G06N 3/0481 (2013.01); G06N 3/08 (2013.01);
Abstract

One embodiment provides a method for on-demand relation extraction from unstructured text that includes obtaining a text corpus of domain related unstructured text. Representations of the unstructured text that capture entity-specific syntactic knowledge are created. Initial user seeds of informative examples containing relations are received. Extraction models in a neural network are trained using the initial user seeds. Performance information and a confidence score are provided for each prediction for each extraction model. A next batch of informative examples are identified for annotation from the text corpus based on training a neural network classifier on a pool of labeled informative examples. Stopping criteria is determined based on differences of the performance information and the confidence score in relation to parameters for each extraction model. Based on the stopping criteria, it is determined whether to retrain a particular extraction model after the informative examples have been labeled.


Find Patent Forward Citations

Loading…