The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 05, 2017

Filed:

Jul. 28, 2014
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Corville O. Allen, Morrisville, NC (US);

Andrew R. Freed, Cary, NC (US);

Richard A. Salmon, Apex, NC (US);

Beata J. Strack, New York, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/30 (2006.01); G06F 7/00 (2006.01); G06N 5/02 (2006.01); G06N 99/00 (2010.01); G06F 17/27 (2006.01);
U.S. Cl.
CPC ...
G06N 5/02 (2013.01); G06F 17/27 (2013.01); G06N 99/005 (2013.01);
Abstract

A mechanism is provided in a data processing system for corpus quality analysis. The mechanism applies at least one filter to a candidate corpus to determine a degree to which the candidate corpus supplements existing corpora for performing a natural language processing (NLP) operation. Responsive to a determination to add the candidate corpus to the existing corpora based on a result of applying the at least one filter, the mechanism adds the candidate corpus to the existing corpora to form modified corpora. The mechanism performs the NLP operation using the modified corpora.


Find Patent Forward Citations

Loading…