The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jun. 14, 2016

Filed:

Jun. 22, 2012
Applicants:

Glenn M. Lewis, Costa Mesa, CA (US);

Kirill Buryak, Sunnyvale, CA (US);

Aner Ben-artzi, Los Angeles, CA (US);

Jun Peng, San Ramon, CA (US);

Nadav Benbarak, Boston, MA (US);

Inventors:

Glenn M. Lewis, Costa Mesa, CA (US);

Kirill Buryak, Sunnyvale, CA (US);

Aner Ben-Artzi, Los Angeles, CA (US);

Jun Peng, San Ramon, CA (US);

Nadav Benbarak, Boston, MA (US);

Assignee:

Google Inc., Mountain View, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 99/00 (2010.01);
U.S. Cl.
CPC ...
G06N 99/005 (2013.01);
Abstract

A method and system for classifying documents is provided. A set of document classifiers is generated by applying a classification algorithm to a trusted corpus that includes a set of training documents representing a taxonomy. One or more of the generated document classifiers are executed against a plurality of input documents to create a plurality of classified documents. Each classified document is associated with a classification within the taxonomy and a classification confidence level. One or more classified documents that are associated with a classification confidence level below a predetermined threshold value are selected to create a set of low-confidence documents. The low-confidence documents are disassociated from each of the associated classifications. A user is prompted to enter a classification within the taxonomy for at least one low-confidence document. The low-confidence document is associated with the entered classification and with a predetermined confidence level to create a newly classified document.


Find Patent Forward Citations

Loading…