The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 06, 2013

Filed:

Sep. 05, 2008
Applicants:

Adam Turkelson, Lansdale, PA (US);

Huanfeng MA, Drexel Hill, PA (US);

Inventors:

Adam Turkelson, Lansdale, PA (US);

Huanfeng Ma, Drexel Hill, PA (US);

Assignee:

The Neat Company, Inc., Philadelphia, PA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 15/00 (2006.01);
U.S. Cl.
CPC ...
Abstract

An automatic document classification system is described that uses lexical and physical features to assign a class cεC{c, c, . . . , c} to a document d. The primary lexical features are the result of a feature selection method known as Orthogonal Centroid Feature Selection (OCFS). Additional information may be gathered on character type frequencies (digits, letters, and symbols) within d. Physical information is assembled through image analysis to yield physical attributes such as document dimensionality, text alignment, and color distribution. The resulting lexical and physical information is combined into an input vector X and is used to train a supervised neural network to perform the classification.


Find Patent Forward Citations

Loading…