The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 12, 2019

Filed:

Dec. 29, 2015
Applicant:

Information Extraction Systems, Inc., Waban, MA (US);

Inventor:

Alwin B Carus, Waban, MA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 7/00 (2006.01); G06F 17/27 (2006.01);
U.S. Cl.
CPC ...
G06N 7/005 (2013.01); G06F 17/2715 (2013.01);
Abstract

We have invented a process and method for creating a general-purpose adaptive or static machine-learning classifier using prediction by partial matching (PPM) language modeling. This classifier can incorporate homogeneous or heterogeneous feature types; variable-size contexts; sequential or non-sequential features. Features are ordered (linearized) by information saliency; and truncation of least-informative context is used for backoff to handle previously unseen events. Labels may be endogenous (from within the group) or exogenous (outside the group) of the feature types. Classification may generate labels and their probabilities; or only labels. Classification stores may be complete or minimized where redundant states are removed producing significant space savings and performance improvements. Classifiers may be static (unchanging) or online (adaptive or updatable incrementally or in batch). PPM classifiers may be incorporated in ensembles of other PPM classifiers or different machine learning algorithms. Training and prediction algorithms are both simple and efficient; and permit multiple implementations using standard software data structures. These benefits are achieved while providing state-of-the-art prediction performance.


Find Patent Forward Citations

Loading…