The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The linked full patent document is in Adobe Acrobat (PDF) format.

Date of Patent:

Nov. 01, 2016

Filed:

Dec. 16, 2013

Applicant:

Locu, Inc., San Francisco, CA (US);

Inventors:

Jason Ansel, Cambridge, MA (US);

Adam Marcus, Cambridge, MA (US);

Marek Olszewski, San Francisco, CA (US);

Keir Mierle, San Francisco, CA (US);

Assignee:

Go Daddy Operating Company, LLC, Scottsdale, AZ (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2006.01); G06N 99/00 (2010.01); G06F 17/27 (2006.01); G06F 17/30 (2006.01);
U.S. Cl.
CPC ...
G06N 99/005 (2013.01); G06F 17/277 (2013.01); G06F 17/30598 (2013.01);
Abstract

A system and method for data classification are presented. A plurality of training tokens are identified by at least one server communicatively coupled to a network. Each training token includes a token retrieved from a content source and a classification of the token. For each training token in the plurality of training tokens, a plurality of n-gram sequences are identified, a plurality of features for the plurality of n-gram sequences are generated, and first training data is generated using the token retrieved from the content source, the plurality of features, and the classification of the token. A first classifier is trained with the first training data, and the first classifier is stored into a storage system in communication with the at least one server.
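The flow in the abstract (training tokens, n-gram sequences, features, training data, a trained and stored classifier) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: character-level n-grams of length 2 and 3, n-gram counts as the features, a small naive Bayes model as the classifier, `pickle` serialization standing in for the storage system, and the hypothetical menu-style tokens and the `price` and `item` labels.

```python
import math
import pickle
from collections import Counter, defaultdict


def char_ngrams(token, n_values=(2, 3)):
    """Return character n-grams of a token, padded with boundary markers."""
    padded = f"^{token.lower()}$"
    return [padded[i:i + n] for n in n_values for i in range(len(padded) - n + 1)]


def build_training_data(training_tokens):
    """Turn (token, classification) pairs into (token, features, classification) rows."""
    return [(tok, Counter(char_ngrams(tok)), label) for tok, label in training_tokens]


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (an assumed stand-in classifier)."""

    def fit(self, rows):
        self.label_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for _tok, feats, label in rows:
            self.label_counts[label] += 1
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, token):
        feats = Counter(char_ngrams(token))
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            # Smoothed denominator: all feature counts for the label plus vocabulary size.
            denom = sum(self.feature_counts[label].values()) + len(self.vocab)
            score = math.log(count / total)
            for feat, k in feats.items():
                score += k * math.log((self.feature_counts[label][feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical training tokens: (token retrieved from a content source, classification).
training_tokens = [
    ("$12.99", "price"), ("$8.50", "price"), ("$4.00", "price"),
    ("burger", "item"), ("salad", "item"), ("pasta", "item"),
]

rows = build_training_data(training_tokens)
clf = NaiveBayes().fit(rows)

# "Storage" is simulated by serializing to bytes and loading the model back.
blob = pickle.dumps(clf)
restored = pickle.loads(blob)

print(restored.predict("$7.25"))
print(restored.predict("pizza"))
```

Character n-grams are used here because they pick up surface patterns such as a leading currency symbol; the abstract itself does not say whether the n-grams are built over characters, words, or something else.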

