The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 07, 2010

Filed:

Dec. 10, 2002
Applicants:

Nagui Halim, Yorktown Heights, NY (US);

Zhen Liu, Tarrytown, NY (US);

Mark Steven Squillante, Pound Ridge, NY (US);

Honghui Xia, Croton on Hudson, NY (US);

Shun-zheng Yu, Guangdong, CN;

LI Zhang, Yorktown Heights, NY (US);

Inventors:

Nagui Halim, Yorktown Heights, NY (US);

Zhen Liu, Tarrytown, NY (US);

Mark Steven Squillante, Pound Ridge, NY (US);

Honghui Xia, Croton on Hudson, NY (US);

Shun-Zheng Yu, Guangdong, CN;

Li Zhang, Yorktown Heights, NY (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 15/173 (2006.01);
U.S. Cl.
CPC ...
Abstract

Apparatus and methods for classifying web sites are provided. With the apparatus and methods, traffic data is obtained for a plurality of web sites. This patterns, or templates, for each web site are generated based on this traffic data and the patterns are clustered into classes of web sites using a clustering algorithm. The clusters, or classes, are then profiled to generate a template for each class. The template for each class is generated by first shifting the patterns for each web site that is part of the class to compensate for effects like time zone differences, if any, and then identifying a pattern that is most similar to all of the patterns in the class. Once the template for each class is generated, this template is then used with traffic data from a new web site to classify the new web site into one of the existing classes. In other words, when traffic data for a new web site is received, a pattern for the traffic data of the new web site is generated and compared to the templates for the various classes. If a matching class template is identified, the new web site is classified into the corresponding class. If the pattern for the new web site does not match any of the existing templates, a new template and class may be generated based on the pattern for the new web site.


Find Patent Forward Citations

Loading…