The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Feb. 26, 2019

Filed:

Dec. 29, 2014

Applicant:

Google Inc., Mountain View, CA (US);

Inventors:

Amitabh Saikia, Mountain View, CA (US);

Marc-Allen Cartright, Stanford, CA (US);

Luis Garcia Pueyo, San Jose, CA (US);

Vanja Josifovski, Los Gatos, CA (US);

Jie Yang, Sunnyvale, CA (US);

Mike Bendersky, Sunnyvale, CA (US);

MyLinh Yang, Saratoga, CA (US);

Assignee:

GOOGLE LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/30 (2006.01); G06F 17/00 (2006.01); G06N 5/00 (2006.01); G06Q 10/10 (2012.01);
U.S. Cl.
CPC ...
G06F 17/30705 (2013.01); G06F 17/30 (2013.01); G06N 5/003 (2013.01); G06Q 10/107 (2013.01); G06F 17/3053 (2013.01);
Abstract

Methods, apparatus, systems, and computer-readable media are provided for selecting pattern matching segments suitable for electronic communication clustering. A set of pattern matching segments may be identified that match at least one of a corpus of electronic communication addresses. A measure of coverage of each of the set of pattern matching segments across the corpus of electronic communication addresses may be determined. A score associated with each pattern matching segment may be determined based on the measure of coverage and one or more measures of flexibility associated with each of the set of pattern matching segments. One or more of the pattern matching segments may be selected based on the determined scores. A corpus of electronic communications may then be grouped into a plurality of clusters based on a comparison of the one or more selected pattern matching segments to electronic communication addresses associated with the corpus of electronic communications.
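
The pipeline in the abstract (identify matching segments, measure coverage, score against flexibility, select, then cluster) can be illustrated with a minimal Python sketch. The abstract does not say how coverage, flexibility, or the score are computed, so everything here is an assumption made for illustration: patterns are plain regular expressions, coverage is the fraction of addresses matched, flexibility is a hypothetical measure based on how few literal characters a pattern has, and the score multiplies coverage by one minus flexibility. The function names and the `top_k` parameter are likewise hypothetical and are not taken from the patent.

```python
import re
from collections import defaultdict


def coverage(pattern, addresses):
    """Fraction of the address corpus that the pattern fully matches."""
    if not addresses:
        return 0.0
    rx = re.compile(pattern)
    return sum(1 for a in addresses if rx.fullmatch(a)) / len(addresses)


def flexibility(pattern):
    """Hypothetical flexibility measure in (0, 1]: the fewer literal
    characters a pattern has, the more flexible it is. Only the '.*'
    wildcard is recognised in this sketch."""
    literals = pattern.replace(".*", "").replace("\\", "")
    return 1.0 / (1.0 + len(literals))


def score(pattern, addresses):
    """Assumed combination: reward coverage, penalise flexibility."""
    return coverage(pattern, addresses) * (1.0 - flexibility(pattern))


def select_patterns(candidates, addresses, top_k=2):
    """Keep the top_k best-scoring candidates that match at least one address."""
    matching = [p for p in candidates if coverage(p, addresses) > 0]
    return sorted(matching, key=lambda p: score(p, addresses), reverse=True)[:top_k]


def cluster(communications, selected):
    """Group (sender, subject) pairs under the first selected pattern
    that matches the sender address; the rest go to 'unclustered'."""
    clusters = defaultdict(list)
    for sender, subject in communications:
        key = next((p for p in selected if re.fullmatch(p, sender)), "unclustered")
        clusters[key].append(subject)
    return dict(clusters)


# Made-up example data, for illustration only.
mail = [
    ("orders@shop.example", "Your order"),
    ("news@shop.example", "Weekly deals"),
    ("alerts@bank.example", "Statement ready"),
    ("bob@mail.example", "Lunch?"),
]
addresses = [sender for sender, _ in mail]
candidates = [r".*", r".*@shop\.example", r".*@bank\.example"]
selected = select_patterns(candidates, addresses)
print(cluster(mail, selected))
```

Under these assumptions the catch-all pattern `.*` has full coverage but maximal flexibility, so it scores zero and is not selected, while the domain-specific patterns are kept and used as cluster keys. A different weighting of coverage against flexibility would change which patterns are selected.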

