The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 29, 2012

Filed:

Jan. 30, 2009
Applicants:

Ling Qin Zhang, Springboro, OH (US);

Mark Wasson, Seattle, WA (US);

Valentina Templar, Las Vegas, NV (US);

Inventors:

Ling Qin Zhang, Springboro, OH (US);

Mark Wasson, Seattle, WA (US);

Valentina Templar, Las Vegas, NV (US);

Assignee:

LexisNexis Group, Miamisburg, OH (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 15/18 (2006.01);
U.S. Cl.
CPC ...
Abstract

Methods and systems are provided for normalizing strings and for matching records. In one implementation, a string is tokenized into components. Sequences of tags are generated by assigning tags to the components. A sequence of states is determined based on the sequences of tags. A normalized string is generated by normalizing the sequence of the states. A key record including key fields is extracted from a first data source. A candidate record including candidate fields is extracted from a second data source. A numerical record including numerical fields is computed by comparing the key fields and the candidate fields using comparison functions. Matching functions determined by an additive logistic regression method are applied to the numerical fields. Whether the key record and the candidate record are a match is determined based on a sum of results of the matching functions.


Find Patent Forward Citations

Loading…