The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 25, 2014

Filed:

Apr. 30, 2010
Applicants:

Sachindra Joshi, New Delhi, IN;

Tanveer Faruquie, New Delhi, IN;

Hima Prasad Karanam, New Delhi, IN;

Marvin Mendelssohn, Melrose, MA (US);

Mukesh Kumar Mohania, Rajpur Chungi, IN;

Angel Marie Smith, Pepperell, MA (US);

L Venkata Subramaniam, Gurgaon, IN;

Girish Venkatachaliah, San Jose, CA (US);

Inventors:

Sachindra Joshi, New Delhi, IN;

Tanveer Faruquie, New Delhi, IN;

Hima Prasad Karanam, New Delhi, IN;

Marvin Mendelssohn, Melrose, MA (US);

Mukesh Kumar Mohania, Rajpur Chungi, IN;

Angel Marie Smith, Pepperell, MA (US);

L Venkata Subramaniam, Gurgaon, IN;

Girish Venkatachaliah, San Jose, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 7/00 (2006.01); G06F 17/00 (2006.01);
U.S. Cl.
CPC ...
Abstract

A clustering-based approach to data standardization is provided. Certain embodiments take as input a plurality of addresses, identify one or more features of the addresses, cluster the addresses based on the one or more features, utilize the cluster(s) to provide a data-based context useful in identifying one or more synonyms for elements contained in the address(es), and standardize the address(es) to an acceptable format, with one or more synonyms and/or other elements being added to or taken away from the input address(es) as part of the standardization process.


Find Patent Forward Citations

Loading…