The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Jan. 08, 2013

Filed:

Apr. 21, 2010
Applicants:

Ariel Fuxman, Redmond, WA (US);

Hoa Nguyen, Salt Lake City, UT (US);

Juliana Freire de Lima e Silva, Salt Lake City, UT (US);

Stelios Paparizos, Redmond, WA (US);

Rakesh Agrawal, Redmond, WA (US);

Zhimin Chen, Redmond, WA (US);

Lawrence William Colagiovanni, Issaquah, WA (US);

Prakash Sikchi, Redmond, WA (US);

Inventors:

Ariel Fuxman, Redmond, WA (US);

Hoa Nguyen, Salt Lake City, UT (US);

Juliana Freire de Lima e Silva, Salt Lake City, UT (US);

Stelios Paparizos, Redmond, WA (US);

Rakesh Agrawal, Redmond, WA (US);

Zhimin Chen, Redmond, WA (US);

Lawrence William Colagiovanni, Issaquah, WA (US);

Prakash Sikchi, Redmond, WA (US);

Assignee:

Microsoft Corporation, Redmond, WA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06Q 10/00 (2012.01); G06Q 30/00 (2012.01);
U.S. Cl.
CPC ...
Abstract

Methods and systems for automatically synthesizing product information from multiple data sources into an on-line catalog are disclosed, and in particular, for automatically synthesizing the product information based on attribute-value pairs. Information for a product may be obtained, via entity extraction, feed ingestion, and other mechanisms, from a plurality of structured and unstructured data sources having different taxonomies and schemas. Product information may additionally or alternatively be obtained or derived based on popularity data. The product information may be cleansed, segmented and normalized. The product information may be clustered so closest products, attribute names and attribute values are associated. A representative value for an attribute name may be determined, and the on-line catalog may be updated so that entries are comprehensive, meaningful and useful to a catalog user. Updates from at least 500 million different data sources may be scheduled to occur as frequently as several times daily.
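The normalize-then-pick-a-representative-value step described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `normalize` and `synthesize`, the sample records, the use of an exact match on a normalized product name as a stand-in for clustering, and the choice of majority vote as the way to select a representative value.

```python
from collections import Counter, defaultdict


def normalize(text):
    """Lowercase and collapse whitespace (a simple stand-in for real cleansing)."""
    return " ".join(text.lower().split())


def synthesize(offers):
    """Merge (product, attribute, value) records into one catalog entry per product.

    Assumption: records are grouped by exact match on the normalized product
    name, and the representative value is chosen by majority vote. The patent
    abstract does not state that either rule is the method actually used.
    """
    votes = defaultdict(lambda: defaultdict(Counter))
    for product, attribute, value in offers:
        votes[normalize(product)][normalize(attribute)][normalize(value)] += 1

    catalog = {}
    for product, attributes in votes.items():
        catalog[product] = {
            name: counts.most_common(1)[0][0]
            for name, counts in attributes.items()
        }
    return catalog


# Hypothetical records, as if gathered from several sources with differing formats.
offers = [
    ("Camera X100", "Resolution", "12 MP"),
    ("camera x100", "resolution", "12 mp"),
    ("Camera  X100", "Resolution", "10 MP"),
    ("Camera X100", "Color", "Black"),
]
catalog = synthesize(offers)
print(catalog)
```

In this sketch the three differently formatted "resolution" records collapse onto one product and one attribute name, and the value reported by two of the three sources is kept. A real system would likely need fuzzy matching of products and attribute names rather than exact matches on normalized strings.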

