The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Nov. 05, 2013
Filed:
Jul. 28, 2011
Jianhui Wu, Seattle, WA (US);
Srikanth Thirumalai, Clyde Hill, WA (US);
Jianhui Wu, Seattle, WA (US);
Srikanth Thirumalai, Clyde Hill, WA (US);
Amazon Technologies, Inc., Reno, NV (US);
Abstract
Embodiments may generate a population of candidate rules including multiple rule conditions for detecting duplicates, each duplicate representing different sets of item description information that describe a common item. For each candidate rule of the population, embodiments may apply that rule to a reference data set including known duplicates and non-duplicates. Embodiments may assign each candidate rule a fitness score generated with a fitness function based on the performance of that candidate rule. Embodiments may, based on the fitness scores, select a subset of the population of candidate rules as parents for the new generation of candidate rules. Embodiments may perform crossover and/or mutation operations on the parent candidate rules to generate the new generation of candidate rules. Embodiments may select from the new generation of candidate rules (or from subsequent generations of candidate rules), rules for inclusion within a rule set for detecting duplicates within item description information.