The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 08, 2011

Filed:

Dec. 10, 2007
Applicants:

Srikanth Thirumalai, Clyde Hill, WA (US);

Aswath Manoharan, Bellevue, WA (US);

Mark J. Tomko, Seattle, WA (US);

Grant M. Emery, Seattle, WA (US);

Vijai Mohan, Bellevue, WA (US);

Egidio Terra, Porto Alegre, BR;

Inventors:

Srikanth Thirumalai, Clyde Hill, WA (US);

Aswath Manoharan, Bellevue, WA (US);

Mark J. Tomko, Seattle, WA (US);

Grant M. Emery, Seattle, WA (US);

Vijai Mohan, Bellevue, WA (US);

Egidio Terra, Porto Alegre, BR;

Assignee:

Amazon Technologies, Inc., Reno, NV (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 7/00 (2006.01); G06F 17/00 (2006.01);
U.S. Cl.
CPC ...
Abstract

Systems and methods for determining whether a first document is a potential duplicate of a second document such that the two documents describe the same or substantially the same subject matter, wherein the first and second documents include attribute data in attribute fields. A set of rules is obtained for determining whether the first document is a potential duplicate of the second document. Moreover, for each rule in the set of rules, a determination is made as to whether data in a first set of attributes of the first document is contained in a second set of attributes of the second document. According to the results of the evaluated rules in the rules set, determining whether the first document is a potential duplicate of the second document. If, according to the evaluated rules in the rules set, the first document is determined to be a potential duplicate of the second document, storing a reference to the first document in a set of potential duplicates of the second document.


Find Patent Forward Citations

Loading…