The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Nov. 14, 2017
Filed:
Feb. 17, 2015
Amazon Technologies, Inc., Seattle, WA (US);
Roshan Ram Rammohan, Seattle, WA (US);
Jeremy Leon Calvert, Seattle, WA (US);
Deept Kumar, Seattle, WA (US);
Ismail Baha Tutar, Seattle, WA (US);
Amazon Technologies, Inc., Seattle, WA (US);
Abstract
Technologies are disclosed herein for generating and utilizing machine-learning generated classifiers configured to identify document relationships. Manually-generated data is captured that indicates if documents in a document corpus have a relationship with one another, such as duplicates or variations. A determination may then be made as to whether a classifier is to be generated based on the duplicate decision data. If a classifier is to be generated, machine learning may be performed using training documents from the document corpus and the duplicate decision data to generate a classifier. The machine-learning generated classifier may then be utilized in a production environment to determine whether a new document is a duplicate of documents in the document corpus and/or to identify other relationships between documents in the document corpus, such as documents that are similar or are variations of one another.