The patent badge is an abbreviated version of the USPTO patent document and contains a link to the full patent document.

The badge covers the following: patent number, issue date, filing date, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The linked full patent document is in Adobe Acrobat (PDF) format.

Date of Patent: Sep. 10, 2024

Filed: Nov. 14, 2019

Applicant: Amazon Technologies, Inc., Seattle, WA (US)

Inventors: Tarik Arici, Seattle, WA (US); Ismail Baha Tutar, Seattle, WA (US)

Assignee: Amazon Technologies, Inc., Seattle, WA (US)

Attorneys:
Primary Examiner:
Int. Cl.: CPC ... G06Q 30/02 (2023.01); G06F 40/284 (2020.01); G06Q 30/0601 (2023.01)

U.S. Cl.: CPC ... G06Q 30/0603 (2013.01); G06F 40/284 (2020.01); G06Q 30/0625 (2013.01); G06Q 30/0631 (2013.01)

Abstract

Methods, systems, and computer-readable media for similarity detection based on token distinctiveness are disclosed. A similarity detection system determines candidate items for a seed item based on a comparison of tokens in textual descriptions of the candidate items to tokens in a textual description of the seed item. The system uses machine learning to determine importance scores for the tokens of the seed item. An importance score is determined based on the frequency of the individual token and the frequency of the most commonly occurring token in the candidate items. The importance score for the same token differs from one seed item to another. Based on the importance scores, the system determines similarity scores for the candidate items to the seed item. The system selects, from the candidate items, a set of items similar to the seed item based (at least in part) on the similarity scores.
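
The pipeline in the abstract (candidate selection, per-seed token importance, similarity scoring, selection) can be illustrated with a minimal Python sketch. This is not the patented method: the abstract gives no formula and says the importance scores come from machine learning, so the sketch substitutes a fixed heuristic, `1 - freq(token) / freq(most common token)`, chosen only because it uses the same two frequencies the abstract names. The whitespace tokenizer, the function names, the `top_k` parameter, and the sample catalog are all hypothetical.

```python
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (assumption; the abstract names none)."""
    return text.lower().split()


def candidate_items(seed, catalog):
    """Keep catalog items that share at least one token with the seed."""
    seed_tokens = set(tokenize(seed))
    return [item for item in catalog if seed_tokens & set(tokenize(item))]


def importance_scores(seed, candidates):
    """Score each seed token by how rare it is among the candidates.

    Hypothetical heuristic: 1 - freq(token) / freq(most common token),
    where freq counts the candidates containing the token.
    """
    counts = Counter()
    for item in candidates:
        counts.update(set(tokenize(item)))
    max_freq = max(counts.values(), default=1)
    return {tok: 1.0 - counts[tok] / max_freq for tok in set(tokenize(seed))}


def similarity(candidate, scores):
    """Importance-weighted share of seed tokens found in the candidate."""
    total = sum(scores.values())
    if total == 0:
        return 0.0  # no seed token is distinctive among the candidates
    cand_tokens = set(tokenize(candidate))
    return sum(s for tok, s in scores.items() if tok in cand_tokens) / total


def similar_items(seed, catalog, top_k=2):
    """Rank the candidates by similarity to the seed and keep the top_k."""
    candidates = candidate_items(seed, catalog)
    scores = importance_scores(seed, candidates)
    ranked = sorted(candidates, key=lambda c: similarity(c, scores), reverse=True)
    return ranked[:top_k]


catalog = [
    "acme wireless mouse black",
    "acme wired keyboard black",
    "zeta wireless mouse white",
    "acme usb cable black",
]
top = similar_items("acme wireless mouse", catalog)
```

In this sketch the frequencies are counted over the candidate set of each seed, so the same token can receive a different score for a different seed, which matches the behavior the abstract describes. Here the brand token `acme` appears in the most candidates and so contributes nothing, while `wireless` and `mouse` decide the ranking.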

