The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Sep. 23, 2025
Filed:
Dec. 14, 2022
Applicant:
Amazon Technologies, Inc., Reno, NV (US);
Inventors:
Kasturi Bhattacharjee, Sunnyvale, CA (US);
Rashmi Gangadharaiah, San Jose, CA (US);
Senthil C Chidambaram, Folsom, CA (US);
Ankit Kapoor, Seattle, WA (US);
Sharon Shapira, Sammamish, WA (US);
Tony Chun Tung Ng, San Ramon, CA (US);
Deepak Seetharam Nadig, San Jose, CA (US);
Assignee:
Amazon Technologies, Inc., Reno, NV (US);
Abstract
Systems and methods are used to detect underlying themes from a collection of documents at an aggregated level. A representative set of documents may be selected from a cluster of documents, with the representative set of documents corresponding to a general theme of the cluster. Candidate theme phrases may then be extracted from the documents and used to generate document embeddings and candidate phrase embeddings, which may be ranked, such as with a diversity-based ranking approach. Certain candidates may be selected from the ranking. Each of the documents forming the representative set may then be concatenated and a query embedding may be generated and ranked against the candidate phrases. In this manner, a collection of phrases associated with both the general underlying theme of the cluster, along with granular topics associated with that theme, may be identified.
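The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and every concrete choice here is an assumption: bag-of-words counts stand in for learned embeddings, representative documents are picked by similarity to the cluster centroid, candidate phrases are stopword-free bigrams, and maximal marginal relevance (MMR) is used as one possible diversity-based ranking. All function names and parameters are hypothetical.

```python
import math
import re
from collections import Counter

# Assumption: a tiny stopword list, just enough for the illustration.
STOPWORDS = {"a", "an", "and", "for", "in", "is", "my", "not", "of", "on", "the", "to", "was", "with"}

def tokenize(text):
    return re.findall(r"[a-z]+", text.lower())

def embed(text):
    """Toy embedding: sparse bag-of-words counts (stand-in for a learned encoder)."""
    return Counter(t for t in tokenize(text) if t not in STOPWORDS)

def cosine(a, b):
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def representative_docs(docs, k):
    """Assumed selection rule: the k documents closest to the cluster centroid."""
    centroid = Counter()
    for d in docs:
        centroid.update(embed(d))
    return sorted(docs, key=lambda d: cosine(embed(d), centroid), reverse=True)[:k]

def candidate_phrases(doc, n=2):
    """Hypothetical extractor: contiguous n-grams that contain no stopwords."""
    toks = tokenize(doc)
    grams = (toks[i:i + n] for i in range(len(toks) - n + 1))
    return {" ".join(g) for g in grams if not any(t in STOPWORDS for t in g)}

def mmr(query_vec, phrases, top_n, lam=0.7):
    """Maximal marginal relevance: trade relevance to the query against redundancy."""
    selected = []
    remaining = sorted(phrases)  # sorted for deterministic tie-breaking
    while remaining and len(selected) < top_n:
        def score(p):
            relevance = cosine(embed(p), query_vec)
            redundancy = max((cosine(embed(p), embed(s)) for s in selected), default=0.0)
            return lam * relevance - (1 - lam) * redundancy
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected

def theme_phrases(docs, k=3, per_doc=2, top_n=3):
    reps = representative_docs(docs, k)
    # Stage 1: diversity-based ranking of each document's candidate phrases.
    shortlisted = set()
    for d in reps:
        shortlisted.update(mmr(embed(d), candidate_phrases(d), per_doc))
    # Stage 2: concatenate the representative documents into one query and re-rank.
    query_vec = embed(" ".join(reps))
    return sorted(shortlisted, key=lambda p: (-cosine(embed(p), query_vec), p))[:top_n]

if __name__ == "__main__":
    cluster = [
        "The battery drains fast after the update",
        "Battery life is short and battery drains quickly",
        "Screen cracked on delivery",
        "Battery drains overnight",
    ]
    print(theme_phrases(cluster))
```

The first stage shortlists a few diverse phrases per representative document, and the second stage ranks that shortlist against a single embedding of the concatenated documents, so the output mixes the general theme with more granular topics. In practice the toy `embed` function would be replaced by a sentence-embedding model.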