The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in PDF format).

Date of Patent: Jun. 10, 2025

Filed: Apr. 19, 2023

Applicant: International Business Machines Corporation, Armonk, NY (US)

Inventors:
Matan Vetzler, Givat Shmuel, IL
Koren Ran Lazar, Jerusalem, IL
Boaz Carmeli, Koranit, IL
Ateret Anaby-Tavor, Givat Ada, IL

Attorneys:
Primary Examiner:
Int. Cl.:
CPC ... G06F 40/30 (2020.01); G06V 30/19 (2022.01);
U.S. Cl.:
CPC ... G06F 40/30 (2020.01); G06V 30/19093 (2022.01);
Abstract

A computer-implemented method, system and computer program product for identifying a semantic representation for a set of texts. The set of texts is encoded using a language model to obtain a set of corresponding texts' contextualized embeddings. A centroid of the contextualized embeddings is then calculated. A user-designated number of words from a pre-defined vocabulary of the language model that have their non-contextualized embeddings closest to the centroid are then identified. Furthermore, permutations of the identified words are calculated using an n-gram range. The permutations are then encoded using the language model in a contextualized manner. The encoded permutation from the encoded permutations with the greatest similarity to the centroid is identified. The identified encoded permutation is then assigned as the semantic representation of the set of texts. In this manner, the semantic representation is more effectively identified by expanding the embedding space upon which the semantic representation is chosen.
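
The steps in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented implementation: the five-word `VOCAB`, its hand-made 3-dimensional vectors, and the `encode` function (a position-weighted average standing in for a real language model's contextualized encoder) are all hypothetical, as are the names `cosine` and `semantic_label`. Cosine similarity is assumed both for "closest" and for "greatest similarity", since the abstract does not name a metric.

```python
from itertools import permutations

import numpy as np

# Hypothetical toy vocabulary: word -> non-contextualized embedding.
# A real system would use the language model's own vocabulary embeddings.
VOCAB = {
    "dog": np.array([1.0, 0.1, 0.0]),
    "cat": np.array([0.9, 0.2, 0.0]),
    "food": np.array([0.1, 1.0, 0.0]),
    "car": np.array([0.0, 0.0, 1.0]),
    "road": np.array([0.0, 0.1, 0.9]),
}


def encode(text):
    """Toy stand-in for a contextualized encoder (assumption).

    Returns a position-weighted mean of the known word vectors, so that
    word order changes the result, as it would with a real language model.
    """
    vectors = [VOCAB[t] for t in text.lower().split() if t in VOCAB]
    if not vectors:
        return np.zeros(3)
    weights = np.array([1.0 / (i + 1) for i in range(len(vectors))])
    return (weights[:, None] * np.stack(vectors)).sum(axis=0) / weights.sum()


def cosine(a, b):
    """Cosine similarity, returning 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def semantic_label(texts, k=2, ngram_range=(1, 2)):
    # 1. Encode the texts and compute the centroid of their embeddings.
    centroid = np.stack([encode(t) for t in texts]).mean(axis=0)
    # 2. Pick the k vocabulary words whose non-contextualized
    #    embeddings are closest to the centroid.
    ranked = sorted(VOCAB, key=lambda w: cosine(VOCAB[w], centroid), reverse=True)
    top_words = ranked[:k]
    # 3. Build permutations of those words over the n-gram range.
    lo, hi = ngram_range
    candidates = [
        " ".join(p) for n in range(lo, hi + 1) for p in permutations(top_words, n)
    ]
    # 4. Encode each permutation and keep the one most similar to the centroid.
    return max(candidates, key=lambda c: cosine(encode(c), centroid))


label = semantic_label(["dog food", "cat food", "dog cat"], k=2, ngram_range=(1, 2))
print(label)
```

Here `k` plays the role of the user-designated number of words and `ngram_range` bounds the length of each candidate phrase. Because the candidates are re-encoded as phrases rather than scored word by word, the chosen label can come from a larger embedding space than the single-word vocabulary, which is the effect the abstract describes.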

