The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 29, 2025

Filed:

Sep. 23, 2022
Applicant:

Salesforce, Inc., San Francisco, CA (US);

Inventors:

Anthony Meng Huat Tiong, Singapore, SG;

Junnan Li, Singapore, SG;

Chu Hong Hoi, Singapore, SG;

Assignee:

Salesforce, Inc., San Francisco, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 10/86 (2022.01); G06N 3/045 (2023.01); G06V 10/26 (2022.01); G06V 10/774 (2022.01); G06V 10/82 (2022.01);
U.S. Cl.
CPC ...
G06V 10/86 (2022.01); G06N 3/045 (2023.01); G06V 10/26 (2022.01); G06V 10/774 (2022.01); G06V 10/82 (2022.01);
Abstract

Embodiments described herein provide a zero-shot visual question answering (VQA) framework, which conjoins foundation network models with zero additional training. A first image and a question relating to the first image are received. The first image is divided into a plurality of image patches. A plurality of relevant image patches that are relevant to the question are determined, using a first neural network model, from the plurality of image patches. A plurality of image captions are generated, using a second neural network model, based on the plurality of relevant image patches. An answer to the question is generated based on the plurality of image captions.


Find Patent Forward Citations

Loading…