The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 27, 2024

Filed:

May. 31, 2022
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Richard Chen, Baldwin Place, NY (US);

Rameswar Panda, Medford, MA (US);

Quanfu Fan, Lexington, MA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 10/96 (2022.01); G06V 10/25 (2022.01); G06V 10/77 (2022.01);
U.S. Cl.
CPC ...
G06V 10/96 (2022.01); G06V 10/25 (2022.01); G06V 10/7715 (2022.01);
Abstract

Techniques and apparatus for analyzing visual content using a visual transformer are described. An example technique includes generating a first set of tokens based on a visual content item. Each token in the first set of tokens is associated with a regional feature from a different region of a plurality of regions of the visual content item. A second set of tokens is generated based on the visual content item. Each token in the second set of tokens is associated with a local feature from one of the plurality of regions of the visual content item. At least one feature map is generated for the visual content item, based on analyzing the first set of tokens and the second set of tokens separately using a hierarchical vision transformer. At least one vision task is performed based on the at least one feature map.


Find Patent Forward Citations

Loading…