The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 27, 2024

Filed:

Mar. 02, 2021
Applicants:

Beijing Jingdong Shangke Information Technology Co., Ltd., Beijing, CN;

Beijing Jingdong Century Trading Co., Ltd., Beijing, CN;

Inventors:

Yingwei Pan, Beijing, CN;

Yehao Li, Beijing, CN;

Ting Yao, Beijing, CN;

Tao Mei, Beijing, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 20/70 (2022.01); G06T 7/70 (2017.01); G06V 10/25 (2022.01); G06V 10/44 (2022.01); G06V 10/46 (2022.01); G06V 10/80 (2022.01);
U.S. Cl.
CPC ...
G06V 20/70 (2022.01); G06T 7/70 (2017.01); G06V 10/25 (2022.01); G06V 10/44 (2022.01); G06V 10/462 (2022.01); G06V 10/806 (2022.01); G06V 2201/07 (2022.01);
Abstract

The present disclosure relates to the technical field of image processing, and in particular to an image description generation method, apparatus and system, and a medium and an electronic device. The method comprises: acquiring one or more image region features in a target image, and obtaining a current input vector by performing a mean pooling on the image region features; obtaining respective outer product vectors of the image region features by respectively linearly fusing the current input vector and each of the image region features; calculating, based on the respective outer product vectors of the image region features, an attention distribution of the image region features in a spatial dimension and an attention distribution of the image region features in a channel dimension; and generating an image description of the target image based on the attention distribution of the image region features in the spatial dimension and the attention distribution of the image region features in the channel dimension.


Find Patent Forward Citations

Loading…