The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 22, 2025

Filed:

Oct. 31, 2022
Applicant:

Tencent Technology (Shenzhen) Company Limited, Guangdong, CN;

Inventors:

Boyuan Jiang, Guangdong, CN;

Donghao Luo, Guangdong, CN;

Mingyu Wu, Guangdong, CN;

Yabiao Wang, Guangdong, CN;

Chengjie Wang, Guangdong, CN;

Xiaoming Huang, Guangdong, CN;

Jilin Li, Guangdong, CN;

Feiyue Huang, Guangdong, CN;

Yongjian Wu, Guangdong, CN;

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06V 40/20 (2022.01); G06V 10/56 (2022.01); G06V 10/74 (2022.01); G06V 10/75 (2022.01); G06V 10/80 (2022.01); G06V 20/40 (2022.01);
U.S. Cl.
CPC ...
G06V 40/20 (2022.01); G06V 10/56 (2022.01); G06V 10/757 (2022.01); G06V 10/761 (2022.01); G06V 10/806 (2022.01); G06V 20/46 (2022.01); G06V 20/48 (2022.01);
Abstract

The present subject matter discloses an action recognition method, apparatus and device, a storage medium, and a computer program product, belonging to the field of image recognition. Multiple video frames in a target video are obtained. Feature extraction is performed on the multiple video frames respectively according to multiple dimensions to obtain multiple multi-channel feature patterns. Each video frame corresponds to one multi-channel feature pattern. Each channel represents one dimension. An attention weight of each multi-channel feature pattern is determined based on a similarity between every two multi-channel feature patterns. The attention weight is used for representing a degree of correlation between a corresponding multi-channel feature pattern and an action performed by an object in the target video. A type of the action is determined based on the multiple multi-channel feature patterns and the determined multiple attention weights.


Find Patent Forward Citations

Loading…