The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, issue date, filing date, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document in PDF format.

Date of Patent:
Jan. 09, 2024

Filed:
Feb. 23, 2021

Applicant:
Tencent Technology (Shenzhen) Company Limited, Shenzhen, CN;

Inventors:
Bairui Wang, Shenzhen, CN;
Lin Ma, Shenzhen, CN;
Yang Feng, Shenzhen, CN;
Wei Liu, Shenzhen, CN;

Int. Cl.
CPC ...
G06F 40/56 (2020.01); G06N 20/00 (2019.01); G06F 40/40 (2020.01); G06V 20/40 (2022.01); G06F 18/25 (2023.01); G06V 10/80 (2022.01); G06N 3/08 (2023.01); G06N 3/044 (2023.01);
U.S. Cl.
CPC ...
G06F 40/56 (2020.01); G06F 18/253 (2023.01); G06F 40/40 (2020.01); G06N 20/00 (2019.01); G06V 10/806 (2022.01); G06V 20/46 (2022.01); G06N 3/044 (2023.01); G06N 3/08 (2013.01);
Abstract

The present disclosure describes methods, devices, and storage media for generating a natural language description for a media object. The method includes respectively processing, by a device, a media object by using a plurality of natural language description models to obtain a plurality of first feature vectors corresponding to a plurality of feature types. The device includes a memory storing instructions and a processor in communication with the memory. The method also includes fusing, by the device, the plurality of first feature vectors to obtain a second feature vector; and generating, by the device, a natural language description for the media object according to the second feature vector, the natural language description being used for expressing the media object in natural language. The present disclosure resolves the technical problem that a natural language description generated for a media object can only give an insufficiently accurate description of the media object.
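
The three steps named in the abstract (per-model feature extraction, fusion, description generation) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the function names, the toy models that return one score per vocabulary word, the element-wise mean used as the fusion rule, and the top-k word decoder. The abstract does not say how the models, the fusion, or the generator are actually implemented.

```python
from typing import Callable, List, Sequence

Vector = List[float]
# Hypothetical stand-in for a description model: maps a media object to a
# feature vector (in this toy, one score per vocabulary word).
FeatureModel = Callable[[object], Vector]


def extract_first_features(media_object: object,
                           models: Sequence[FeatureModel]) -> List[Vector]:
    """Run every model on the same media object, yielding one vector each."""
    return [model(media_object) for model in models]


def fuse(first_vectors: Sequence[Vector]) -> Vector:
    """Fuse by element-wise mean (an assumed rule, not the patent's)."""
    if not first_vectors:
        raise ValueError("need at least one feature vector")
    size = len(first_vectors[0])
    if any(len(v) != size for v in first_vectors):
        raise ValueError("all feature vectors must have the same length")
    return [sum(column) / len(first_vectors) for column in zip(*first_vectors)]


def generate_description(second_vector: Vector,
                         vocabulary: Sequence[str],
                         top_k: int = 2) -> str:
    """Toy decoder: join the top_k highest-scoring words, best first."""
    if len(second_vector) != len(vocabulary):
        raise ValueError("vector and vocabulary must have the same length")
    ranked = sorted(range(len(vocabulary)),
                    key=lambda i: second_vector[i], reverse=True)
    return " ".join(vocabulary[i] for i in ranked[:top_k])


if __name__ == "__main__":
    vocabulary = ["dog", "runs", "outside"]
    # Two hypothetical feature types, e.g. appearance and motion.
    appearance_model = lambda media: [0.9, 0.1, 0.2]
    motion_model = lambda media: [0.1, 0.8, 0.3]

    # "clip.mp4" is a placeholder name; the toy models ignore their input.
    first = extract_first_features("clip.mp4", [appearance_model, motion_model])
    second = fuse(first)
    print(generate_description(second, vocabulary))  # prints "dog runs"
```

In this toy run neither model alone ranks both "dog" and "runs" highly, while the fused vector does, which is the intuition behind combining several feature types before generating the description.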

