The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 02, 2024

Filed:

Dec. 26, 2023
Applicant:

Chung Ang University Industry Academic Cooperation Foundation, Seoul, KR;

Inventors:

Jong Won Choi, Seoul, KR;

Soo Hyun Park, Gangwon-do, KR;

Jong Su Youn, Seoul, KR;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 10/778 (2022.01); G06V 10/74 (2022.01); G06V 10/774 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/40 (2022.01); G10L 25/30 (2013.01);
U.S. Cl.
CPC ...
G06V 10/7792 (2022.01); G06V 10/761 (2022.01); G06V 10/774 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/46 (2022.01); G10L 25/30 (2013.01);
Abstract

An apparatus for video representation learning according to an embodiment may extract video features from video data to generate a video embedding, extract image features from image data extracted from the video data to generate an image embedding, and extract audio features from audio data extracted from the video data to generate an audio embedding. Further, contrastive learning may be performed by generating a first compositional embedding based on the video embedding and the audio embedding, generating a second compositional embedding based on the video embedding and the audio embedding, generating a positive sample and a negative sample based on a correlation between the image embedding and the audio embedding, and then using the data.


Find Patent Forward Citations

Loading…