The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Jan. 20, 2026

Filed:

Apr. 24, 2023

Applicant:

Sony Interactive Entertainment Inc., Tokyo, JP;

Inventors:

Sudha Krishnamurthy, Foster City, CA (US);

Justice Adams, San Mateo, CA (US);

Arindam Jati, Los Angeles, CA (US);

Masanori Omote, Half Moon Bay, CA (US);

Jian Zheng, Binghamton, NY (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/65 (2019.01); A63F 13/60 (2014.01); G06N 3/045 (2023.01); G06V 10/44 (2022.01); G06V 10/764 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 20/40 (2022.01); G06V 20/70 (2022.01); G10L 13/02 (2013.01); G10L 15/16 (2006.01); G10L 15/26 (2006.01);
U.S. Cl.
CPC ...
G06F 16/65 (2019.01); A63F 13/60 (2014.09); G06N 3/045 (2023.01); G06V 10/454 (2022.01); G06V 10/764 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 20/41 (2022.01); G06V 20/70 (2022.01); G10L 13/02 (2013.01); G10L 15/16 (2013.01); G10L 15/26 (2013.01); G06V 20/44 (2022.01);
Abstract

A system enhances existing audio-visual content using a scene annotation module and an action description module, both of which are coupled to a controller. The scene annotation module classifies scene elements from an image frame received from a host system and generates a caption describing the scene elements. The scene annotation module includes a first neural network configured to generate a feature vector from the image frame and a second neural network configured to generate, from the feature vector, a caption describing elements within the image frame. The action description module recognizes action happening within one or more image frames received from the host system and generates a description of that action.
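
The data flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the two "networks" and the action describer are toy functions standing in for trained models, and every name here (`toy_feature_extractor`, `toy_caption_decoder`, `toy_action_describer`, `SceneAnnotationModule`, `Controller`) is hypothetical. Captioning only the most recent frame is also an assumption made for the example. The sketch shows only how the pieces might be wired together: frame to feature vector, feature vector to caption, and a separate action description over one or more frames.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

# Hypothetical data types: a frame is a grid of pixel intensities in [0, 1].
Frame = Sequence[Sequence[float]]
FeatureVector = List[float]


def toy_feature_extractor(frame: Frame) -> FeatureVector:
    """Stand-in for the first neural network: image frame -> feature vector."""
    pixels = [p for row in frame for p in row]
    return [sum(pixels) / len(pixels), max(pixels), min(pixels)]


def toy_caption_decoder(features: FeatureVector) -> str:
    """Stand-in for the second neural network: feature vector -> caption."""
    mean_intensity = features[0]
    return "a bright scene" if mean_intensity >= 0.5 else "a dark scene"


def toy_action_describer(frames: Sequence[Frame]) -> str:
    """Stand-in for the action description module: frames -> action text."""
    means = [toy_feature_extractor(f)[0] for f in frames]
    if len(means) > 1 and means[-1] > means[0]:
        return "the scene gets brighter"
    if len(means) > 1 and means[-1] < means[0]:
        return "the scene gets darker"
    return "no change detected"


@dataclass
class SceneAnnotationModule:
    extract: Callable[[Frame], FeatureVector]   # first network
    decode: Callable[[FeatureVector], str]      # second network

    def caption(self, frame: Frame) -> str:
        # The caption is generated from the feature vector, not the raw frame.
        return self.decode(self.extract(frame))


@dataclass
class Controller:
    scene: SceneAnnotationModule
    describe_action: Callable[[Sequence[Frame]], str]

    def annotate(self, frames: Sequence[Frame]) -> Dict[str, str]:
        # Assumption: caption the most recent frame; describe action over all frames.
        return {
            "caption": self.scene.caption(frames[-1]),
            "action": self.describe_action(frames),
        }


controller = Controller(
    scene=SceneAnnotationModule(toy_feature_extractor, toy_caption_decoder),
    describe_action=toy_action_describer,
)

dark_frame = [[0.1, 0.2], [0.0, 0.1]]
bright_frame = [[0.8, 0.9], [1.0, 0.7]]
result = controller.annotate([dark_frame, bright_frame])
print(result)
```

In a real system the two callables inside `SceneAnnotationModule` would be replaced by trained models; keeping them as injected functions simply makes the two-stage structure from the abstract explicit.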

