The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

G06F 16/65 (2019.01); A63F 13/60 (2014.01); G06N 3/045 (2023.01); G06V 10/44 (2022.01); G06V 10/764 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 20/40 (2022.01); G06V 20/70 (2022.01); G10L 13/02 (2013.01); G10L 15/16 (2006.01); G10L 15/26 (2006.01);

U.S. Cl.

CPC ...

G06F 16/65 (2019.01); A63F 13/60 (2014.09); G06N 3/045 (2023.01); G06V 10/454 (2022.01); G06V 10/764 (2022.01); G06V 10/776 (2022.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 20/41 (2022.01); G06V 20/70 (2022.01); G10L 13/02 (2013.01); G10L 15/16 (2013.01); G10L 15/26 (2013.01); G06V 20/44 (2022.01);

Abstract

A system enhances existing audio-visual content with an action a scene annotation module, an action description module, both of which are coupled to a controller. The scene annotation module classifies scene elements from an image frame received from a host system and generates a caption describing the scene elements. The scene annotation module includes a first neural network configured to generate a feature vector from the image frame and a second neural network configured to generate a caption describing elements within the image frame from the feature vector. The action description module recognizes action happening within one or more image frames received from the host system and generates a description of the action happening within one or more image frames.

Find Patent Forward Citations