The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also links to the full patent document in PDF (Adobe Acrobat) format.

Date of Patent:

May 04, 2021

Filed:

Sep. 06, 2019

Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Hooman Mahyar, Kirkland, WA (US);

Vimal Bhat, Redmond, WA (US);

Jatin Jain, Redmond, WA (US);

Udit Bhatia, Bellevue, WA (US);

Roya Hosseini, Bellevue, WA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
H04N 9/87 (2006.01); G06K 9/00 (2006.01); G06F 16/738 (2019.01); G06N 3/04 (2006.01); G06F 16/78 (2019.01); G06F 40/166 (2020.01); G10L 13/00 (2006.01); G10L 17/00 (2013.01);
U.S. Cl.
CPC ...
H04N 9/8715 (2013.01); G06F 16/738 (2019.01); G06F 16/7867 (2019.01); G06F 40/166 (2020.01); G06K 9/00288 (2013.01); G06K 9/00718 (2013.01); G06K 9/00744 (2013.01); G06N 3/0454 (2013.01); G10L 13/00 (2013.01); G10L 17/00 (2013.01); G06K 2009/00738 (2013.01);
Abstract

Systems, methods, and computer-readable media are disclosed for systems and methods for automated generation of textual descriptions of video content. Example methods may include determining, by one or more computer processors coupled to memory, a first segment of video content, the first segment including a first set of frames and first audio content, determining, using a first neural network, a first action that occurs in the first set of frames, and determining a first sound present in the first audio content. Some methods may include generating a vector representing the first action and the first sound, and generating, using a second neural network and the vector, a first textual description of the first segment, where the first textual description includes words that describe events of the first segment.
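
The data flow in the abstract can be illustrated with a toy sketch. This is not the patented implementation: the two neural networks are replaced by trivial stand-ins (a majority vote and a sentence template), and the label vocabularies and every name here (`Segment`, `detect_action`, `detect_sound`, `encode`, `describe`, `describe_segment`) are hypothetical, chosen only to show the order of the steps.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical label vocabularies; the abstract does not enumerate any.
ACTIONS = ["running", "talking", "driving"]
SOUNDS = ["music", "speech", "engine"]


@dataclass
class Segment:
    frames: List[str]  # per-frame action guesses standing in for image data
    audio: str         # a sound label standing in for raw audio content


def detect_action(frames: List[str]) -> str:
    """Stand-in for the first neural network: majority vote over frames."""
    return max(ACTIONS, key=frames.count)


def detect_sound(audio: str) -> str:
    """Stand-in for sound detection: accept only labels in the vocabulary."""
    if audio not in SOUNDS:
        raise ValueError(f"unknown sound: {audio}")
    return audio


def encode(action: str, sound: str) -> List[float]:
    """Build one vector representing both the action and the sound (one-hot)."""
    return [float(a == action) for a in ACTIONS] + [float(s == sound) for s in SOUNDS]


def describe(vector: List[float]) -> str:
    """Stand-in for the second neural network: decode the vector into a sentence."""
    n = len(ACTIONS)
    action = ACTIONS[vector[:n].index(max(vector[:n]))]
    sound = SOUNDS[vector[n:].index(max(vector[n:]))]
    return f"Someone is {action} while {sound} is heard."


def describe_segment(segment: Segment) -> str:
    """Run the full flow: segment -> action and sound -> vector -> text."""
    vector = encode(detect_action(segment.frames), detect_sound(segment.audio))
    return describe(vector)
```

Calling `describe_segment(Segment(frames=["running", "talking", "running"], audio="music"))` walks one segment through all four steps. A real system would presumably replace each stand-in with a trained model operating on actual frames and audio.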

