The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 24, 2024

Filed:

Jul. 21, 2021
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Bo Wu, Cambridge, MA (US);

Chuang Gan, Cambridge, MA (US);

Dakuo Wang, Cambridge, MA (US);

Zhenfang Chen, Cambridge, MA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2019.01); G06F 40/205 (2020.01); G06F 40/284 (2020.01); G06N 5/02 (2023.01); G06N 5/04 (2023.01); G06N 20/20 (2019.01); G06V 20/40 (2022.01);
U.S. Cl.
CPC ...
G06N 5/04 (2013.01); G06F 40/205 (2020.01); G06F 40/284 (2020.01); G06N 5/02 (2013.01); G06N 20/20 (2019.01); G06V 20/49 (2022.01);
Abstract

Mechanisms are provided for performing artificial intelligence-based video question answering. A video parser parses an input video data sequence to generate situation data structure(s), each situation data structure comprising data elements corresponding to entities, and first relationships between entities, identified by the video parser as present in images of the input video data sequence. First machine learning computer model(s) operate on the situation data structure(s) to predict second relationship(s) between the situation data structure(s). Second machine learning computer model(s) execute on a received input question to predict an executable program to execute to answer the received question. The program is executed on the situation data structure(s) and predicted second relationship(s). An answer to the question is output based on results of executing the program.


Find Patent Forward Citations

Loading…