The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

G06V 30/262 (2022.01); G06T 7/70 (2017.01); G06V 30/413 (2022.01); G06V 20/62 (2022.01); G06F 16/33 (2019.01); G06V 30/19 (2022.01); G06V 10/82 (2022.01); G06V 30/416 (2022.01);

U.S. Cl.

CPC ...

G06V 30/274 (2022.01); G06F 16/3344 (2019.01); G06T 7/70 (2017.01); G06V 10/82 (2022.01); G06V 20/62 (2022.01); G06V 30/19173 (2022.01); G06V 30/413 (2022.01); G06V 30/416 (2022.01); G06T 2207/30176 (2013.01);

Abstract

The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.

Find Patent Forward Citations