The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 11, 2023

Filed:

Jun. 16, 2021
Applicant:

Tata Consultancy Services Limited, Mumbai, IN;

Inventors:
Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 10/25 (2022.01); G06F 16/583 (2019.01); G06F 40/20 (2020.01); G06V 20/62 (2022.01); G06N 3/04 (2023.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 10/764 (2022.01); G06V 30/24 (2022.01);
U.S. Cl.
CPC ...
G06V 10/25 (2022.01); G06F 16/5846 (2019.01); G06F 40/20 (2020.01); G06N 3/04 (2013.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 20/20 (2022.01); G06V 20/62 (2022.01); G06V 30/24 (2022.01);
Abstract

This disclosure relates generally to visio-linguistic understanding. Conventional methods use contextual visio-linguistic reasoner for visio-linguistic understanding which requires more compute power and large amount of pre-training data. Embodiments of the present disclosure provide a method for visio-linguistic understanding using contextual language model reasoner. The method converts the visual information of an input image into a format that the contextual language model reasoner understands and accepts for a downstream task. The method utilizes the image captions and confidence score associated with the image captions along with a knowledge graph to obtain a combined input in a format compatible with the contextual language model reasoner. Contextual embeddings corresponding to the downstream task is obtained using the combined input. The disclosed method is used to solve several downstream tasks such as scene understanding, visual question answering, visual common-sense reasoning and so on.


Find Patent Forward Citations

Loading…