The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 17, 2020

Filed:

Jan. 29, 2018
Applicant:

Salesforce.com, Inc., San Francisco, CA (US);

Inventors:

Alexander Richard Trott, San Francisco, CA (US);

Caiming Xiong, Mountain View, CA (US);

Richard Socher, Menlo Park, CA (US);

Assignee:

salesforce.com, inc., San Francisco, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/00 (2006.01); G06K 9/46 (2006.01); G06F 16/332 (2019.01); G06N 5/04 (2006.01); G06N 3/04 (2006.01);
U.S. Cl.
CPC ...
G06K 9/46 (2013.01); G06F 16/3329 (2019.01); G06K 9/00 (2013.01); G06N 3/0445 (2013.01); G06N 5/04 (2013.01); G06T 2210/12 (2013.01);
Abstract

Approaches for interpretable counting for visual question answering include a digital image processor, a language processor, and a counter. The digital image processor identifies objects in an image, maps the identified objects into an embedding space, generates bounding boxes for each of the identified objects, and outputs the embedded objects paired with their bounding boxes. The language processor embeds a question into the embedding space. The scorer determines scores for the identified objects. Each respective score determines how well a corresponding one of the identified objects is responsive to the question. The counter determines a count of the objects in the digital image that are responsive to the question based on the scores. The count and a corresponding bounding box for each object included in the count are output. In some embodiments, the counter determines the count interactively based on interactions between counted and uncounted objects.


Find Patent Forward Citations

Loading…