The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:

Apr. 16, 2024

Filed:

Aug. 25, 2021

Applicant:

Nvidia Corporation, Santa Clara, CA (US);

Inventors:

Taihong Xiao, Merced, CA (US);

Sifei Liu, Santa Clara, CA (US);

Shalini De Mello, San Francisco, CA (US);

Zhiding Yu, Santa Clara, CA (US);

Jan Kautz, Lexington, MA (US);

Assignee:

NVIDIA Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 18/00 (2023.01); G06F 18/213 (2023.01); G06F 18/214 (2023.01); G06N 3/08 (2023.01); G06V 10/22 (2022.01); G06V 30/14 (2022.01);
U.S. Cl.
CPC ...
G06F 18/2155 (2023.01); G06F 18/213 (2023.01); G06N 3/08 (2013.01); G06V 10/22 (2022.01); G06V 30/1444 (2022.01);
Abstract

A multi-level contrastive training strategy for training a neural network relies on image pairs (no other labels) to learn semantic correspondences at the image level and region or pixel level. The neural network is trained using contrasting image pairs including different objects and corresponding image pairs including different views of the same object. Conceptually, contrastive training pulls corresponding image pairs closer and pushes contrasting image pairs apart. An image-level contrastive loss is computed from the outputs (predictions) of the neural network and used to update parameters (weights) of the neural network via backpropagation. The neural network is also trained via pixel-level contrastive learning using only image pairs. Pixel-level contrastive learning receives an image pair, where each image includes an object in a particular category.
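The "pull corresponding pairs closer, push contrasting pairs apart" idea can be illustrated with a minimal sketch. The abstract does not specify the loss function, so the code below substitutes a classic margin-based pairwise contrastive loss as a stand-in; the function names, the `margin` parameter, and the 2-D embeddings are all hypothetical and are not taken from the patent.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pairwise_contrastive_loss(emb_a, emb_b, is_corresponding, margin=1.0):
    """Margin-based contrastive loss for one image pair (illustrative only).

    Assumption: this is a generic textbook formulation, not the loss
    claimed in the patent.
    - Corresponding pair: penalized by its squared distance (pulls closer).
    - Contrasting pair: penalized only while closer than `margin`
      (pushes apart).
    """
    d = euclidean_distance(emb_a, emb_b)
    if is_corresponding:
        return d ** 2
    return max(0.0, margin - d) ** 2


# Hypothetical embeddings produced by some network for three images.
view_1 = [0.0, 0.0]   # object A, first view
view_2 = [0.3, 0.4]   # object A, second view
other = [0.6, 0.8]    # a different object

loss_corresponding = pairwise_contrastive_loss(view_1, view_2, True)
loss_contrasting = pairwise_contrastive_loss(view_1, other, False, margin=2.0)
print(loss_corresponding, loss_contrasting)
```

In a real training loop, a loss of this kind would be averaged over a batch of pairs and its gradient backpropagated to update the network weights, which is the role the abstract assigns to the image-level contrastive loss.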