The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 25, 2021

Filed:

Jan. 26, 2018
Applicant:

Nvidia Corporation, Santa Clara, CA (US);

Inventors:

Ziyad Hakura, San Jose, CA (US);

Olivier Giroux, Santa Clara, CA (US);

Wishwesh Gandhi, Sunnyvale, CA (US);

Assignee:

NVIDIA Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 12/02 (2006.01); G06F 9/48 (2006.01); G06F 12/06 (2006.01); G06F 9/46 (2006.01); G06F 9/52 (2006.01);
U.S. Cl.
CPC ...
G06F 9/4843 (2013.01); G06F 9/461 (2013.01); G06F 12/06 (2013.01); G06F 9/52 (2013.01); G06F 2209/462 (2013.01); G06F 2209/521 (2013.01); G06F 2212/1008 (2013.01);
Abstract

In various embodiments, an ordered atomic operation enables a parallel processing subsystem to executes an atomic operation associated with a memory location in a specified order relative to other ordered atomic operations associated with the memory location. A level 2 (L2) cache slice includes an atomic processing circuit and a content-addressable memory (CAM). The CAM stores an ordered atomic operation specifying at least a memory address, an atomic operation, and an ordering number. In operation, the atomic processing circuit performs a look-up operation on the CAM, where the look-up operation specifies the memory address. After the atomic processing circuit determines that the ordering number is equal to a current ordering number associated with the memory address, the atomic processing circuit executes the atomic operation and returns the result to a processor executing an algorithm. Advantageously, the ordered atomic operation enables the algorithm to achieve a deterministic result while optimizing latency.


Find Patent Forward Citations

Loading…