The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Nov. 14, 2017

Filed:
Dec. 16, 2011

Applicants:

Michael Fetterman, Boxborough, MA (US);

Jack Hilaire Choquette, Palo Alto, CA (US);

Omkar Paranjape, Austin, TX (US);

Anjana Rajendran, San Jose, CA (US);

Eric Lyell Hill, Palo Alto, CA (US);

Stewart Glenn Carlton, Madison, AL (US);

Rajeshwaran Selvanesan, Milpitas, CA (US);

Douglas J. Hahn, Los Altos, CA (US);

Steven James Heinrich, Madison, AL (US);

Inventors:

Michael Fetterman, Boxborough, MA (US);

Jack Hilaire Choquette, Palo Alto, CA (US);

Omkar Paranjape, Austin, TX (US);

Anjana Rajendran, San Jose, CA (US);

Eric Lyell Hill, Palo Alto, CA (US);

Stewart Glenn Carlton, Madison, AL (US);

Rajeshwaran Selvanesan, Milpitas, CA (US);

Douglas J. Hahn, Los Altos, CA (US);

Steven James Heinrich, Madison, AL (US);

Assignee:

NVIDIA Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/38 (2006.01);
U.S. Cl.
CPC ...
G06F 9/3851 (2013.01); G06F 9/3861 (2013.01);
Abstract

One embodiment of the present invention sets forth an approach for executing replay operations for divergent operations in a parallel processing subsystem. Specifically, the streaming multiprocessor (SM) includes a multistage pipeline configured to batch two or more replay operations for processing via a replay loop. A logic element within the multistage pipeline detects whether the current pipeline stage is accessing a shared resource, such as loading data from a shared memory. If the threads are accessing data that is distributed across multiple cache lines, then the multistage pipeline batches two or more replay operations, where the replay operations are inserted into the pipeline back-to-back.
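
As a rough illustration of the batching idea described in the abstract, the following Python sketch models a group of threads whose load addresses may fall on several cache lines. It is a software analogy only, not the patented hardware logic: the 128-byte cache line size, the batch size of two, and the names `group_by_cache_line` and `schedule_passes` are all assumptions introduced for this example.

```python
from collections import defaultdict

CACHE_LINE_BYTES = 128  # assumed line size; the abstract does not state one
BATCH_SIZE = 2          # assumed; the abstract only says "two or more"


def group_by_cache_line(addresses, line_bytes=CACHE_LINE_BYTES):
    """Map each cache-line index to the thread ids whose address falls in it."""
    groups = defaultdict(list)
    for thread_id, addr in enumerate(addresses):
        groups[addr // line_bytes].append(thread_id)
    return dict(groups)


def schedule_passes(addresses, batch_size=BATCH_SIZE, line_bytes=CACHE_LINE_BYTES):
    """Return (initial_line, replay_batches) for one divergent access.

    In this simplified model the first cache line is served by the original
    pass, every remaining line needs one replay operation, and replays are
    grouped into batches that would enter the pipeline back-to-back.
    """
    lines = sorted(group_by_cache_line(addresses, line_bytes))
    if not lines:
        return None, []
    initial, replays = lines[0], lines[1:]
    batches = [replays[i:i + batch_size]
               for i in range(0, len(replays), batch_size)]
    return initial, batches


# Eight threads touching five distinct (hypothetical) cache lines.
initial_line, replay_batches = schedule_passes(
    [0, 4, 128, 132, 256, 300, 512, 640]
)
```

With these assumed addresses, four replay operations are needed and they are grouped into two batches of two. When every thread hits the same cache line, no replay is scheduled at all.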

