The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 13, 2025

Filed:

Mar. 13, 2023
Applicant:

Nvidia Corporation, Santa Clara, CA (US);

Inventors:

Tim Kaldewey, Bala Cynwyd, PA (US);

Jiri Johannes Kraus, Bonn, DE (US);

Nikolay Sakharnykh, Chicago, IL (US);

Assignee:

NVIDIA Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/38 (2018.01); G06F 9/50 (2006.01); G06F 9/54 (2006.01); G06F 16/2455 (2019.01); G06F 16/28 (2019.01); H04L 9/06 (2006.01);
U.S. Cl.
CPC ...
G06F 9/3877 (2013.01); G06F 9/5061 (2013.01); G06F 9/544 (2013.01); G06F 16/2456 (2019.01); G06F 16/283 (2019.01); H04L 9/0643 (2013.01);
Abstract

Described approaches provide for effectively and scalably using multiple GPUs to build and probe hash tables and materialize results of probes. Random memory accesses by the GPUs to build and/or probe a hash table may be distributed across GPUs and executed concurrently using global location identifiers. A global location identifier may be computed from data of an entry and identify a global location for an insertion and/or probe using the entry. The global location identifier may be used by a GPU to determine whether to perform an insertion or probe using an entry and/or where the insertion or probe is to be performed. To coordinate GPUs in materializing results of probing a hash table a global offset to the global output buffer may be maintained in memory accessible to each of the GPUs or the GPUs may compute global offsets using an exclusive sum of the local output buffer sizes.


Find Patent Forward Citations

Loading…