The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 05, 2019

Filed:

Mar. 16, 2018
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Arash Ashari, Kirkland, WA (US);

Matthias Boehm, San Jose, CA (US);

Keith W. Campbell, Ottawa, CA;

Alexandre Evfimievski, San Jose, CA (US);

John D. Keenleyside, Pickering, CA;

Berthold Reinwald, San Jose, CA (US);

Shirish Tatikonda, Santa Clara, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06T 1/20 (2006.01);
U.S. Cl.
CPC ...
G06T 1/20 (2013.01);
Abstract

A method for optimization of machine learning (ML) workloads on a graphics processor unit (GPU). The method includes identifying a computation having a generic pattern commonly observed in ML processes. Hierarchical aggregation spanning a memory hierarchy of the GPU for processing is performed for the identified computation including maintaining partial output vector results in shared memory of the GPU. Hierarchical aggregation for vectors is performed including performing intra-block aggregation for multiple thread blocks of a partial output vector results on GPU global memory.


Find Patent Forward Citations

Loading…