The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jun. 22, 2021

Filed:

Apr. 19, 2018
Applicant:

Intel Corporation, Santa Clara, CA (US);

Inventors:

Subramaniam Maiyuran, Gold River, CA (US);

Guei-Yuan Lueh, San Jose, CA (US);

Supratim Pal, Bangalore, IN;

Ashutosh Garg, Folsom, CA (US);

Chandra S. Gurram, Folsom, CA (US);

Jorge E. Parra, El Dorado Hills, CA (US);

Junjie Gu, Santa Clara, CA (US);

Konrad Trifunovic, Mierzyn, PL;

Hong Bin Liao, Beijing, CN;

Mike B. Macpherson, Portland, OR (US);

Shubh B. Shah, Folsom, CA (US);

Shubra Marwaha, Santa Clara, CA (US);

Stephen Junkins, Bend, OR (US);

Timothy R. Bauer, Santa Clara, CA (US);

Varghese George, Folsom, CA (US);

Weiyu Chen, San Jose, CA (US);

Assignee:

Intel Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/30 (2018.01); G06T 1/20 (2006.01); G06F 9/38 (2018.01);
U.S. Cl.
CPC ...
G06F 9/3001 (2013.01); G06F 9/30145 (2013.01); G06F 9/3802 (2013.01); G06F 9/3887 (2013.01); G06T 1/20 (2013.01);
Abstract

Embodiments described herein provided for an instruction and associated logic to enable GPGPU program code to access special purpose hardware logic to accelerate dot product operations. One embodiment provides for a graphics processing unit comprising a fetch unit to fetch an instruction for execution and a decode unit to decode the instruction into a decoded instruction. The decoded instruction is a matrix instruction to cause the graphics processing unit to perform a parallel dot product operation. The GPGPU also includes a systolic dot product unit to execute the decoded instruction across one or more SIMD lanes using multiple systolic layers, wherein to execute the decoded instruction, a dot product computed at a first systolic layer is to be output to a second systolic layer, wherein each systolic layer includes one or more sets of interconnected multipliers and adders, each set of multipliers and adders to generate a dot product.


Find Patent Forward Citations

Loading…