The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 23, 2021

Filed:

Sep. 28, 2018
Applicant:

Intel Corporation, Santa Clara, CA (US);

Inventors:

Mike Espig, Newberg, OR (US);

Bret Toll, Hillsboro, OR (US);

Raanan Sade, Kibutz Sarid, IL;

Bob Valentine, Kiryat Tivon, IL;

Alexander Heinecke, San Jose, CA (US);

Christopher J. Hughes, Santa Clara, CA (US);

Assignee:

Intel Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/38 (2018.01); G06F 9/30 (2018.01); G06F 15/80 (2006.01);
U.S. Cl.
CPC ...
G06F 9/384 (2013.01); G06F 9/30043 (2013.01); G06F 9/382 (2013.01); G06F 9/3826 (2013.01); G06F 9/3851 (2013.01); G06F 15/8046 (2013.01);
Abstract

An apparatus and method for efficient matrix alignment in a systolic array. For example, one embodiment of a processor comprises: a first set of physical tile registers to store first matrix data in rows or columns; a second set of physical tile registers to store second matrix data in rows or columns; a decoder to decode a matrix instruction identifying a first input matrix, a first offset, a second input matrix, and a second offset; and execution circuitry, responsive to the matrix instruction, to read a subset of rows or columns from the first set of physical tile registers in accordance with the first offset, spanning multiple physical tile registers from the first set if indicated by the first offset to generate a first input matrix and the execution circuitry to read a subset of rows or columns from the second set of physical tile registers in accordance with the second offset, spanning multiple physical tile registers from the second set if indicated by the second offset to generate a second input matrix; and the execution circuitry to perform an arithmetic operation with the first and second input matrices in accordance with an opcode of the matrix instruction.


Find Patent Forward Citations

Loading…