The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 06, 2024

Filed:

May. 05, 2023
Applicant:

Intel Corporation, Santa Clara, CA (US);

Inventors:

Naveen Mellempudi, Bangalore, IN;

Alexander F. Heinecke, San Jose, CA (US);

Robert Valentine, Kiryat Tivon, IL;

Mark J. Charney, Lexington, MA (US);

Christopher J. Hughes, Santa Clara, CA (US);

Evangelos Georganas, San Mateo, CA (US);

Zeev Sperber, Zichron Yackov, IL;

Amit Gradstein, Binyamina, IL;

Simon Rubanovich, Haifa, IL;

Assignee:

Intel Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/30 (2018.01); G06F 7/499 (2006.01); G06F 9/38 (2018.01);
U.S. Cl.
CPC ...
G06F 9/30036 (2013.01); G06F 7/49915 (2013.01); G06F 9/30196 (2013.01); G06F 9/3887 (2013.01);
Abstract

Systems, methods, and apparatuses relating to 8-bit floating-point matrix dot product instructions are described. A processor embodiment includes fetch circuitry to fetch an instruction having fields to specify an opcode and locations of a destination matrix having single-precision elements, a first source matrix, and a second source matrix, the source matrices having elements that each comprise a quadruple of 8-bit floating-point values, the opcode to indicate execution circuitry is to cause, for each element of the first source matrix and corresponding element of the second source matrix, a conversion of the 8-bit floating-point values to single-precision values, a multiplication of different pairs of converted single-precision values to generate plurality of results, and an accumulation of the results with previous contents of a corresponding element of the destination matrix, decode circuitry to decode the fetched instruction, and the execution circuitry to respond to the decoded instruction as specified by the opcode.


Find Patent Forward Citations

Loading…