The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 26, 2024

Filed:

Dec. 24, 2020
Applicant:

Intel Corporation, Santa Clara, CA (US);

Inventors:

Alexander F. Heinecke, San Jose, CA (US);

Robert Valentine, Kiryat Tivon, IL;

Mark J. Charney, Lexington, MA (US);

Menachem Adelman, Haifa, IL;

Christopher J. Hughes, Santa Clara, CA (US);

Evangelos Georganas, San Mateo, CA (US);

Zeev Sperber, Zichron Yackov, IL;

Amit Gradstein, Binyamina, IL;

Simon Rubanovich, Haifa, IL;

Assignee:

Intel Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/30 (2018.01); G06F 7/544 (2006.01); G06F 9/38 (2018.01); G06F 17/16 (2006.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06F 9/3001 (2013.01); G06F 7/5443 (2013.01); G06F 9/30145 (2013.01); G06F 9/3802 (2013.01); G06F 17/16 (2013.01); G06N 3/08 (2013.01);
Abstract

Systems, methods, and apparatuses relating to 16-bit floating-point matrix dot product instructions are described. In one embodiment, a processor includes fetch circuitry to fetch a single instruction having fields to specify an opcode and locations of a M by N destination matrix having single-precision elements, an M by K first source matrix, and a K by N second source matrix, the source matrices having elements that each comprise a pair of half-precision floating-point values, the opcode to indicate execution circuitry is to cause, for each element of the first source matrix and corresponding element of the second source matrix, a conversion of the half-precision floating-point values to single-precision values, a multiplication of converted single-precision values from first values of the pairs together to generate a first result, a multiplication of converted single-precision values from second values of the pairs together to generate a second result, and an accumulation of the first result and the second result with previous contents of a corresponding element of the destination matrix, decode circuitry to decode the fetched instruction, and the execution circuitry to respond to the decoded instruction as specified by the opcode.


Find Patent Forward Citations

Loading…