The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 14, 2025

Filed:

Jul. 14, 2023
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Jeffrey T. Huynh, San Jose, CA (US);

Ron Diamant, Santa Clara, CA (US);

Hongbin Zheng, San Jose, CA (US);

Yizhi Liu, Fremont, CA (US);

Animesh Jain, Sunnyvale, CA (US);

Yida Wang, Palo Alto, CA (US);

Vinod Sharma, Menlo Park, CA (US);

Richard John Heaton, San Jose, CA (US);

Randy Renfu Huang, Morgan Hill, CA (US);

Sundeep Amirineni, Cedar Park, TX (US);

Drazen Borkovic, Los Altos, CA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/063 (2023.01); G06N 3/04 (2023.01); G06N 3/045 (2023.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06N 3/063 (2013.01); G06N 3/04 (2013.01);
Abstract

Generating instructions for programming a processing element array to implement a convolution operation can include determining that the convolution operation under-utilizes the processing element array. The convolution operation involves using the processing element array to perform a series of matrix multiplications between a set of filters and a set of input matrices. Each filter comprises a weight matrix. Each input matrix is assigned to a respective row in the processing element array. Under-utilization can be determined through detecting that less than a threshold number of rows would be used concurrently. In response to determining that the convolution operation under-utilizes the processing element array, instructions can be added for modifying the convolution operation to increase the number of rows used concurrently. The added instructions are executable to cause at least one input matrix to be processed in parallel across more rows compared to processing without modifying the convolution operation.


Find Patent Forward Citations

Loading…