The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 22, 2024

Filed:

Mar. 11, 2021
Applicant:

Perceive Corporation, San Jose, CA (US);

Inventors:

Justin Tantiongloc, San Jose, CA (US);

Brian Thomas, Vancouver, CA;

Steven L. Teig, Menlo Park, CA (US);

Assignee:

PERCEIVE CORPORATION, San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/048 (2023.01); G06F 9/48 (2006.01); G06F 9/50 (2006.01); G06F 18/21 (2023.01); G06N 3/063 (2023.01);
U.S. Cl.
CPC ...
G06N 3/048 (2023.01); G06F 9/4881 (2013.01); G06F 9/5016 (2013.01); G06F 18/2163 (2023.01); G06N 3/063 (2013.01); G06F 2209/5017 (2013.01); G06F 2209/506 (2013.01);
Abstract

Some embodiments provide a method for generating neural network program instructions for a neural network inference circuit to execute a neural network. The neural network inference circuit includes a particular amount of available memory. The method receives a specification of the neural network including multiple layers. The method determines (i) a required amount of weight memory for the neural network and (ii) required amounts of activation memory for each of a set of layers of the neural network. When the required amount of weight memory and the required amount of activation memory for at least one layer is greater than the particular amount of available memory, the method generates the program instructions for the neural network inference circuit to execute a first set of the layers of the neural network multiple times for different blocks of input data and execute a second set of the layers in a single pass.


Find Patent Forward Citations

Loading…