The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 09, 2021

Filed:

Sep. 18, 2020
Applicant:

Neuralmagic Inc., Somerville, MA (US);

Inventors:

Alexander Matveev, Cambridge, MA (US);

Nir Shavit, Cambridge, MA (US);

Assignee:

NEURALMAGIC INC., Somerville, MA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2006.01); G06F 12/0875 (2016.01); G06N 3/04 (2006.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G06F 12/0875 (2013.01); G06N 3/04 (2013.01); G06F 2212/452 (2013.01);
Abstract

A system and method of inferring a neural network (NN) on one or more target computing devices. The NN may include a plurality of layers, where at least one layer includes one or more kernels. Embodiments may include: receiving a data structure representing the NN; analyzing the data structure to produce one or more tasks, where each task may include computations pertaining to a kernel of the NN; selecting a sparse version of at least one kernel and replacing the at least one kernel with the sparse version; and compiling the one or more tasks to produce one or more respective tensor columns, The one or more tensor columns are adapted to fit in respective one or more cache memories of the one or more target computing devices, and include task instruction code that represents at least one computation of the kernel of the NN.


Find Patent Forward Citations

Loading…