The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Jul. 19, 2022
Filed:
Apr. 02, 2019
Applicant:
Nvidia Corporation, Santa Clara, CA (US);
Inventors:
Jeff Pool, Durham, NC (US);
Ganesh Venkatesh, San Jose, CA (US);
Jorge Albericio Latorre, San Jose, CA (US);
Jack Choquette, Palo Alto, CA (US);
Ronny Krashinsky, San Francisco, CA (US);
John Tran, Denver, CO (US);
Feng Xie, Shanghai, CN;
Ming Y. Siu, Santa Clara, CA (US);
Manan Patel, San Jose, CA (US);
Assignee:
NVIDIA Corporation, Santa Clara, CA (US);
Abstract
Approaches in accordance with various embodiments provide for the processing of sparse matrices for mathematical and programmatic operations. In particular, various embodiments enforce sparsity constraints for performing sparse matrix multiply-add instruction (MMA) operations. Deep neural networks can exhibit significant sparsity in the data used in operations, both in the activations and weights. The computational load can be reduced by excluding zero-valued data elements. A sparsity constraint is applied across all submatrices of a sparse matrix, providing fine-grained structured sparsity that is evenly distributed across the matrix. The matrix may then be compressed since a minimum number of elements of the matrix are known to have zero value. Matrix operations are then performed using these matrices.
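The pipeline the abstract describes (constrain each submatrix, compress, then multiply) can be illustrated with a small sketch. The abstract does not state the constraint itself, so everything specific here is an assumption made for illustration: the constraint is taken to be "at most 2 non-zero values in every group of 4 consecutive elements of a row", pruning keeps the largest magnitudes, and the function names `prune_n_of_m`, `compress`, and `sparse_matmul` are hypothetical. It is plain Python meant to show the idea, not the hardware MMA instruction or the patented method.

```python
def prune_n_of_m(matrix, n=2, m=4):
    """Zero all but the n largest-magnitude entries in each group of m per row.

    Assumption: the sparsity constraint is "at most n non-zeros per m elements".
    """
    pruned = []
    for row in matrix:
        if len(row) % m != 0:
            raise ValueError("row length must be a multiple of m")
        new_row = []
        for start in range(0, len(row), m):
            group = row[start:start + m]
            # Positions of the n largest magnitudes; ties go to the lower index.
            keep = sorted(range(m), key=lambda i: (-abs(group[i]), i))[:n]
            new_row.extend(v if i in keep else 0 for i, v in enumerate(group))
        pruned.append(new_row)
    return pruned


def compress(matrix, n=2, m=4):
    """Store exactly n values per group of m, plus their positions in the group."""
    values, positions = [], []
    for row in matrix:
        row_vals, row_pos = [], []
        for start in range(0, len(row), m):
            group = row[start:start + m]
            nonzero = [i for i, v in enumerate(group) if v != 0]
            if len(nonzero) > n:
                raise ValueError("group violates the sparsity constraint")
            # Pad with zero-valued positions so every group fills n slots.
            padding = [i for i in range(m) if i not in nonzero][:n - len(nonzero)]
            slots = sorted(nonzero + padding)
            row_vals.extend(group[i] for i in slots)
            row_pos.extend(slots)
        values.append(row_vals)
        positions.append(row_pos)
    return values, positions


def sparse_matmul(values, positions, dense, n=2, m=4):
    """Multiply the compressed matrix by a dense matrix given as a list of rows."""
    cols = len(dense[0])
    result = []
    for row_vals, row_pos in zip(values, positions):
        out = [0] * cols
        for slot, (v, p) in enumerate(zip(row_vals, row_pos)):
            k = (slot // n) * m + p  # column index in the uncompressed row
            for j in range(cols):
                out[j] += v * dense[k][j]
        result.append(out)
    return result
```

Under these assumptions, pruning the row `[1, -5, 0, 3, 2, 2, 7, -1]` gives `[0, -5, 0, 3, 2, 0, 7, 0]`, which compresses to the values `[-5, 3, 2, 7]` with in-group positions `[1, 3, 0, 2]`. Because every group is known to hold at least `m - n` zeros, the compressed form stores only half the elements, and the multiply loop visits only those stored slots.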