The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
May. 16, 2023
Filed:
Feb. 11, 2020
Apple Inc., Cupertino, CA (US);
James C. Gabriel, Seattle, WA (US);
Mohammad Rastegari, Bothell, WA (US);
Hessam Bagherinezhad, Seattle, WA (US);
Saman Naderiparizi, Seattle, WA (US);
Anish Prabhu, Seattle, WA (US);
Sophie Lebrecht, Seattle, WA (US);
Jonathan Gelsey, Mercer Island, WA (US);
Sayyed Karen Khatamifard, Seattle, WA (US);
Andrew L. Chronister, Seattle, WA (US);
David Bakin, Seattle, WA (US);
Andrew Z. Luo, Redmond, WA (US);
Apple Inc., Cupertino, CA (US);
Abstract
Systems and processes for training and compressing a convolutional neural network model include the use of quantization and layer fusion. Quantized training data is passed through a convolutional layer of a neural network model to generate convolutional results during a first iteration of training the neural network model. The convolutional results are passed through a batch normalization layer of the neural network model to update normalization parameters of the batch normalization layer. The convolutional layer is fused with the batch normalization layer to generate a first fused layer and the fused parameters of the fused layer are quantized. The quantized training data is passed through the fused layer using the quantized fused parameters to generate output data, which may be quantized for a subsequent layer in the training iteration.