The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, issue date, filing date, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The patent badge also links to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Jul. 18, 2023

Filed:

Feb. 06, 2020

Applicant:

Baidu USA LLC, Sunnyvale, CA (US);

Inventors:

Min Guo, San Diego, CA (US);

Manjiang Zhang, Sunnyvale, CA (US);

Shengjin Zhou, San Jose, CA (US);

Assignee:

BAIDU USA LLC, Sunnyvale, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2023.01); G06N 3/04 (2023.01); G06F 16/22 (2019.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G06F 16/2219 (2019.01); G06N 3/04 (2013.01);
Abstract

Embodiments relate to systems and methods to optimize quantization of tensors of an AI model. According to one embodiment, a system receives an AI model having one or more layers. The system receives a number of input data for offline inferencing and applies offline inferencing to the AI model based on the input data to generate offline data distributions for the AI model. The system quantizes one or more tensors of the AI model based on the offline data distributions to generate a low-bit representation AI model, where each layer of the AI model includes the one or more tensors. In one embodiment, the system applies online inferencing using the low-bit representation AI model to generate online data distributions for a feature map, and quantizes a feature map tensor based on the online data distributions.
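
The offline step described in the abstract (run calibration inputs, record a data distribution, then quantize a tensor from it) can be illustrated with a minimal sketch. Everything here is an assumption for illustration and is not taken from the patent: the "data distribution" is reduced to a running min/max, the low-bit format is unsigned 8-bit affine quantization, and the names `collect_range`, `params_from_range`, `quantize`, and `dequantize` are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class QuantParams:
    scale: float      # real-valued step between adjacent integer levels
    zero_point: int   # integer level that represents the real value 0.0


def collect_range(batches: Iterable[List[float]]) -> Tuple[float, float]:
    """Track the min/max seen across calibration batches.

    Assumption: min/max stands in for the 'data distribution'; the
    patent abstract does not say which statistics are collected.
    """
    lo, hi = float("inf"), float("-inf")
    for batch in batches:
        lo = min(lo, min(batch))
        hi = max(hi, max(batch))
    return lo, hi


def params_from_range(lo: float, hi: float, bits: int = 8) -> QuantParams:
    """Derive affine quantization parameters from an observed range."""
    qmax = 2 ** bits - 1
    lo, hi = min(lo, 0.0), max(hi, 0.0)   # keep 0.0 inside the range
    scale = (hi - lo) / qmax or 1.0       # fall back to 1.0 for an all-zero range
    zero_point = round(-lo / scale)
    return QuantParams(scale, max(0, min(qmax, zero_point)))


def quantize(values: List[float], p: QuantParams, bits: int = 8) -> List[int]:
    """Map real values to integer levels, clamping to the representable range."""
    qmax = 2 ** bits - 1
    return [max(0, min(qmax, round(v / p.scale) + p.zero_point)) for v in values]


def dequantize(levels: List[int], p: QuantParams) -> List[float]:
    """Map integer levels back to approximate real values."""
    return [(q - p.zero_point) * p.scale for q in levels]


# Offline calibration: two hypothetical batches of values for one tensor.
calibration_batches = [[-1.0, 0.5], [0.25, 3.0]]
lo, hi = collect_range(calibration_batches)
params = params_from_range(lo, hi)
low_bit_tensor = quantize([-0.9, 0.1, 1.7, 2.5], params)
```

Under the same assumptions, the online step in the abstract would call the same helpers on feature-map values observed while the low-bit model is serving requests, instead of on a fixed calibration set.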

