The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Jun. 07, 2022

Filed:
Sep. 30, 2019

Applicant:

Microsoft Technology Licensing, LLC, Redmond, WA (US);

Inventors:

Bharadwaj Pudipeddi, San Jose, CA (US);

Marc Tremblay, Bellevue, WA (US);

Sujeeth Subramanya Bharadwaj, Milpitas, CA (US);

Jinwen Xi, Sunnyvale, CA (US);

Maral Mesmakhosroshahi, Sunnyvale, CA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 15/16 (2006.01); G06N 3/10 (2006.01); H04L 67/10 (2022.01); G06N 3/08 (2006.01);
U.S. Cl.
CPC ...
G06N 3/10 (2013.01); G06N 3/08 (2013.01); H04L 67/10 (2013.01);
Abstract

Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. This paradigm of executing one portion of the AI model at a time allows for dynamic execution of the large AI model.
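
The portion-at-a-time paradigm described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `ParameterServer`, `download`, `run_portion`, and `execute_in_portions` are hypothetical, each portion is assumed to be a small dense layer with a ReLU activation, and an in-memory list stands in for the network connection to the parameter server.

```python
from typing import Dict, List

Vector = List[float]
Portion = Dict[str, list]


class ParameterServer:
    """Hypothetical stand-in for the server that stores the master copy of the model."""

    def __init__(self, portions: List[Portion]) -> None:
        self._portions = portions

    def num_portions(self) -> int:
        return len(self._portions)

    def download(self, index: int) -> Portion:
        # Assumption: a real system would transfer this over a network link.
        return self._portions[index]


def run_portion(params: Portion, x: Vector) -> Vector:
    """Execute one portion, assumed here to be y = max(0, W x + b)."""
    out = []
    for row, bias in zip(params["weights"], params["bias"]):
        total = sum(w * v for w, v in zip(row, x)) + bias
        out.append(max(0.0, total))
    return out


def execute_in_portions(server: ParameterServer, x: Vector) -> Vector:
    """Run the whole model while holding only one portion's parameters at a time."""
    for index in range(server.num_portions()):
        params = server.download(index)  # fetch only the portion needed now
        x = run_portion(params, x)       # execute it on the target device
        del params                       # drop the local reference before the next download
    return x


if __name__ == "__main__":
    # Illustrative two-portion model with made-up weights.
    server = ParameterServer([
        {"weights": [[1.0, 2.0], [-1.0, 0.0]], "bias": [0.0, 0.0]},
        {"weights": [[2.0, 1.0]], "bias": [1.0]},
    ])
    print(execute_in_portions(server, [1.0, 1.0]))  # prints [7.0]
```

In this sketch the target device never holds more than one portion's parameters, which is the property the abstract relies on to fit a large model onto a memory-constrained device. Overlapping the download of the next portion with execution of the current one would be a natural refinement, but it is not shown here.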

