The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).
Patent No.:
Date of Patent:
Nov. 08, 2022
Filed:
Jun. 27, 2018
Applicant:
Amazon Technologies, Inc., Seattle, WA (US);
Inventors:
Sudipta Sengupta, Redmond, WA (US);
Poorna Chand Srinivas Perumalla, Seattle, WA (US);
Dominic Rajeev Divakaruni, Seattle, WA (US);
Nafea Bshara, Cupertino, CA (US);
Leo Parker Dirac, Seattle, WA (US);
Bratin Saha, Cupertino, CA (US);
Matthew James Wood, Seattle, WA (US);
Andrea Olgiati, Gilroy, CA (US);
Swaminathan Sivasubramanian, Sammamish, WA (US);
Assignee:
Amazon Technologies, Inc., Seattle, WA (US);
Abstract
Implementations detailed herein include description of a computer-implemented method. In an implementation, the method at least includes receiving an application instance configuration, an application of the application instance to utilize a portion of an attached accelerator during execution of a machine learning model and the application instance configuration including an arithmetic precision of the machine learning model to be used in determining the portion of the accelerator to provision; provisioning the application instance and the portion of the accelerator attached to the application instance, wherein the application instance is implemented using a physical compute instance in a first location, wherein the portion of the accelerator is implemented using a physical accelerator in the second location; loading the machine learning model onto the portion of the accelerator; and performing inference using the loaded machine learning model of the application using the portion of the accelerator on the attached accelerator.
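The four steps named in the abstract (receive a configuration, provision the instance and accelerator portion, load the model, run inference) can be illustrated with a small Python sketch. This is only an illustration under stated assumptions, not the patented implementation: every class, function, and location name is hypothetical, the bytes-per-parameter table is an assumed sizing rule for how arithmetic precision might determine the accelerator portion, and the inference step is a placeholder.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumption: bytes needed per model parameter at each arithmetic precision.
BYTES_PER_PARAMETER = {"fp32": 4, "fp16": 2, "int8": 1}


@dataclass
class InstanceConfig:
    """Hypothetical application instance configuration."""
    model_name: str
    num_parameters: int
    precision: str  # arithmetic precision of the model, e.g. "fp16"


@dataclass
class AcceleratorPortion:
    """A slice of a physical accelerator, sized here in bytes of memory."""
    location: str
    memory_bytes: int
    loaded_model: Optional[str] = None


@dataclass
class ApplicationInstance:
    """A compute instance with an attached accelerator portion."""
    location: str
    accelerator: AcceleratorPortion


def portion_size(config: InstanceConfig) -> int:
    """Use the arithmetic precision to decide how much accelerator to provision."""
    return config.num_parameters * BYTES_PER_PARAMETER[config.precision]


def provision(config: InstanceConfig, instance_location: str,
              accelerator_location: str) -> ApplicationInstance:
    """Provision the instance in one location and its accelerator portion in another."""
    portion = AcceleratorPortion(accelerator_location, portion_size(config))
    return ApplicationInstance(instance_location, portion)


def load_model(instance: ApplicationInstance, config: InstanceConfig) -> None:
    """Record the model as loaded onto the accelerator portion."""
    instance.accelerator.loaded_model = config.model_name


def infer(instance: ApplicationInstance,
          inputs: List[float]) -> List[Tuple[str, float]]:
    """Placeholder inference: tag each input with the loaded model's name."""
    model = instance.accelerator.loaded_model
    if model is None:
        raise RuntimeError("no model loaded on the accelerator portion")
    return [(model, x) for x in inputs]


# Walk through the steps in order with made-up values.
config = InstanceConfig("demo-model", 1_000_000, "fp16")
instance = provision(config, "location-a", "location-b")
load_model(instance, config)
results = infer(instance, [0.5, 1.5])
```

In this sketch a lower precision such as "int8" yields a smaller accelerator portion than "fp32" for the same parameter count, which is one plausible reading of how precision could feed into provisioning.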