The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Nov. 08, 2022

Filed:

Jun. 27, 2018
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Sudipta Sengupta, Redmond, WA (US);

Poorna Chand Srinivas Perumalla, Seattle, WA (US);

Dominic Rajeev Divakaruni, Seattle, WA (US);

Nafea Bshara, Cupertino, CA (US);

Leo Parker Dirac, Seattle, WA (US);

Bratin Saha, Cupertino, CA (US);

Matthew James Wood, Seattle, WA (US);

Andrea Olgiati, Gilroy, CA (US);

Swaminathan Sivasubramanian, Sammamish, WA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 3/063 (2006.01); G06N 3/08 (2006.01);
U.S. Cl.
CPC ...
G06N 3/063 (2013.01); G06N 3/08 (2013.01);
Abstract

Implementations detailed herein include description of a computer-implemented method. In an implementation, the method at least includes receiving an application instance configuration, an application of the application instance to utilize a portion of an attached accelerator during execution of a machine learning model and the application instance configuration including an arithmetic precision of the machine learning model to be used in determining the portion of the accelerator to provision; provisioning the application instance and the portion of the accelerator attached to the application instance, wherein the application instance is implemented using a physical compute instance in a first location, wherein the portion of the accelerator is implemented using a physical accelerator in the second location; loading the machine learning model onto the portion of the accelerator; and performing inference using the loaded machine learning model of the application using the portion of the accelerator on the attached accelerator.


Find Patent Forward Citations

Loading…