The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 16, 2025

Filed:

Jun. 30, 2022
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Muthian Sivathanu, Chennai, IN;

Srinidhi Viswanatha, Bangalore, IN;

Bhargav Gulavani, Bengaluru, IN;

Dharma Kiritkumar Shukla, Bellevue, WA (US);

Rimma Vladimirovna Nehme, Bellevue, WA (US);

Amey Agrawal, Bangalore, IN;

Ramachandran Ramjee, Bengaluru, IN;

Kaustubh Welankar, Bangalore, IN;

Ravi Shreyas Anupindi, Bengaluru, IN;

Assignee:
Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 9/46 (2006.01); G06F 9/38 (2018.01); G06F 9/48 (2006.01); G06F 9/50 (2006.01); G06F 9/52 (2006.01); G06F 11/10 (2006.01); G06F 11/14 (2006.01);
U.S. Cl.
CPC ...
G06F 9/3893 (2013.01); G06F 9/461 (2013.01); G06F 9/4881 (2013.01); G06F 9/5016 (2013.01); G06F 9/522 (2013.01); G06F 11/1004 (2013.01); G06F 11/1407 (2013.01);
Abstract

The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.


Find Patent Forward Citations

Loading…