The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Oct. 31, 2023
Filed:
May. 20, 2020
Google Llc, Mountain View, CA (US);
Samuel Bengio, Los Altos, CA (US);
Mohammad Norouzi, Richmond Hill, CA;
Benoit Steiner, Mountain View, CA (US);
Jeffrey Adgate Dean, Palo Alto, CA (US);
Hieu Hy Pham, Menlo Park, CA (US);
Azalia Mirhoseini, San Jose, CA (US);
Quoc V. Le, Sunnyvale, CA (US);
Naveen Kumar, San Jose, CA (US);
Yuefeng Zhou, Sunnyvale, CA (US);
Rasmus Munk Larsen, San Jose, CA (US);
Google LLC, Mountain View, CA (US);
Abstract
A method for determining a placement for machine learning model operations across multiple hardware devices is described. The method includes receiving data specifying a machine learning model to be placed for distributed processing on multiple hardware devices; generating, from the data, a sequence of operation embeddings, each operation embedding in the sequence characterizing respective operations necessary to perform the processing of the machine learning model; processing the sequence of operation embeddings using a placement recurrent neural network in accordance with first values of a plurality network parameters of the placement recurrent neural network to generate a network output that defines a placement of the operations characterized by the operation embeddings in the sequence across the plurality of devices; and scheduling the machine learning model for processing by the multiple hardware devices by placing the operations on the multiple devices according to the placement defined by the network output.