The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Sep. 23, 2025
Filed:
Dec. 15, 2022
Amazon Technologies, Inc., Seattle, WA (US);
Hongbin Zheng, San Jose, CA (US);
Yuwen Jia, Sunnyvale, CA (US);
Amazon Technologies, Inc., Seattle, WA (US);
Abstract
Techniques for implementing tensor parallel execution can include identifying a first tensor contraction operation in a compute flow, and slicing the first tensor contraction operation into a first set of multiple tensor contraction portions to have each compute engine of multiple compute engines perform a portion of the first tensor contraction operation. A set of slicing options can then be determined for a second tensor contraction operation that operates on a tensor result of the first tensor contraction operation. A cost for each slicing option is determined, and a slicing option having the lowest cost is selected. The second tensor contraction operation is sliced according to the selected slicing option to have each compute engine perform a portion of the second tensor contraction operation. Collective compute operations can be inserted in the compute flow for the first and second tensor contraction operations.