The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 12, 2016

Filed:

Sep. 24, 2014
Applicants:

Michael Heinz, Phoenixville, PA (US);

Todd Rimmer, Exton, PA (US);

James Kunz, Plymouth, MN (US);

Mark Debbage, Santa Clara, CA (US);

Inventors:

Michael Heinz, Phoenixville, PA (US);

Todd Rimmer, Exton, PA (US);

James Kunz, Plymouth, MN (US);

Mark Debbage, Santa Clara, CA (US);

Assignee:

Intel Corporation, Santa Clara, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04L 12/28 (2006.01); H04L 12/24 (2006.01); H04L 12/751 (2013.01);
U.S. Cl.
CPC ...
H04L 41/12 (2013.01); H04L 41/0816 (2013.01); H04L 41/0893 (2013.01); H04L 45/08 (2013.01);
Abstract

System, method, and apparatus for improving the performance of collective operations in High Performance Computing (HPC). Compute nodes in a networked HPC environment form collective groups to perform collective operations. A spanning tree is formed including the compute nodes and switches and links used to interconnect the compute nodes, wherein the spanning tree is configured such that there is only a single route between any pair of nodes in the tree. The compute nodes implement processes for performing the collective operations, which includes exchanging messages between processes executing on other compute nodes, wherein the messages contain indicia identifying collective operations they belong to. Each switch is configured to implement message forwarding operations for its portion of the spanning tree. Each of the nodes in the spanning tree implements a ratcheted cyclical state machine that is used for synchronizing collective operations, along with status messages that are exchanged between nodes. Transaction IDs are also used to detect out-of-order and lost messages.


Find Patent Forward Citations

Loading…