Torus interconnect

Topic Tools

Papers

Proceedings Article•10.1109/CMPCON.1993.289660•

Cray T3D: a new dimension for Cray Research

[...]

R.E. Kessler¹, J.L. Schwarzmeier¹•Institutions (1)

1 Jan 1993

TL;DR: Cray Research's massively parallel processing (MPP) philosophy is presented, together with a brief description of the design of the Cray T3D, the first MPP designed by Cray Research, and the 3-D torus interprocessor interconnect is discussed.

...read moreread less

Abstract: The authors present Cray Research's massively parallel processing (MPP) philosophy, together with a brief description of the design of the Cray T3D, the first MPP designed by Cray Research. They give a brief overview of the important features of the Cray T3D, including the 3-D torus interprocessor interconnect. They discuss in more detail the motivation for the 3-D torus interconnect. Using a very simple capacity model of network performance, they show how three dimensions provide a solid balance between locality and scalability up to thousands of nodes. >

...read moreread less

341 citations

Proceedings Article•10.1145/237090.237144•

Synchronization and communication in the T3E multiprocessor

[...]

Steven L. Scott¹•Institutions (1)

Cray¹

1 Sep 1996

TL;DR: The T3E augments the memory interface of the DEC 21164 microprocessor with a large set of explicitly-managed, external registers (E-registers), which provide a rich set of atomic memory operations and a flexible, user-level messaging facility.

...read moreread less

Abstract: This paper describes the synchronization and communication primitives of the Cray T3E multiprocessor, a shared memory system scalable to 2048 processors. We discuss what we have learned from the T3D project (the predecessor to the T3E) and the rationale behind changes made for the T3E. We include performance measurements for various aspects of communication and synchronization.The T3E augments the memory interface of the DEC 21164 microprocessor with a large set of explicitly-managed, external registers (E-registers). E-registers are used as the source or target for all remote communication. They provide a highly pipelined interface to global memory that allows dozens of requests per processor to be outstanding. Through E-registers, the T3E provides a rich set of atomic memory operations and a flexible, user-level messaging facility. The T3E also provides a set of virtual hardware barrier/eureka networks that can be arbitrarily embedded into the 3D torus interconnect.

...read moreread less

317 citations

Journal Article•10.1109/MC.2009.370•

Tofu: A 6D Mesh/Torus Interconnect for Exascale Computers

[...]

Yuichiro Ajima¹, Shinji Sumimoto¹, Toshiyuki Shimizu¹•Institutions (1)

Fujitsu¹

01 Nov 2009-IEEE Computer

TL;DR: A new architecture with a six-dimensional mesh/torus topology achieves highly scalable and fault-tolerant interconnection networks for large-scale supercomputers that can exceed 10 petaflops.

...read moreread less

Abstract: A new architecture with a six-dimensional mesh/torus topology achieves highly scalable and fault-tolerant interconnection networks for large-scale supercomputers that can exceed 10 petaflops.

...read moreread less

220 citations

Proceedings Article•10.1109/ICPADS.2012.81•

Comparing the Performance of Blue Gene/Q with Leading Cray XE6 and InfiniBand Systems

[...]

Darren J. Kerbyson¹, Kevin J. Barker¹, Abhinav Vishnu¹, Adolfy Hoisie¹•Institutions (1)

Pacific Northwest National Laboratory¹

17 Dec 2012

TL;DR: It is shown that significant performance can be lost in normal production operation of the Cray XE6 and InfiniBand Clusters in comparison to Blue Gene/Q.

...read moreread less

Abstract: Three types of systems dominate the current High Performance Computing landscape: the Cray XE6, the IBM Blue Gene, and commodity clusters using InfiniBand. These systems have quite different characteristics making the choice for a particular deployment difficult. The XE6 uses Cray's proprietary Gemini 3-D torus interconnect with two nodes at each network endpoint. The latest IBM Blue Gene/Q uses a single socket integrating processor and communication in a 5-D torus network. InfiniBand provides the flexibility of using nodes from many vendors connected in many possible topologies. The performance characteristics of each vary vastly along with their utilization model. In this work we compare the performance of these three systems using a combination of micro-benchmarks and a set of production applications. We also discuss the causes of variability in performance across the systems and quantify where performance is lost using a combination of measurements and models. Our results show that significant performance can be lost in normal production operation of the Cray XE6 and InfiniBand Clusters in comparison to Blue Gene/Q.

...read moreread less

14 citations

Book Chapter•10.1007/978-3-319-78024-5_29•

Early Performance Evaluation of the Hybrid Cluster with Torus Interconnect Aimed at Molecular-Dynamics Simulations

[...]

10 Sep 2017

TL;DR: The Desmos cluster that consists of 32 hybrid nodes connected by a low-latency high-bandwidth torus interconnect is described, which verifies its ability to unite MPP systems speeding-up effectively MPI-based applications.

...read moreread less

Abstract: In this paper, we describe the Desmos cluster that consists of 32 hybrid nodes connected by a low-latency high-bandwidth torus interconnect. This cluster is aimed at cost-effective classical molecular dynamics calculations. We present strong scaling benchmarks for GROMACS, LAMMPS and VASP and compare the results with other HPC systems. This cluster serves as a test bed for the Angara interconnect that supports 3D and 4D torus network topologies, and verifies its ability to unite MPP systems speeding-up effectively MPI-based applications. We describe the interconnect presenting typical MPI benchmarks.

...read moreread less

13 citations

...

Expand

Year	Papers
2020	1
2018	1
2017	1
2016	1
2015	3
2014	1

Topic Tools

Papers

Cray T3D: a new dimension for Cray Research

Synchronization and communication in the T3E multiprocessor

Tofu: A 6D Mesh/Torus Interconnect for Exascale Computers

Comparing the Performance of Blue Gene/Q with Leading Cray XE6 and InfiniBand Systems

Early Performance Evaluation of the Hybrid Cluster with Torus Interconnect Aimed at Molecular-Dynamics Simulations

Related Topics (5)

Performance Metrics