-
APEnet+: a 3D toroidal network enabling Petaflops scale Lattice QCD simulations on commodity clusters
Authors:
Roberto Ammendola,
Andrea Biagioni,
Ottorino Frezza,
Francesca Lo Cicero,
Alessandro Lonardo,
Pier Paolucci,
Roberto Petronzio,
Davide Rossetti,
Andrea Salamon,
Gaetano Salina,
Francesco Simula,
Nazario Tantalo,
Laura Tosoratto,
Piero Vicini
Abstract:
Many scientific computations need multi-node parallelism for matching up both space (memory) and time (speed) ever-increasing requirements. The use of GPUs as accelerators introduces yet another level of complexity for the programmer and may potentially result in large overheads due to the complex memory hierarchy. Additionally, top-notch problems may easily employ more than a Petaflops of sustain…
▽ More
Many scientific computations need multi-node parallelism for matching up both space (memory) and time (speed) ever-increasing requirements. The use of GPUs as accelerators introduces yet another level of complexity for the programmer and may potentially result in large overheads due to the complex memory hierarchy. Additionally, top-notch problems may easily employ more than a Petaflops of sustained computing power, requiring thousands of GPUs orchestrated with some parallel programming model. Here we describe APEnet+, the new generation of our interconnect, which scales up to tens of thousands of nodes with linear cost, thus improving the price/performance ratio on large clusters. The project target is the development of the Apelink+ host adapter featuring a low latency, high bandwidth direct network, state-of-the-art wire speeds on the links and a PCIe X8 gen2 host interface. It features hardware support for the RDMA programming model and experimental acceleration of GPU networking. A Linux kernel driver, a set of low-level RDMA APIs and an OpenMPI library driver are available, allowing for painless porting of standard applications. Finally, we give an insight of future work and intended developments.
△ Less
Submitted 1 December, 2010;
originally announced December 2010.
-
apeNEXT: A multi-TFlops Computer for Simulations in Lattice Gauge Theory
Authors:
F. Bodin,
Ph. Boucaud,
N. Cabibbo,
F. Di Carlo,
R. De Pietri,
F. Di Renzo,
H. Kaldass,
A. Lonardo,
M. Lukyanov,
S. De Luca,
J. Micheli,
V. Morenas,
O. Pene,
D. Pleiter,
N. Paschedag,
F. Rapuano,
D. Rossetti,
L. Sartori,
F. Schifano,
H. Simma,
R. Tripiccione,
P. Vicini
Abstract:
We present the APE (Array Processor Experiment) project for the development of dedicated parallel computers for numerical simulations in lattice gauge theories. While APEmille is a production machine in today's physics simulations at various sites in Europe, a new machine, apeNEXT, is currently being developed to provide multi-Tflops computing performance. Like previous APE machines, the new sup…
▽ More
We present the APE (Array Processor Experiment) project for the development of dedicated parallel computers for numerical simulations in lattice gauge theories. While APEmille is a production machine in today's physics simulations at various sites in Europe, a new machine, apeNEXT, is currently being developed to provide multi-Tflops computing performance. Like previous APE machines, the new supercomputer is largely custom designed and specifically optimized for simulations of Lattice QCD.
△ Less
Submitted 8 October, 2003; v1 submitted 2 September, 2003;
originally announced September 2003.
-
The apeNEXT project (Status report)
Authors:
F. Bodin,
Ph. Boucaud,
J. Micheli,
O. Pene,
N. Cabibbo,
F. Di Carlo,
A. Lonardo,
S. de Luca,
F. Rapuano,
D. Rossetti,
P. Vicini,
R. De Pietri,
F. Di Renzo,
H. Kaldass,
N. Paschedag,
H. Simma,
V. Morenas,
D. Pleiter,
L. Sartori,
F. Schifano,
R. Tripiccione
Abstract:
We present the current status of the apeNEXT project. Aim of this project is the development of the next generation of APE machines which will provide multi-teraflop computing power. Like previous machines, apeNEXT is based on a custom designed processor, which is specifically optimized for simulating QCD. We discuss the machine design, report on benchmarks, and give an overview on the status of…
▽ More
We present the current status of the apeNEXT project. Aim of this project is the development of the next generation of APE machines which will provide multi-teraflop computing power. Like previous machines, apeNEXT is based on a custom designed processor, which is specifically optimized for simulating QCD. We discuss the machine design, report on benchmarks, and give an overview on the status of the software development.
△ Less
Submitted 4 September, 2003; v1 submitted 13 June, 2003;
originally announced June 2003.
-
Status of the apeNEXT project
Authors:
R. Ammendola,
F. Bodin,
Ph. Boucaud,
N. Cabibbo,
F. Di Carlo,
R. De Pietri,
F. Di Renzo,
W. Errico,
A. Fucci,
M. Guagnelli,
H. Kaldass,
A. Lonardo,
S. de Luca,
J. Micheli,
V. Morenas,
O. Pene,
R. Petronzio,
F. Palombi,
D. Pleiter,
N. Paschedag,
F. Rapuano,
P. De Riso,
D. Rossetti,
A. Salamon,
G. Salina
, et al. (5 additional authors not shown)
Abstract:
We present the current status of the apeNEXT project. Aim of this project is the development of the next generation of APE machines which will provide multi-teraflop computing power. Like previous machines, apeNEXT is based on a custom designed processor, which is specifically optimized for simulating QCD. We discuss the machine design, report on benchmarks, and give an overview on the status of…
▽ More
We present the current status of the apeNEXT project. Aim of this project is the development of the next generation of APE machines which will provide multi-teraflop computing power. Like previous machines, apeNEXT is based on a custom designed processor, which is specifically optimized for simulating QCD. We discuss the machine design, report on benchmarks, and give an overview on the status of the software development.
△ Less
Submitted 8 October, 2003; v1 submitted 15 November, 2002;
originally announced November 2002.
-
The APENEXT project
Authors:
F. Bodin,
P. Boucaud,
N. Cabibbo,
F. Calvayrac,
M. Della Morte,
R. De Pietri,
P. De Riso,
F. Di Carlo,
F. Di Renzo,
W. Errico,
R. Frezzotti,
U. Gensch,
T. Giorgino,
M. Guagnelli,
N. Herve,
H. Kaldass,
A. Lonardo,
M. Lukyanov,
G. Magazzu,
J. Micheli,
V. Morenas,
L. Mori,
F. Palombi,
N. Paschedag,
O. Pene
, et al. (9 additional authors not shown)
Abstract:
APENEXT is a new generation APE processor, optimized for LGT simulations. The project follows the basic ideas of previous APE machines and develops simple and cheap parallel systems with multi T-Flops processing power. This paper describes the main features of this new development.
APENEXT is a new generation APE processor, optimized for LGT simulations. The project follows the basic ideas of previous APE machines and develops simple and cheap parallel systems with multi T-Flops processing power. This paper describes the main features of this new development.
△ Less
Submitted 25 October, 2001;
originally announced October 2001.
-
Status of APEmille
Authors:
APE-Collaboration,
:,
A. Bartoloni,
P. Boucaud,
N. Cabibbo,
F. Calvayrac,
M. Della Morte,
R. De Pietri,
P. De Riso,
F. Di Carlo,
F. Di Renzo,
W. Errico,
R. Frezzotti,
T. Giorgino,
J. Heitger,
A. Lonardo,
M. Loukianov,
G. Magazzu,
J. Micheli,
V. Morenas,
N. Paschedag,
O. Pene,
R. Petronzio,
D. Pleiter,
F. Rapuano
, et al. (9 additional authors not shown)
Abstract:
This paper presents the status of the APEmille project, which is essentially completed, as far as machine development and construction is concerned. Several large installations of APEmille are in use for physics production runs leading to many new results presented at this conference. This paper briefly summarizes the APEmille architecture, reviews the status of the installations and presents so…
▽ More
This paper presents the status of the APEmille project, which is essentially completed, as far as machine development and construction is concerned. Several large installations of APEmille are in use for physics production runs leading to many new results presented at this conference. This paper briefly summarizes the APEmille architecture, reviews the status of the installations and presents some performance figures for physics codes.
△ Less
Submitted 17 October, 2001;
originally announced October 2001.
-
Progress and status of APEmille
Authors:
APE collaboration,
A. Bartoloni,
S. Cabasino,
N. Cabibbo,
M. Cosimi,
P. De Riso,
W. Errico,
S. Giovannetti,
F. Laico,
H. Leich,
A. Lonardo,
G. Magazzu,
A. Michelotti,
E. Panizzi,
P. S. Paolucci,
D. Rossetti,
U. Schwendicke,
H. Simma,
K. H. Sulanke,
M. Torelli,
R. Tripiccione,
P. Vicini
Abstract:
We report on the progress and status of the APEmille project: a SIMD parallel computer with a peak performance in the TeraFlops range which is now in an advanced development phase. We discuss the hardware and software architecture, and present some performance estimates for Lattice Gauge Theory (LGT) applications.
We report on the progress and status of the APEmille project: a SIMD parallel computer with a peak performance in the TeraFlops range which is now in an advanced development phase. We discuss the hardware and software architecture, and present some performance estimates for Lattice Gauge Theory (LGT) applications.
△ Less
Submitted 1 October, 1997;
originally announced October 1997.