-
Landau Gauge Fixing on GPUs and String Tension
Abstract: We explore the performance of CUDA in performing Landau gauge fixing in Lattice QCD, using the steepest descent method with Fourier acceleration. The code performance was tested on a Tesla C2070 (Fermi architecture). We also present a study of the string tension at finite temperature in the confined phase. The string tension is extracted from the color averaged free energy and from the color single…
Submitted 8 October, 2012; v1 submitted 6 August, 2012; originally announced August 2012.
Comments: 7 pages, 4 figures, 1 table. Contribution to the International Meeting "Excited QCD", Peniche, Portugal, 06 - 12 May 2012
Journal ref: Acta Phys.Polon.Supp. 5 (2012) 1135-1141
-
CuBA - a CUDA implementation of BAMPS
Abstract: Using CUDA as programming language, we create a code named CuBA which is based on the CPU code "Boltzmann Approach for Many Parton Scattering (BAMPS)" developed in Frankfurt in order to study a system of many colliding particles resulting from heavy ion collisions. Furthermore, we benchmark our code with the Riemann Problem and compare the results with BAMPS. They demonstrate an improvement of the…
Submitted 4 August, 2012; originally announced August 2012.
Comments: work done partly under the PTQCD Collaboration, contribution presented by Ulrike Eilhauer at the International Meeting "Excited QCD", Peniche, Portugal, 06 - 12 May 2012
-
Landau Gauge Fixing on GPUs
Abstract: In this paper we present and explore the performance of Landau gauge fixing in GPUs using CUDA. We consider the steepest descent algorithm with Fourier acceleration, and compare the GPU performance with a parallel CPU implementation. Using $32^4$ lattice volumes, we find that the computational power of a single Tesla C2070 GPU is equivalent to approximately 256 CPU cores.
Submitted 8 October, 2012; v1 submitted 4 June, 2012; originally announced June 2012.
Comments: 10 pages, 3 figures and 3 tables
Journal ref: Computer Physics Communications 184 (2013) pp. 124-129
-
Generating SU(Nc) pure gauge lattice QCD configurations on GPUs with CUDA
Abstract: The starting point of any lattice QCD computation is the generation of a Markov chain of gauge field configurations. Due to the large number of lattice links and due to the matrix multiplications, generating SU(Nc) lattice QCD configurations is a highly demanding computational task, requiring advanced computer parallel architectures such as clusters of several Central Processing Units (CPUs) or Gr…
Submitted 22 October, 2012; v1 submitted 19 December, 2011; originally announced December 2011.
Comments: 17 pages, 12 figures and 2 tables, minor corrections, work partly done under the PTQCD collaboration (http://nemea.ist.utl.pt/~ptqcd)
Journal ref: Computer Physics Communications 184 (2013) pp. 509-518
-
SU(2) Lattice Gauge Theory Simulations on Fermi GPUs
Abstract: In this work we explore the performance of CUDA in quenched lattice SU(2) simulations. CUDA, NVIDIA Compute Unified Device Architecture, is a hardware and software architecture developed by NVIDIA for computing on the GPU. We present an analysis and performance comparison between the GPU and CPU in single and double precision. Analyses with multiple GPUs and two different architectures (G200 and F…
Submitted 11 March, 2011; v1 submitted 22 October, 2010; originally announced October 2010.
Comments: 20 pages, 11 figures, 3 tables, accepted in Journal of Computational Physics
Journal ref: J.Comput.Phys.230:3998-4010,2011
-
Lattice SU(2) on GPU's
Abstract: We discuss the CUDA approach to the simulation of pure gauge Lattice SU(2). CUDA is a hardware and software architecture developed by NVIDIA for computing on the GPU. We present an analysis and performance comparison between the GPU and CPU with single precision. Analyses with single and multiple GPUs, using CUDA and OpenMP, are also presented. In order to obtain a high performance, the code must…
Submitted 7 October, 2010; originally announced October 2010.
Comments: 7 pages, 6 figures, contribution to the proceedings of the XXVIII International Symposium on Lattice Field Theory - LAT2010, July 14-19 2010, Villasimius, Sardinia, Italy
Journal ref: PoS Lattice2010:024,2010
-
Iterative method to compute the Fermat points and Fermat distances of multiquarks
Abstract: The multiquark confining potential is proportional to the total distance of the fundamental strings linking the quarks and antiquarks. We address the computation of the total string distance and of the Fermat points where the different strings meet. For a meson (quark-antiquark system) the distance is trivially the quark-antiquark distance. For a baryon (three quark system) the problem was solved…
Submitted 3 December, 2008; originally announced December 2008.
Comments: 13 pages, 6 figures, 1 table
Journal ref: Phys.Lett.B674:98-102,2009
-
Time dependent simulation of the Driven Lid Cavity at High Reynolds Number
Abstract: In this work, numerical solutions of the two dimensional time dependent incompressible flow, in a driven cavity at high Reynolds number Re, are presented. At high Re, there is a controversy. Some studies predicted that the flow is steady, others found time dependent non-steady flow, either periodic or aperiodic. In this study, the driven lid cavity is successfully solved using a very fine grid m…
Submitted 20 November, 2009; v1 submitted 18 September, 2008; originally announced September 2008.
Comments: 20 pages, 11 figures, 2 tables. We changed the algorithm from first order to fourth order temporal accuracy, as well as the stream equation; the results and figures were therefore updated and some figures were removed. We use GPUs with double precision capabilities to solve this problem