Skip to main content

Showing 1–3 of 3 results for author: Yviquel, H

Searching in archive cs. Search in all archives.
.
  1. The OpenMP Cluster Programming Model

    Authors: Hervé Yviquel, Marcio Pereira, Emílio Francesquini, Guilherme Valarini, Gustavo Leite, Pedro Rosso, Rodrigo Ceccato, Carla Cusihualpa, Vitoria Dias, Sandro Rigo, Alan Souza, Guido Araujo

    Abstract: Despite the various research initiatives and proposed programming models, efficient solutions for parallel programming in HPC clusters still rely on a complex combination of different programming models (e.g., OpenMP and MPI), languages (e.g., C++ and CUDA), and specialized runtimes (e.g., Charm++ and Legion). On the other hand, task parallelism has shown to be an efficient and seamless programmin… ▽ More

    Submitted 13 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 12 pages, 7 figures, 1 listing, to be published in the 51st International Conference on Parallel Processing Workshop Proceedings (ICPP Workshops 22)

    ACM Class: D.4.1; D.3.2

  2. arXiv:2202.04153  [pdf, other

    cs.PL

    Source Matching and Rewriting

    Authors: Vinicius Couto, Luciano Zago, Hervé Yviquel, Guido Araújo

    Abstract: A typical compiler flow relies on a uni-directional sequence of translation/optimization steps that lower the program abstract representation, making it hard to preserve higher-level program information across each transformation step. On the other hand, modern ISA extensions and hardware accelerators can benefit from the compiler's ability to detect and raise program idioms to acceleration instru… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: 10 pages, 7 figures

  3. arXiv:2103.10573  [pdf, other

    cs.DC

    Enabling OpenMP Task Parallelism on Multi-FPGAs

    Authors: R. Nepomuceno, R. Sterle, G. Valarini, M. Pereira, H. Yviquel, G. Araujo

    Abstract: FPGA-based hardware accelerators have received increasing attention mainly due to their ability to accelerate deep pipelined applications, thus resulting in higher computational performance and energy efficiency. Nevertheless, the amount of resources available on even the most powerful FPGA is still not enough to speed up very large modern workloads. To achieve that, FPGAs need to be interconnecte… ▽ More

    Submitted 21 March, 2021; v1 submitted 18 March, 2021; originally announced March 2021.