Skip to main content

Showing 1–4 of 4 results for author: Amaral, J N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.10205  [pdf, other

    cs.DL

    Effective Communication of Scientific Results

    Authors: José Nelson Amaral

    Abstract: Communication is essential for the advancement of Science. Technology advances and the proliferation of personal devices have changed the ways in which people communicate in all aspects of life. Scientific communication has also been profoundly affected by such changes, and thus it is important to reflect on effective ways to communicate scientific results to scientists that are flooded with infor… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 14 pages manuscript

  2. arXiv:2305.18236  [pdf, ps, other

    cs.DC cs.PF

    Fast Matrix Multiplication via Compiler-only Layered Data Reorganization and Intrinsic Lowering

    Authors: Braedy Kuzma, Ivan Korostelev, João P. L. de Carvalho, José E. Moreira, Christopher Barton, Guido Araujo, José Nelson Amaral

    Abstract: The resurgence of machine learning has increased the demand for high-performance basic linear algebra subroutines (BLAS), which have long depended on libraries to achieve peak performance on commodity hardware. High-performance BLAS implementations rely on a layered approach that consists of tiling and packing layers, for data (re)organization, and micro kernels that perform the actual computation… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    ACM Class: C.4

  3. arXiv:2303.04739  [pdf, other

    cs.CV cs.AR cs.LG cs.PF

    Advancing Direct Convolution using Convolution Slicing Optimization and ISA Extensions

    Authors: Victor Ferrari, Rafael Sousa, Marcio Pereira, João P. L. de Carvalho, José Nelson Amaral, José Moreira, Guido Araujo

    Abstract: Convolution is one of the most computationally intensive operations that must be performed for machine-learning model inference. A traditional approach to compute convolutions is known as the Im2Col + BLAS method. This paper proposes SConv: a direct-convolution algorithm based on a MLIR/LLVM code-generation toolchain that can be integrated into machine-learning compilers . This algorithm introduce… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: 15 pages, 11 figures

  4. arXiv:2005.09085  [pdf, other

    cs.CY

    Flexibility Is Key in Organizing a Global Professional Conference Online: The ICPE 2020 Experience in the COVID-19 Era

    Authors: Alexandru Iosup, Catia Trubiani, Anne Koziolek, José Nelson Amaral, Andre B. Bondi, Andreas Brunnert

    Abstract: Organizing professional conferences online has never been more timely. Responding to the new challenges raised by COVID-19, the organizers of the ACM/SPEC International Conference on Performance Engineering 2020 had to address the question: How should we organize these conferences online? This article summarizes their successful answer.

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 18 pages, 32 figures, the 11th ACM/SPEC International Conference on Performance Engineering (ICPE 2020)

    ACM Class: K.4.1; K.4.2; K.7.0