Skip to main content

Showing 1–2 of 2 results for author: Bayliss, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.18430  [pdf

    cs.SE

    Efficiency, Expressivity, and Extensibility in a Close-to-Metal NPU Programming Interface

    Authors: Erika Hunhoff, Joseph Melber, Kristof Denolf, Andra Bisca, Samuel Bayliss, Stephen Neuendorffer, Jeff Fifield, Jack Lo, Pranathi Vasireddy, Phil James-Roxby, Eric Keller

    Abstract: Accelerators such as neural processing units (NPUs) deliver an enticing balance of performance and efficiency compared to general purpose compute architectures. However, effectively leveraging accelerator capabilities is not always simple: low-level programming toolkits may require substantial developer effort while high-level programming toolkits may abstract critical optimization features. Thi… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted FCCM 25; artifact submitted for evaluation. IRON available at https://github.com/Xilinx/mlir-aie

  2. arXiv:2006.01331  [pdf, other

    cs.DC

    Vyasa: A High-Performance Vectorizing Compiler for Tensor Convolutions on the Xilinx AI Engine

    Authors: Prasanth Chatarasi, Stephen Neuendorffer, Samuel Bayliss, Kees Vissers, Vivek Sarkar

    Abstract: Xilinx's AI Engine is a recent industry example of energy-efficient vector processing that includes novel support for 2D SIMD datapaths and shuffle interconnection network. The current approach to programming the AI Engine relies on a C/C++ API for vector intrinsics. While an advance over assembly-level programming, it requires the programmer to specify a number of low-level operations based on de… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.