Showing 1–3 of 3 results for author: Kashi, A

  1. arXiv:2412.19322

    cs.CE math.NA

    Mixed-precision numerics in scientific applications: survey and perspectives

    Authors: Aditya Kashi, Hao Lu, Wesley Brewer, David Rogers, Michael Matheson, Mallikarjun Shankar, Feiyi Wang

    Abstract: The explosive demand for artificial intelligence (AI) workloads has led to a significant increase in silicon area dedicated to lower-precision computations on recent high-performance computing hardware designs. However, mixed-precision capabilities, which can achieve performance improvements of 8x compared to double-precision in extreme compute-intensive workloads, remain largely untapped in most…

    Submitted 7 January, 2025; v1 submitted 26 December, 2024; originally announced December 2024.

    Comments: Submitted to IJHPCA

    MSC Class: 65Y10

    ACM Class: J.2

  2. arXiv:2406.16740

    cs.LG math.NA

    Learning the boundary-to-domain mapping using Lifting Product Fourier Neural Operators for partial differential equations

    Authors: Aditya Kashi, Arka Daw, Muralikrishnan Gopalakrishnan Meena, Hao Lu

    Abstract: Neural operators such as the Fourier Neural Operator (FNO) have been shown to provide resolution-independent deep learning models that can learn mappings between function spaces. For example, an initial condition can be mapped to the solution of a partial differential equation (PDE) at a future time-step using a neural operator. Despite the popularity of neural operators, their use to predict solu…

    Submitted 1 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024 AI for Science Workshop

    MSC Class: 65N99; 68T07

    ACM Class: I.2.1; J.2

  3. arXiv:1912.00539

    math.NA cs.DC

    An asynchronous incomplete block LU preconditioner for computational fluid dynamics on unstructured grids

    Authors: Aditya Kashi, Siva Nadarajah

    Abstract: We present a study of the effectiveness of asynchronous incomplete LU factorization preconditioners for the time-implicit solution of compressible flow problems while exploiting thread-parallelism within a compute node. A block variant of the asynchronous fine-grain parallel preconditioner adapted to a finite volume discretization of the compressible Navier-Stokes equations on unstructured grids i…

    Submitted 4 October, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

    Comments: Accepted by SIAM SISC

    MSC Class: 65F08; 65Y05; 65N22