Showing 1–2 of 2 results for author: Pugh, D R

Searching in archive cs.
  1. arXiv:2411.06402  [pdf, other]

    cs.CL cs.AI

    Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models

    Authors: Sultan Alrashed, Dmitrii Khizbullin, David R. Pugh

    Abstract: As large language models (LLMs) grow and develop, so do their data demands. This is especially true for multilingual LLMs, where the scarcity of high-quality and readily available data online has led to a multitude of synthetic dataset generation approaches. A key technique in this space is machine translation (MT), where high-quality English text is adapted to a target, comparatively low-resource…

    Submitted 10 November, 2024; originally announced November 2024.

  2. arXiv:2405.16623  [pdf, other]

    cs.LG cs.AR cs.PF

    Graph neural networks with configuration cross-attention for tensor compilers

    Authors: Dmitrii Khizbullin, Eduardo Rocha de Andrade, Thanh Hau Nguyen, Matheus Pedroza Ferreira, David R. Pugh

    Abstract: With the recent popularity of neural networks comes the need for efficient serving of inference workloads. A neural network inference workload can be represented as a computational graph with nodes as operators transforming multidimensional tensors. The tensors can be transposed and/or tiled in a combinatorially large number of ways, some configurations leading to accelerated inference. We propose…

    Submitted 25 November, 2024; v1 submitted 26 May, 2024; originally announced May 2024.