Skip to main content

Showing 1–4 of 4 results for author: Amvrosiadis, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.15458  [pdf, other

    cs.LG cs.CL

    Validating Large Language Models with ReLM

    Authors: Michael Kuchnik, Virginia Smith, George Amvrosiadis

    Abstract: Although large language models (LLMs) have been touted for their ability to generate natural-sounding text, there are growing concerns around possible negative effects of LLMs such as data memorization, bias, and inappropriate language. Unfortunately, the complexity and generation capacities of LLMs make validating (and correcting) such concerns difficult. In this work, we introduce ReLM, a system… ▽ More

    Submitted 8 May, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  2. arXiv:2111.04131  [pdf, other

    cs.LG cs.PF

    Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines

    Authors: Michael Kuchnik, Ana Klimovic, Jiri Simsa, Virginia Smith, George Amvrosiadis

    Abstract: Input pipelines, which ingest and transform input data, are an essential part of training Machine Learning (ML) models. However, it is challenging to implement efficient input pipelines, as it requires reasoning about parallelism, asynchrony, and variability in fine-grained profiling information. Our analysis of over two million ML jobs in Google datacenters reveals that a significant fraction of… ▽ More

    Submitted 21 March, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

  3. arXiv:2009.02457  [pdf, other

    cs.NI cs.DC

    Unleashing In-network Computing on Scientific Workloads

    Authors: Daehyeok Kim, Ankush Jain, Zaoxing Liu, George Amvrosiadis, Damian Hazen, Bradley Settlemyer, Vyas Sekar

    Abstract: Many recent efforts have shown that in-network computing can benefit various datacenter applications. In this paper, we explore a relatively less-explored domain which we argue can benefit from in-network computing: scientific workloads in high-performance computing. By analyzing canonical examples of HPC applications, we observe unique opportunities and challenges for exploiting in-network comput… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

    Comments: 8 pages, 3 figures

  4. arXiv:1911.00472  [pdf, other

    cs.LG stat.ML

    Progressive Compressed Records: Taking a Byte out of Deep Learning Data

    Authors: Michael Kuchnik, George Amvrosiadis, Virginia Smith

    Abstract: Deep learning accelerators efficiently train over vast and growing amounts of data, placing a newfound burden on commodity networks and storage devices. A common approach to conserve bandwidth involves resizing or compressing data prior to training. We introduce Progressive Compressed Records (PCRs), a data format that uses compression to reduce the overhead of fetching and transporting data, effe… ▽ More

    Submitted 11 August, 2021; v1 submitted 1 November, 2019; originally announced November 2019.