Skip to main content

Showing 1–3 of 3 results for author: Perry, D J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.05619  [pdf, other

    cs.CL

    Effective Proxy for Human Labeling: Ensemble Disagreement Scores in Large Language Models for Industrial NLP

    Authors: Wei Du, Laksh Advani, Yashmeet Gambhir, Daniel J Perry, Prashant Shiralkar, Zhengzheng Xing, Aaron Colak

    Abstract: Large language models (LLMs) have demonstrated significant capability to generalize across a large number of NLP tasks. For industry applications, it is imperative to assess the performance of the LLM on unlabeled production data from time to time to validate for a real-world setting. Human labeling to assess model error requires considerable expense and time delay. Here we demonstrate that ensemb… ▽ More

    Submitted 19 November, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Camera ready version for 2023 EMNLP (The Third Workshop on Natural Language Generation, Evaluation, and Metrics (GEM))

  2. arXiv:2012.14901  [pdf, other

    cs.GR

    Visualization of topology optimization designs with representative subset selection

    Authors: Daniel J Perry, Vahid Keshavarzzadeh, Shireen Y Elhabian, Robert M Kirby, Michael Gleicher, Ross T Whitaker

    Abstract: An important new trend in additive manufacturing is the use of optimization to automatically design industrial objects, such as beams, rudders or wings. Topology optimization, as it is often called, computes the best configuration of material over a 3D space, typically represented as a grid, in order to satisfy or optimize physical parameters. Designers using these automated systems often seek to… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

    Comments: 14 pages, 10 figures

    ACM Class: I.3.8; G.1.3

  3. arXiv:2010.10499  [pdf, other

    cs.CL cs.LG

    Optimal Subarchitecture Extraction For BERT

    Authors: Adrian de Wynter, Daniel J. Perry

    Abstract: We extract an optimal subset of architectural parameters for the BERT architecture from Devlin et al. (2018) by applying recent breakthroughs in algorithms for neural architecture search. This optimal subset, which we refer to as "Bort", is demonstrably smaller, having an effective (that is, not counting the embedding layer) size of $5.5\%$ the original BERT-large architecture, and $16\%$ of the n… ▽ More

    Submitted 6 November, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Preprint. Under review. Corrected typos on v2