Skip to main content

Showing 1–6 of 6 results for author: Wohlwend, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.04624  [pdf, other

    q-bio.BM cs.LG

    Iterative Refinement Graph Neural Network for Antibody Sequence-Structure Co-design

    Authors: Wengong Jin, Jeremy Wohlwend, Regina Barzilay, Tommi Jaakkola

    Abstract: Antibodies are versatile proteins that bind to pathogens like viruses and stimulate the adaptive immune system. The specificity of antibody binding is determined by complementarity-determining regions (CDRs) at the tips of these Y-shaped proteins. In this paper, we propose a generative model to automatically design the CDRs of antibodies with enhanced binding specificity or neutralization capabili… ▽ More

    Submitted 27 January, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted to ICLR 2022

  2. arXiv:2009.07253  [pdf, other

    cs.CL cs.LG

    Autoregressive Knowledge Distillation through Imitation Learning

    Authors: Alexander Lin, Jeremy Wohlwend, Howard Chen, Tao Lei

    Abstract: The performance of autoregressive models on natural language generation tasks has dramatically improved due to the adoption of deep, self-attentive architectures. However, these gains have come at the cost of hindering inference speed, making state-of-the-art models cumbersome to deploy in real-world, time-sensitive settings. We develop a compression technique for autoregressive models that is dri… ▽ More

    Submitted 28 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

  3. arXiv:2005.10469  [pdf, other

    eess.AS cs.CL cs.SD

    ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition

    Authors: Jing Pan, Joshua Shapiro, Jeremy Wohlwend, Kyu J. Han, Tao Lei, Tao Ma

    Abstract: In this paper we present state-of-the-art (SOTA) performance on the LibriSpeech corpus with two novel neural network architectures, a multistream CNN for acoustic modeling and a self-attentive simple recurrent unit (SRU) for language modeling. In the hybrid ASR framework, the multistream CNN acoustic model processes an input of speech frames in multiple parallel pipelines where each stream has a u… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020

  4. arXiv:1911.01026  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    Metric Learning for Dynamic Text Classification

    Authors: Jeremy Wohlwend, Ethan R. Elenberg, Samuel Altschul, Shawn Henry, Tao Lei

    Abstract: Traditional text classifiers are limited to predicting over a fixed set of labels. However, in many real-world applications the label set is frequently changing. For example, in intent classification, new intents may be added over time while others are removed. We propose to address the problem of dynamic text classification by replacing the traditional, fixed-size output layer with a learned, sem… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

  5. Structured Pruning of Large Language Models

    Authors: Ziheng Wang, Jeremy Wohlwend, Tao Lei

    Abstract: Large language models have recently achieved state of the art performance across a wide variety of natural language tasks. Meanwhile, the size of these models and their latency have significantly increased, which makes their usage costly, and raises an interesting question: do language models need to be large? We study this question through the lens of model compression. We present a generic, stru… ▽ More

    Submitted 28 March, 2021; v1 submitted 10 October, 2019; originally announced October 2019.

  6. arXiv:1906.03209  [pdf, other

    cs.CL

    Building a Production Model for Retrieval-Based Chatbots

    Authors: Kyle Swanson, Lili Yu, Christopher Fox, Jeremy Wohlwend, Tao Lei

    Abstract: Response suggestion is an important task for building human-computer conversation systems. Recent approaches to conversation modeling have introduced new model architectures with impressive results, but relatively little attention has been paid to whether these models would be practical in a production setting. In this paper, we describe the unique challenges of building a production retrieval-bas… ▽ More

    Submitted 1 August, 2019; v1 submitted 7 June, 2019; originally announced June 2019.