Skip to main content

Showing 1–7 of 7 results for author: Eisenstein, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2106.00545  [pdf, other

    cs.LG cs.AI stat.ML

    Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests

    Authors: Victor Veitch, Alexander D'Amour, Steve Yadlowsky, Jacob Eisenstein

    Abstract: Informally, a 'spurious correlation' is the dependence of a model on some aspect of the input data that an analyst thinks shouldn't matter. In machine learning, these have a know-it-when-you-see-it character; e.g., changing the gender of a sentence's subject changes a sentiment predictor's output. To check for spurious correlations, we can 'stress test' models by perturbing irrelevant parts of inp… ▽ More

    Submitted 2 November, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2021 (spotlight)

  2. arXiv:2011.03395  [pdf, other

    cs.LG stat.ML

    Underspecification Presents Challenges for Credibility in Modern Machine Learning

    Authors: Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne , et al. (15 additional authors not shown)

    Abstract: ML models often exhibit unexpectedly poor behavior when they are deployed in real-world domains. We identify underspecification as a key reason for these failures. An ML pipeline is underspecified when it can return many predictors with equivalently strong held-out performance in the training domain. Underspecification is common in modern ML pipelines, such as those based on deep learning. Predict… ▽ More

    Submitted 24 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Updates: Updated statistical analysis in Section 6; Additional citations

  3. arXiv:1902.01509  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Training on Synthetic Noise Improves Robustness to Natural Noise in Machine Translation

    Authors: Vladimir Karpukhin, Omer Levy, Jacob Eisenstein, Marjan Ghazvininejad

    Abstract: We consider the problem of making machine translation more robust to character-level variation at the source side, such as typos. Existing methods achieve greater coverage by applying subword models such as byte-pair encoding (BPE) and character-level encoders, but these methods are highly sensitive to spelling mistakes. We show how training on a mild amount of random synthetic noise can dramatica… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

  4. arXiv:1802.05695  [pdf, other

    cs.CL cs.LG stat.ML

    Explainable Prediction of Medical Codes from Clinical Text

    Authors: James Mullenbach, Sarah Wiegreffe, Jon Duke, Jimeng Sun, Jacob Eisenstein

    Abstract: Clinical notes are text documents that are created by clinicians for each patient encounter. They are typically accompanied by medical codes, which describe the diagnosis and treatment. Annotating these codes is labor intensive and error prone; furthermore, the connection between the codes and the text is not annotated, obscuring the reasons and details behind specific diagnoses and treatments. We… ▽ More

    Submitted 16 April, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: NAACL 2018

  5. arXiv:1611.06933  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Unsupervised Learning for Lexicon-Based Classification

    Authors: Jacob Eisenstein

    Abstract: In lexicon-based classification, documents are assigned labels by comparing the number of words that appear from two opposed lexicons, such as positive and negative sentiment. Creating such words lists is often easier than labeling instances, and they can be debugged by non-experts if classification performance is unsatisfactory. However, there is little analysis or justification of this classific… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: to appear in AAAI 2017

    ACM Class: I.2.6; I.2.7

  6. arXiv:1603.01913  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

    Authors: Yangfeng Ji, Gholamreza Haffari, Jacob Eisenstein

    Abstract: This paper presents a novel latent variable recurrent neural network architecture for jointly modeling sequences of words and (possibly latent) discourse relations between adjacent sentences. A recurrent neural network generates individual words, thus reaping the benefits of discriminatively-trained vector representations. The discourse relations are represented with a latent variable, which can b… ▽ More

    Submitted 5 April, 2016; v1 submitted 6 March, 2016; originally announced March 2016.

    Comments: NAACL 2016 camera ready, 11 pages

  7. arXiv:1511.03962  [pdf, other

    cs.CL cs.LG stat.ML

    Document Context Language Models

    Authors: Yangfeng Ji, Trevor Cohn, Lingpeng Kong, Chris Dyer, Jacob Eisenstein

    Abstract: Text documents are structured on multiple levels of detail: individual words are related by syntax, but larger units of text are related by discourse structure. Existing language models generally fail to account for discourse structure, but it is crucial if we are to have language models that reward coherence and generate coherent texts. We present and empirically evaluate a set of multi-level rec… ▽ More

    Submitted 21 February, 2016; v1 submitted 12 November, 2015; originally announced November 2015.

    Comments: 10 pages, 3 figures