Showing 1–10 of 10 results for author: Cohen, W W

Searching in archive stat.
  1. arXiv:2406.04291

    cs.LG stat.ML

    Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

    Authors: Adam Fisch, Joshua Maynez, R. Alex Hofer, Bhuwan Dhingra, Amir Globerson, William W. Cohen

    Abstract: Prediction-powered inference (PPI) is a method that improves statistical estimates based on limited human-labeled data. PPI achieves this by combining small amounts of human-labeled data with larger amounts of data labeled by a reasonably accurate -- but potentially biased -- automatic system, in a way that results in tighter confidence intervals for certain parameters of interest (e.g., the mean…

    Submitted 3 December, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2004.03658

    cs.LG cs.CL stat.ML

    Faithful Embeddings for Knowledge Base Queries

    Authors: Haitian Sun, Andrew O. Arnold, Tania Bedrax-Weiss, Fernando Pereira, William W. Cohen

    Abstract: The deductive closure of an ideal knowledge base (KB) contains exactly the logical queries that the KB can answer. However, in practice KBs are both incomplete and over-specified, failing to answer some queries that have real-world answers. Query embedding (QE) techniques have been recently proposed where KB entities and KB queries are represented jointly in an embedding space, supporting r…

    Submitted 28 January, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Published at 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  3. arXiv:2002.06115

    cs.CL cs.LG stat.ML

    Scalable Neural Methods for Reasoning With a Symbolic Knowledge Base

    Authors: William W. Cohen, Haitian Sun, R. Alex Hofer, Matthew Siegler

    Abstract: We describe a novel way of representing a symbolic knowledge base (KB) called a sparse-matrix reified KB. This representation enables neural modules that are fully differentiable, faithful to the original semantics of the KB, expressive enough to model multi-hop inferences, and scalable enough to use with realistically large KBs. The sparse-matrix reified KB can be distributed across multiple GPUs…

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Also published in ICLR2020 https://openreview.net/forum?id=BJlguT4YPr&noteId=BJlguT4YPr

  4. arXiv:1912.06074

    cs.LG cs.AI stat.ML

    Game Design for Eliciting Distinguishable Behavior

    Authors: Fan Yang, Liu Leqi, Yifan Wu, Zachary C. Lipton, Pradeep Ravikumar, William W. Cohen, Tom Mitchell

    Abstract: The ability to infer latent psychological traits from human behavior is key to developing personalized human-interacting machine learning systems. Approaches to infer such traits range from surveys to manually-constructed experiments and games. However, these traditional games are limited because they are typically designed based on heuristics. In this paper, we formulate the task of designing…

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  5. arXiv:1911.06111

    cs.CL cs.IR cs.LG stat.ML

    Instance-based Transfer Learning for Multilingual Deep Retrieval

    Authors: Andrew O. Arnold, William W. Cohen

    Abstract: We focus on the problem of search in the multilingual setting. Examining the problems of next-sentence prediction and inverse cloze, we show that at large scale, instance-based transfer learning is surprisingly effective in the multilingual setting, leading to positive transfer on all of the 35 target languages and two tasks tested. We analyze this improvement and argue that the most natural expla…

    Submitted 15 April, 2021; v1 submitted 8 November, 2019; originally announced November 2019.

    Journal ref: The Web Conference Workshop on Multilingual Search, 2021

  6. arXiv:1905.10417

    cs.LG cs.AI cs.CL stat.ML

    Differentiable Representations For Multihop Inference Rules

    Authors: William W. Cohen, Haitian Sun, R. Alex Hofer, Matthew Siegler

    Abstract: We present efficient differentiable implementations of second-order multi-hop reasoning using a large symbolic knowledge base (KB). We introduce a new operation which can be used to compositionally construct second-order multi-hop templates in a neural model, and evaluate a number of alternative implementations, with different time and memory trade-offs. These techniques scale to KBs with millions…

    Submitted 24 May, 2019; originally announced May 2019.

  7. arXiv:1806.05662

    cs.LG cs.CL cs.CV stat.ML

    GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations

    Authors: Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann LeCun

    Abstract: Modern deep transfer learning approaches have mainly focused on learning generic feature vectors from one task that are transferable to other tasks, such as word embeddings in language and pretrained convolutional features in vision. However, these approaches usually transfer unary features and largely ignore more structured graphical representations. This work explores the possibility of learning…

    Submitted 2 July, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

  8. arXiv:1804.09238

    cs.LG cs.AI stat.ML

    Semi-Supervised Learning with Declaratively Specified Entropy Constraints

    Authors: Haitian Sun, William W. Cohen, Lidong Bing

    Abstract: We propose a technique for declaratively specifying strategies for semi-supervised learning (SSL). The proposed method can be used to specify ensembles of semi-supervised learning, as well as agreement constraints and entropic regularization constraints between these learners, and can be used to model both well-known heuristics such as co-training and novel domain-specific heuristics. In addition…

    Submitted 18 May, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

  9. arXiv:1703.01557

    cs.LG cs.CL stat.ML

    Using Graphs of Classifiers to Impose Declarative Constraints on Semi-supervised Learning

    Authors: Lidong Bing, William W. Cohen, Bhuwan Dhingra

    Abstract: We propose a general approach to modeling semi-supervised learning (SSL) algorithms. Specifically, we present a declarative language for modeling both traditional supervised classification tasks and many SSL heuristics, including both well-known heuristics such as co-training and novel domain-specific heuristics. In addition to representing individual SSL heuristics, we show that multiple heuristi…

    Submitted 23 March, 2017; v1 submitted 4 March, 2017; originally announced March 2017.

    Comments: 8 pages, 3 figures

  10. arXiv:1602.04393

    cs.IR stat.ML

    Semantic Scan: Detecting Subtle, Spatially Localized Events in Text Streams

    Authors: Abhinav Maurya, Kenton Murray, Yandong Liu, Chris Dyer, William W. Cohen, Daniel B. Neill

    Abstract: Early detection and precise characterization of emerging topics in text streams can be highly useful in applications such as timely and targeted public health interventions and discovering evolving regional business trends. Many methods have been proposed for detecting emerging events in text streams using topic modeling. However, these methods have numerous shortcomings that make them unsuitable…

    Submitted 13 February, 2016; originally announced February 2016.

    Comments: 10 pages, 4 figures, KDD 2016 submission