Skip to main content

Showing 1–5 of 5 results for author: Black, A W

Searching in archive stat. Search in all archives.
.
  1. arXiv:2007.12948  [pdf, ps, other

    eess.AS cs.LG cs.SD stat.ML

    Nonlinear ISA with Auxiliary Variables for Learning Speech Representations

    Authors: Amrith Setlur, Barnabas Poczos, Alan W Black

    Abstract: This paper extends recent work on nonlinear Independent Component Analysis (ICA) by introducing a theoretical framework for nonlinear Independent Subspace Analysis (ISA) in the presence of auxiliary variables. Observed high dimensional acoustic features like log Mel spectrograms can be considered as surface level manifestations of nonlinear transformations over individual multivariate sources of i… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: To be presented at Interspeech 2020

  2. arXiv:1909.09699  [pdf, other

    cs.CL cs.LG stat.ML

    Induction and Reference of Entities in a Visual Story

    Authors: Ruo-Ping Dong, Khyathi Raghavi Chandu, Alan W Black

    Abstract: We are enveloped by stories of visual interpretations in our everyday lives. The way we narrate a story often comprises of two stages, which are, forming a central mind map of entities and then weaving a story around them. A contributing factor to coherence is not just basing the story on these entities but also, referring to them using appropriate terms to avoid repetition. In this paper, we addr… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: 9 pages, 4 figures, 3 tables

  3. arXiv:1907.08259  [pdf, ps, other

    cs.LG cs.CL stat.ML

    WriterForcing: Generating more interesting story endings

    Authors: Prakhar Gupta, Vinayshekhar Bannihatti Kumar, Mukul Bhutani, Alan W Black

    Abstract: We study the problem of generating interesting endings for stories. Neural generative models have shown promising results for various text generation problems. Sequence to Sequence (Seq2Seq) models are typically trained to generate a single output sequence for a given input sequence. However, in the context of a story, multiple endings are possible. Seq2Seq models tend to ignore the context and ge… ▽ More

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: Accepted in ACL workshop on Storytelling 2019

  4. arXiv:1904.04047  [pdf, other

    cs.CL cs.LG stat.ML

    Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings

    Authors: Thomas Manzini, Yao Chong Lim, Yulia Tsvetkov, Alan W Black

    Abstract: Online texts -- across genres, registers, domains, and styles -- are riddled with human stereotypes, expressed in overt or subtle ways. Word embeddings, trained on these texts, perpetuate and amplify these stereotypes, and propagate biases to machine learning models that use word embeddings as features. In this work, we propose a method to debias word embeddings in multiclass settings such as race… ▽ More

    Submitted 1 July, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted as a conference paper at NAACL. 5 Pages excluding references, additional page for appendix

  5. arXiv:1904.00784  [pdf, ps, other

    cs.CL cs.LG stat.ML

    A Survey of Code-switched Speech and Language Processing

    Authors: Sunayana Sitaram, Khyathi Raghavi Chandu, Sai Krishna Rallabandi, Alan W Black

    Abstract: Code-switching, the alternation of languages within a conversation or utterance, is a common communicative phenomenon that occurs in multilingual communities across the world. This survey reviews computational approaches for code-switched Speech and Natural Language Processing. We motivate why processing code-switched text and speech is essential for building intelligent agents and systems that in… ▽ More

    Submitted 22 July, 2020; v1 submitted 25 March, 2019; originally announced April 2019.