Skip to main content

Showing 1–8 of 8 results for author: Thickstun, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2306.08620  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Anticipatory Music Transformer

    Authors: John Thickstun, David Hall, Chris Donahue, Percy Liang

    Abstract: We introduce anticipation: a method for constructing a controllable generative model of a temporal point process (the event process) conditioned asynchronously on realizations of a second, correlated process (the control process). We achieve this by interleaving sequences of events and controls, such that controls appear following stopping times in the event sequence. This work is motivated by pro… ▽ More

    Submitted 25 July, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: TMLR accepted version

  2. arXiv:2105.08164  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics

    Authors: Vivek Jayaram, John Thickstun

    Abstract: This paper introduces an alternative approach to sampling from autoregressive models. Autoregressive models are typically sampled sequentially, according to the transition dynamics defined by the model. Instead, we propose a sampling procedure that initializes a sequence with white noise and follows a Markov chain defined by Langevin dynamics on the global log-likelihood of the sequence. This appr… ▽ More

    Submitted 16 December, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: 16 pages, 7 figures, ICML 2021

  3. arXiv:2012.06684  [pdf, other

    cs.LG stat.ML

    Faster Policy Learning with Continuous-Time Gradients

    Authors: Samuel Ainsworth, Kendall Lowrey, John Thickstun, Zaid Harchaoui, Siddhartha Srinivasa

    Abstract: We study the estimation of policy gradients for continuous-time systems with known dynamics. By reframing policy learning in continuous-time, we show that it is possible construct a more efficient and accurate gradient estimator. The standard back-propagation through time estimator (BPTT) computes exact gradients for a crude discretization of the continuous-time system. In contrast, we approximate… ▽ More

    Submitted 24 June, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Journal ref: L4DC 2021

  4. arXiv:2002.07942  [pdf, other

    cs.LG stat.ML

    Source Separation with Deep Generative Priors

    Authors: Vivek Jayaram, John Thickstun

    Abstract: Despite substantial progress in signal source separation, results for richly structured data continue to contain perceptible artifacts. In contrast, recent deep generative models can produce authentic samples in a variety of domains that are indistinguishable from samples of the data distribution. This paper introduces a Bayesian approach to source separation that uses generative models as priors… ▽ More

    Submitted 21 September, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: 20 pages; ICML camera-ready version

  5. arXiv:1911.11737  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Convolutional Composer Classification

    Authors: Harsh Verma, John Thickstun

    Abstract: This paper investigates end-to-end learnable models for attributing composers to musical scores. We introduce several pooled, convolutional architectures for this task and draw connections between our approach and classical learning approaches based on global and n-gram features. We evaluate models on a corpus of 2,500 scores from the KernScores collection, authored by a variety of composers spann… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: 8 pages, published at ISMIR 2019

  6. arXiv:1811.08045  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Coupled Recurrent Models for Polyphonic Music Composition

    Authors: John Thickstun, Zaid Harchaoui, Dean P. Foster, Sham M. Kakade

    Abstract: This paper introduces a novel recurrent model for music composition that is tailored to the structure of polyphonic music. We propose an efficient new conditional probabilistic factorization of musical scores, viewing a score as a collection of concurrent, coupled sequences: i.e. voices. To model the conditional distributions, we borrow ideas from both convolutional and recurrent neural models; we… ▽ More

    Submitted 26 November, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: 13 pages; long version of the paper appearing in ISMIR 2019

  7. arXiv:1711.04845  [pdf, other

    stat.ML cs.LG cs.SD eess.AS

    Invariances and Data Augmentation for Supervised Music Transcription

    Authors: John Thickstun, Zaid Harchaoui, Dean Foster, Sham M. Kakade

    Abstract: This paper explores a variety of models for frame-based music transcription, with an emphasis on the methods needed to reach state-of-the-art on human recordings. The translation-invariant network discussed in this paper, which combines a traditional filterbank with a convolutional neural network, was the top-performing model in the 2017 MIREX Multiple Fundamental Frequency Estimation evaluation.… ▽ More

    Submitted 13 November, 2017; originally announced November 2017.

    Comments: 6 pages

  8. arXiv:1611.09827  [pdf, other

    stat.ML cs.LG cs.SD

    Learning Features of Music from Scratch

    Authors: John Thickstun, Zaid Harchaoui, Sham Kakade

    Abstract: This paper introduces a new large-scale music dataset, MusicNet, to serve as a source of supervision and evaluation of machine learning methods for music research. MusicNet consists of hundreds of freely-licensed classical music recordings by 10 composers, written for 11 instruments, together with instrument/note annotations resulting in over 1 million temporal labels on 34 hours of chamber music… ▽ More

    Submitted 5 April, 2017; v1 submitted 29 November, 2016; originally announced November 2016.

    Comments: 14 pages; camera-ready version; updated experiments and related works; additional MIR metrics (Appendix C)