Skip to main content

Showing 1–25 of 25 results for author: Eck, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.08738  [pdf, other

    stat.ME stat.AP

    Volatility Forecasting Using Similarity-based Parameter Correction and Aggregated Shock Information

    Authors: David P. Lundquist, Daniel J. Eck

    Abstract: We develop a procedure for forecasting the volatility of a time series immediately following a news shock. Adapting the similarity-based framework of Lin and Eck (2020), we exploit series that have experienced similar shocks. We aggregate their shock-induced excess volatilities by positing the shocks to be affine functions of exogenous covariates. The volatility shocks are modeled as random effect… ▽ More

    Submitted 6 August, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 26 pages, 7 figures, 2 tables

  2. arXiv:2207.11332  [pdf, other

    stat.AP stat.ME

    Comparing baseball players across eras via novel Full House Modeling

    Authors: Shen Yan, Adrian Burgos Jr., Christopher Kinson, Daniel J. Eck

    Abstract: A new methodological framework suitable for era-adjusting baseball statistics is developed in this article. Within this methodological framework specific models are motivated. We call these models Full House Models. Full House Models work by balancing the achievements of Major League Baseball (MLB) players within a given season and the size of the MLB talent pool from which a player came. We demon… ▽ More

    Submitted 24 April, 2024; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Results and additional supplements can be accessed on our website: https://eckeraadjustment.web.illinois.edu/

  3. arXiv:2110.15189  [pdf, other

    stat.ME stat.AP

    Robust model-based estimation for binary outcomes in genomics studies

    Authors: Suyoung Park, Alexander E. Lipka, Daniel J. Eck

    Abstract: In quantitative genetics, statistical modeling techniques are used to facilitate advances in the understanding of which genes underlie agronomically important traits and have enabled the use of genome-wide markers to accelerate genetic gain. The logistic regression model is a statistically optimal approach for quantitative genetics analysis of binary traits. To encourage more widespread use of the… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  4. arXiv:2101.06755  [pdf

    stat.AP

    Do Most Students Need In-Person Lectures? A Study of a Large Statistics Class

    Authors: Ellen S. Fireman, Zachary S. Donnini, Michael B. Weissman, Daniel J. Eck

    Abstract: Over 1100 students over four semesters were given the option of taking an introductory undergraduate statistics class either by in-person attendance in lectures or by taking exactly the same class (same instructor, recorded lectures, homework, blind grading, website, etc.) without the in-person lectures. Roughly equal numbers of students chose each option. The online lectures were available to all… ▽ More

    Submitted 7 April, 2023; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: Supplementary materials are available upon request

  5. arXiv:2010.00581  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Emergent Social Learning via Multi-agent Reinforcement Learning

    Authors: Kamal Ndousse, Douglas Eck, Sergey Levine, Natasha Jaques

    Abstract: Social learning is a key component of human and animal intelligence. By taking cues from the behavior of experts in their environment, social learners can acquire sophisticated behavior and rapidly adapt to new circumstances. This paper investigates whether independent reinforcement learning (RL) agents in a multi-agent environment can learn to use social learning to improve their performance. We… ▽ More

    Submitted 22 June, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: 14 pages, 19 figures. To be published in ICML 2021

  6. arXiv:2008.11756  [pdf, other

    stat.ME stat.AP

    Minimizing post-shock forecasting error through aggregation of outside information

    Authors: Jilei Lin, Daniel J. Eck

    Abstract: We develop a forecasting methodology for providing credible forecasts for time series that have recently undergone a shock. We achieve this by borrowing knowledge from other time series that have undergone similar shocks for which post-shock outcomes are observed. Three shock effect estimators are motivated with the aim of minimizing average forecast risk. We propose risk-reduction propositions th… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

  7. arXiv:2005.07742  [pdf, other

    stat.AP stat.ME

    SEAM methodology for context-rich player matchup evaluations

    Authors: Julia Wapner, David Dalpiaz, Daniel J. Eck

    Abstract: We develop the SEAM (synthetic estimated average matchup) method for describing batter versus pitcher matchups in baseball. We first estimate the distribution of balls put into play by a batter facing a pitcher, called the empirical spray chart distribution. Many individual matchups have a sample size that is too small to be reliable for use in predicting future outcomes. Synthetic versions of the… ▽ More

    Submitted 20 August, 2022; v1 submitted 15 May, 2020; originally announced May 2020.

  8. arXiv:2002.01003  [pdf, other

    stat.ME

    General model-free weighted envelope estimation

    Authors: Daniel J. Eck

    Abstract: Envelope methodology is succinctly pitched as a class of procedures for increasing efficiency in multivariate analyses without altering traditional objectives \citep[first sentence of page 1]{cook2018introduction}. This description is true with the additional caveat that the efficiency gains obtained by envelope methodology are mitigated by model selection volatility to an unknown degree. The bulk… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

  9. arXiv:1905.06118  [pdf, other

    cs.SD cs.LG cs.MM eess.AS stat.ML

    Learning to Groove with Inverse Sequence Transformations

    Authors: Jon Gillick, Adam Roberts, Jesse Engel, Douglas Eck, David Bamman

    Abstract: We explore models for translating abstract musical ideas (scores, rhythms) into expressive performances using Seq2Seq and recurrent Variational Information Bottleneck (VIB) models. Though Seq2Seq models usually require painstakingly aligned corpora, we show that it is possible to adapt an approach from the Generative Adversarial Network (GAN) literature (e.g. Pix2Pix (Isola et al., 2017) and Vid2V… ▽ More

    Submitted 26 July, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: Blog post and links: https://g.co/magenta/groovae

    ACM Class: J.5; I.2

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97:2269-2279, 2019

  10. arXiv:1905.03657  [pdf, other

    stat.ME math.ST stat.AP

    Efficient and minimal length parametric conformal prediction regions

    Authors: Daniel J. Eck, Forrest W. Crawford

    Abstract: Conformal prediction methods construct prediction regions for iid data that are valid in finite samples. We provide two parametric conformal prediction regions that are applicable for a wide class of continuous statistical models. This class of statistical models includes generalized linear models (GLMs) with continuous outcomes. Our parametric conformal prediction regions possesses finite sample… ▽ More

    Submitted 25 October, 2019; v1 submitted 9 May, 2019; originally announced May 2019.

  11. arXiv:1904.02632  [pdf, other

    cs.CV cs.LG stat.ML

    A Learned Representation for Scalable Vector Graphics

    Authors: Raphael Gontijo Lopes, David Ha, Douglas Eck, Jonathon Shlens

    Abstract: Dramatic advances in generative models have resulted in near photographic quality for artificially rendered faces, animals and other objects in the natural world. In spite of such advances, a higher level understanding of vision and imagery does not arise from exhaustively modeling an object, but instead identifying higher-level attributes that best summarize the aspects of an object. In this work… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

  12. arXiv:1903.07227  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Counterpoint by Convolution

    Authors: Cheng-Zhi Anna Huang, Tim Cooijmans, Adam Roberts, Aaron Courville, Douglas Eck

    Abstract: Machine learning models of music typically break up the task of composition into a chronological process, composing a piece of music in a single pass from beginning to end. On the contrary, human composers write music in a nonlinear fashion, scribbling motifs here and there, often revisiting choices previously made. In order to better approximate this process, we train a convolutional neural netwo… ▽ More

    Submitted 17 March, 2019; originally announced March 2019.

    Comments: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017

    ACM Class: H.5.5; I.2

  13. arXiv:1810.12247  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset

    Authors: Curtis Hawthorne, Andriy Stasyuk, Adam Roberts, Ian Simon, Cheng-Zhi Anna Huang, Sander Dieleman, Erich Elsen, Jesse Engel, Douglas Eck

    Abstract: Generating musical audio directly with neural networks is notoriously difficult because it requires coherently modeling structure at many different timescales. Fortunately, most music is also highly structured and can be represented as discrete note events played on musical instruments. Herein, we show that by using notes as an intermediate representation, we can train a suite of models capable of… ▽ More

    Submitted 17 January, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Examples available at https://goo.gl/magenta/maestro-examples

  14. arXiv:1810.08029  [pdf, ps, other

    stat.AP

    Challenging nostalgia and performance metrics in baseball

    Authors: Daniel J. Eck

    Abstract: We show that the great baseball players that started their careers before 1950 are overrepresented among rankings of baseball's all time greatest players. The year 1950 coincides with the decennial US Census that is closest to when Major League Baseball (MLB) was integrated in 1947. We also show that performance metrics used to compare players have substantial era biases that favor players who sta… ▽ More

    Submitted 17 June, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

    Comments: Accepted at Chance

  15. arXiv:1809.04281  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Music Transformer

    Authors: Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Noam Shazeer, Ian Simon, Curtis Hawthorne, Andrew M. Dai, Matthew D. Hoffman, Monica Dinculescu, Douglas Eck

    Abstract: Music relies heavily on repetition to build structure and meaning. Self-reference occurs on multiple timescales, from motifs to phrases to reusing of entire sections of music, such as in pieces with ABA structure. The Transformer (Vaswani et al., 2017), a sequence model based on self-attention, has achieved compelling results in many generation tasks that require maintaining long-range coherence.… ▽ More

    Submitted 12 December, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: Improved skewing section and accompanying figures. Previous titles are "An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation" and "Music Transformer"

  16. arXiv:1808.05593  [pdf, other

    stat.AP math.ST q-bio.PE

    Randomization for the susceptibility effect of an infectious disease intervention

    Authors: Daniel J. Eck, Olga Morozova, Forrest W. Crawford

    Abstract: Randomized trials of infectious disease interventions, such as vaccines, often focus on groups of connected or potentially interacting individuals. When the pathogen of interest is transmissible between study subjects, interference may occur: individual infection outcomes may depend on treatments received by others. Epidemiologists have defined the primary causal effect of interest -- called the "… ▽ More

    Submitted 9 December, 2019; v1 submitted 16 August, 2018; originally announced August 2018.

  17. arXiv:1808.04753  [pdf, other

    math.ST stat.AP stat.ME

    Estimating the size of a hidden finite set: large-sample behavior of estimators

    Authors: Si Cheng, Daniel J. Eck, Forrest W. Crawford

    Abstract: A finite set is "hidden" if its elements are not directly enumerable or if its size cannot be ascertained via a deterministic query. In public health, epidemiology, demography, ecology and intelligence analysis, researchers have developed a wide variety of indirect statistical approaches, under different models for sampling and observation, for estimating the size of a hidden set. Some methods mak… ▽ More

    Submitted 15 October, 2019; v1 submitted 14 August, 2018; originally announced August 2018.

  18. arXiv:1806.00195  [pdf, other

    stat.ML cs.LG cs.SD eess.AS

    Learning a Latent Space of Multitrack Measures

    Authors: Ian Simon, Adam Roberts, Colin Raffel, Jesse Engel, Curtis Hawthorne, Douglas Eck

    Abstract: Discovering and exploring the underlying structure of multi-instrumental music using learning-based approaches remains an open problem. We extend the recent MusicVAE model to represent multitrack polyphonic measures as vectors in a latent space. Our approach enables several useful operations such as generating plausible measures from scratch, interpolating between measures in a musically meaningfu… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  19. arXiv:1803.05428  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music

    Authors: Adam Roberts, Jesse Engel, Colin Raffel, Curtis Hawthorne, Douglas Eck

    Abstract: The Variational Autoencoder (VAE) has proven to be an effective model for producing semantically meaningful latent representations for natural data. However, it has thus far seen limited application to sequential data, and, as we demonstrate, existing recurrent VAE models have difficulty modeling sequences with long-term structure. To address this issue, we propose the use of a hierarchical decode… ▽ More

    Submitted 11 November, 2019; v1 submitted 13 March, 2018; originally announced March 2018.

    Comments: ICML Camera Ready Version (w/ fixed typos)

    Journal ref: ICML 2018

  20. arXiv:1710.11153  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Onsets and Frames: Dual-Objective Piano Transcription

    Authors: Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse Engel, Sageev Oore, Douglas Eck

    Abstract: We advance the state of the art in polyphonic piano music transcription by using a deep convolutional and recurrent neural network which is trained to jointly predict onsets and frames. Our model predicts pitch onset events and then uses those predictions to condition framewise pitch predictions. During inference, we restrict the predictions from the framewise detector by not allowing a new note t… ▽ More

    Submitted 5 June, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: Examples available at https://goo.gl/magenta/onsets-frames-examples

  21. arXiv:1708.01481  [pdf, other

    stat.AP stat.ME

    Multivariate Design of Experiments for Engineering Dimensional Analysis

    Authors: Daniel J. Eck, Christopher J. Nachtsheim, R. Dennis Cook, Thomas A. Albrecht

    Abstract: We consider the design of dimensional analysis experiments when there is more than a single response. We first give a brief overview of dimensional analysis experiments and the dimensional analysis (DA) procedure. The validity of the DA method for univariate responses was established by the Buckingham $Π$-Theorem in the early 20th century. We extend the theorem to the multivariate case, develop ba… ▽ More

    Submitted 7 August, 2018; v1 submitted 4 August, 2017; originally announced August 2017.

  22. arXiv:1704.07040  [pdf, ps, other

    math.ST stat.ME

    Bootstrapping for multivariate linear regression models

    Authors: Daniel J. Eck

    Abstract: The multivariate linear regression model is an important tool for investigating relationships between several response variables and several predictor variables. The primary interest is in inference about the unknown regression coefficient matrix. We propose multivariate bootstrap techniques as a means for making inferences about the unknown regression coefficient matrix. These bootstrapping techn… ▽ More

    Submitted 12 September, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

  23. arXiv:1704.03477  [pdf, other

    cs.NE cs.LG stat.ML

    A Neural Representation of Sketch Drawings

    Authors: David Ha, Douglas Eck

    Abstract: We present sketch-rnn, a recurrent neural network (RNN) able to construct stroke-based drawings of common objects. The model is trained on thousands of crude human-drawn images representing hundreds of classes. We outline a framework for conditional and unconditional sketch generation, and describe new robust training methods for generating coherent sketch drawings in a vector format.

    Submitted 19 May, 2017; v1 submitted 11 April, 2017; originally announced April 2017.

  24. arXiv:1701.07910  [pdf, ps, other

    stat.AP stat.ME

    Combining Envelope Methodology and Aster Models for Variance Reduction in Life History Analyses

    Authors: Daniel J. Eck, Charles J. Geyer, R. Dennis Cook

    Abstract: Precise estimation of expected Darwinian fitness, the expected lifetime number of offspring of organism, is a central component of life history analysis. The aster model serves as a defensible statistical model for distributions of Darwinian fitness. The aster model is equipped to incorporate the major life stages an organism travels through which separately may effect Darwinian fitness. Envelope… ▽ More

    Submitted 27 February, 2018; v1 submitted 26 January, 2017; originally announced January 2017.

    Comments: Title changed from "An Application of Envelope Methodology and Aster Models" to "Combining Envelope Methodology and Aster Models for Variance Reduction in Life History Analyses"

  25. arXiv:1701.00856  [pdf, ps, other

    stat.ME

    Weighted envelope estimation to handle variability in model selection

    Authors: Daniel J. Eck, R. Dennis Cook

    Abstract: Envelope methodology can provide substantial efficiency gains in multivariate statistical problems, but in some applications the estimation of the envelope dimension can induce selection volatility that may mitigate those gains. Current envelope methodology does not account for the added variance that can result from this selection. In this article, we circumvent dimension selection volatility thr… ▽ More

    Submitted 14 April, 2017; v1 submitted 3 January, 2017; originally announced January 2017.