Skip to main content

Showing 1–16 of 16 results for author: Miller, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.00640  [pdf, other

    stat.AP cs.CL

    Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

    Authors: Evan Miller

    Abstract: Evaluations are critical for understanding the capabilities of large language models (LLMs). Fundamentally, evaluations are experiments; but the literature on evaluations has largely ignored the literature from other sciences on experiment analysis and planning. This article shows researchers with some training in statistics how to think about and analyze data from language model evaluations. Conc… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: 14 pages

  2. arXiv:2401.14973  [pdf, other

    stat.ML cs.LG

    Discovering group dynamics in coordinated time series via hierarchical recurrent switching-state models

    Authors: Michael T. Wojnowicz, Kaitlin Gili, Preetish Rath, Eric Miller, Jeffrey Miller, Clifford Hancock, Meghan O'Donovan, Seth Elkin-Frankston, Tad T. BrunyƩ, Michael C. Hughes

    Abstract: We seek a computationally efficient model for a collection of time series arising from multiple interacting entities (a.k.a. "agents"). Recent models of spatiotemporal patterns across individuals fail to incorporate explicit system-level collective behavior that can influence the trajectories of individual entities. To address this gap in the literature, we present a new hierarchical switching-sta… ▽ More

    Submitted 2 December, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  3. arXiv:2401.10233  [pdf, other

    stat.ME

    Likelihood-ratio inference on differences in quantiles

    Authors: Evan Miller

    Abstract: Quantiles can represent key operational and business metrics, but the computational challenges associated with inference has hampered their adoption in online experimentation. One-sample confidence intervals are trivial to construct; however, two-sample inference has traditionally required bootstrapping or a density estimator. This paper presents a new two-sample difference-in-quantile hypothesis… ▽ More

    Submitted 31 July, 2024; v1 submitted 15 September, 2023; originally announced January 2024.

    Comments: 6 pages, 2 figures; corrected typos, clarified equations in the two-step algorithm, updated author affiliation

  4. arXiv:2206.00093  [pdf, other

    stat.ML cs.LG stat.CO

    Easy Variational Inference for Categorical Models via an Independent Binary Approximation

    Authors: Michael T. Wojnowicz, Shuchin Aeron, Eric L. Miller, Michael C. Hughes

    Abstract: We pursue tractable Bayesian analysis of generalized linear models (GLMs) for categorical data. Thus far, GLMs are difficult to scale to more than a few dozen categories due to non-conjugacy or strong posterior dependencies when using conjugate auxiliary variable methods. We define a new class of GLMs for categorical data called categorical-from-binary (CB) models. Each CB model has a likelihood t… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: to appear at ICML 2022

  5. arXiv:2110.06741  [pdf, other

    cs.LG stat.ML

    Dynamical Wasserstein Barycenters for Time-series Modeling

    Authors: Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Eric L. Miller

    Abstract: Many time series can be modeled as a sequence of segments representing high-level discrete states, such as running and walking in a human activity application. Flexible models should describe the system state and observations in stationary "pure-state" periods as well as transition periods between adjacent segments, such as a gradual slowdown between running and walking. However, most prior work a… ▽ More

    Submitted 31 October, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: To appear at Neurips 2021

  6. arXiv:2104.07651  [pdf

    cs.MS cs.LG q-bio.QM stat.ML

    mlf-core: a framework for deterministic machine learning

    Authors: Lukas Heumos, Philipp Ehmele, Luis Kuhn Cuellar, Kevin Menden, Edmund Miller, Steffen Lemke, Gisela Gabernet, Sven Nahnsen

    Abstract: Machine learning has shown extensive growth in recent years and is now routinely applied to sensitive areas. To allow appropriate verification of predictive models before deployment, models must be deterministic. However, major machine learning libraries default to the usage of non-deterministic algorithms based on atomic operations. Solely fixing all random seeds is not sufficient for determinist… ▽ More

    Submitted 16 June, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: https://mlf-core.com and https://github.com/mlf-core/mlf-core

  7. arXiv:2010.04196  [pdf, other

    cs.LG stat.ML

    A Fully Tensorized Recurrent Neural Network

    Authors: Charles C. Onu, Jacob E. Miller, Doina Precup

    Abstract: Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large RNNs in resource-limited settings, while also introducing complications in hyperparameter selection and training. To address these issues, we introduce a "fully te… ▽ More

    Submitted 10 November, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

  8. arXiv:2006.05539  [pdf, other

    eess.SP math.ST stat.ML

    On Matched Filtering for Statistical Change Point Detection

    Authors: Kevin C. Cheng, Eric L. Miller, Michael C. Hughes, Shuchin Aeron

    Abstract: Non-parametric and distribution-free two-sample tests have been the foundation of many change point detection algorithms. However, randomness in the test statistic as a function of time makes them susceptible to false positives and localization ambiguity. We address these issues by deriving and applying filters matched to the expected temporal signatures of a change for various sliding window, two… ▽ More

    Submitted 27 October, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  9. arXiv:2004.13775  [pdf, other

    stat.ME stat.AP

    Estimation of ascertainment bias and its effect on power in clinical trials with time-to-event outcomes

    Authors: E. J. Greene, P. Peduzzi, J. Dziura, C. Meng, M. E. Miller, T. G. Travison, D. Esserman

    Abstract: While the gold standard for clinical trials is to blind all parties -- participants, researchers, and evaluators -- to treatment assignment, this is not always a possibility. When some or all of the above individuals know the treatment assignment, this leaves the study open to the introduction of post-randomization biases. In the Strategies to Reduce Injuries and Develop Confidence in Elders (STRI… ▽ More

    Submitted 2 October, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: 31 pages, 11 figures; submitted to Statistics in Medicine

  10. arXiv:1910.11044  [pdf, ps, other

    stat.ME stat.ML

    Torus Graphs for Multivariate Phase Coupling Analysis

    Authors: Natalie Klein, Josue Orellana, Scott Brincat, Earl K. Miller, Robert E. Kass

    Abstract: Angular measurements are often modeled as circular random variables, where there are natural circular analogues of moments, including correlation. Because a product of circles is a torus, a d-dimensional vector of circular random variables lies on a d-dimensional torus. For such vectors we present here a class of graphical models, which we call torus graphs, based on the full exponential family wi… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: N.K. and J.O. contributed equally to this work. Peer reviewed version, in press at The Annals of Applied Statistics. 10 main Figures, supplementary text appended with 11 supplementary figures

  11. arXiv:1810.00045  [pdf, other

    cs.LG q-bio.NC stat.ML

    Adversarial Domain Adaptation for Stable Brain-Machine Interfaces

    Authors: Ali Farshchian, Juan A. Gallego, Joseph P. Cohen, Yoshua Bengio, Lee E. Miller, Sara A. Solla

    Abstract: Brain-Machine Interfaces (BMIs) have recently emerged as a clinically viable option to restore voluntary movements after paralysis. These devices are based on the ability to extract information about movement intent from neural signals recorded using multi-electrode arrays chronically implanted in the motor cortices of the brain. However, the inherent loss and turnover of recorded neurons requires… ▽ More

    Submitted 15 January, 2019; v1 submitted 28 September, 2018; originally announced October 2018.

    Comments: 14 pages, 6 figures

  12. arXiv:1709.07903  [pdf, other

    stat.ML cs.LG

    Ensemble Multi-task Gaussian Process Regression with Multiple Latent Processes

    Authors: Weitong Ruan, Eric L. Miller

    Abstract: Multi-task/Multi-output learning seeks to exploit correlation among tasks to enhance performance over learning or solving each task independently. In this paper, we investigate this problem in the context of Gaussian Processes (GPs) and propose a new model which learns a mixture of latent processes by decomposing the covariance matrix into a sum of structured hidden components each of which is con… ▽ More

    Submitted 9 May, 2018; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: main body: 9 pages, supplemental material: 7 pages. This version corrected a few typos in previous version

  13. arXiv:1708.00909  [pdf

    q-bio.NC cs.LG stat.ML

    Machine learning for neural decoding

    Authors: Joshua I. Glaser, Ari S. Benjamin, Raeed H. Chowdhury, Matthew G. Perich, Lee E. Miller, Konrad P. Kording

    Abstract: Despite rapid advances in machine learning tools, the majority of neural decoding approaches still use traditional methods. Modern machine learning tools, which are versatile and easy to use, have the potential to significantly improve decoding performance. This tutorial describes how to effectively apply these algorithms for typical decoding problems. We provide descriptions, best practices, and… ▽ More

    Submitted 3 July, 2020; v1 submitted 2 August, 2017; originally announced August 2017.

  14. arXiv:1411.6652  [pdf, other

    stat.AP

    Persistent homology analysis of brain artery trees

    Authors: Paul Bendich, J. S. Marron, Ezra Miller, Alex Pieloch, Sean Skwerer

    Abstract: New representations of tree-structured data objects, using ideas from topological data analysis, enable improved statistical analyses of a population of brain artery trees. A number of representations of each data tree arise from persistence diagrams that quantify branching and looping of vessels at multiple scales. Novel approaches to the statistical analysis, through various summaries of the per… ▽ More

    Submitted 24 November, 2014; originally announced November 2014.

  15. arXiv:1411.0007  [pdf

    cs.CL cs.LG stat.ML

    Rapid Adaptation of POS Tagging for Domain Specific Uses

    Authors: John E. Miller, Michael Bloodgood, Manabu Torii, K. Vijay-Shanker

    Abstract: Part-of-speech (POS) tagging is a fundamental component for performing natural language tasks such as parsing, information extraction, and question answering. When POS taggers are trained in one domain and applied in significantly different domains, their performance can degrade dramatically. We present a methodology for rapid adaptation of POS taggers to new domains. Our technique is unsupervised… ▽ More

    Submitted 31 October, 2014; originally announced November 2014.

    Comments: 2 pages, 2 tables; appeared in Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology, June 2006

    ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

    Journal ref: In Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology, pages 118-119, New York, New York, June 2006. Association for Computational Linguistics

  16. arXiv:1305.2170  [pdf, other

    physics.geo-ph cs.IT stat.AP

    Exploiting Structural Complexity for Robust and Rapid Hyperspectral Imaging

    Authors: Gregory Ely, Shuchin Aeron, Eric L. Miller

    Abstract: This paper presents several strategies for spectral de-noising of hyperspectral images and hypercube reconstruction from a limited number of tomographic measurements. In particular we show that the non-noisy spectral data, when stacked across the spectral dimension, exhibits low-rank. On the other hand, under the same representation, the spectral noise exhibits a banded structure. Motivated by thi… ▽ More

    Submitted 9 May, 2013; originally announced May 2013.