Skip to main content

Showing 1–26 of 26 results for author: Miller, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03889  [pdf, other

    cs.LG nlin.CD

    Temporal horizons in forecasting: a performance-learnability trade-off

    Authors: Pau Vilimelis Aceituno, Jack William Miller, Noah Marti, Youssef Farag, Victor Boussange

    Abstract: When training autoregressive models for dynamical systems, a critical question arises: how far into the future should the model be trained to predict? Too short a horizon may miss long-term trends, while too long a horizon can impede convergence due to accumulating prediction errors. In this work, we formalize this trade-off by analyzing how the geometry of the loss landscape depends on the traini… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 33 pages, 12 figures

  2. arXiv:2412.10351  [pdf, other

    cs.CV

    VibrantVS: A high-resolution multi-task transformer for forest canopy height estimation

    Authors: Tony Chang, Kiarie Ndegwa, Andreas Gros, Vincent A. Landau, Luke J. Zachmann, Bogdan State, Mitchell A. Gritts, Colton W. Miller, Nathan E. Rutenbeck, Scott Conway, Guy Bayes

    Abstract: This paper explores the application of a novel multi-task vision transformer (ViT) model for the estimation of canopy height models (CHMs) using 4-band National Agriculture Imagery Program (NAIP) imagery across the western United States. We compare the effectiveness of this model in terms of accuracy and precision aggregated across ecoregions and class heights versus three other benchmark peer-rev… ▽ More

    Submitted 24 January, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

    Comments: 15 pages, 12 figures

    MSC Class: I.2.10

  3. arXiv:2410.02208  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Nonparametric IPSS: Fast, flexible feature selection with false discovery control

    Authors: Omar Melikechi, David B. Dunson, Jeffrey W. Miller

    Abstract: Feature selection is a critical task in machine learning and statistics. However, existing feature selection methods either (i) rely on parametric methods such as linear or generalized linear models, (ii) lack theoretical false discovery control, or (iii) identify few true positives. Here, we introduce a general feature selection method with finite-sample false discovery control based on applying… ▽ More

    Submitted 6 May, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Journal ref: Bioinformatics (2025)

  4. arXiv:2409.02779  [pdf, other

    cs.CY cs.AI

    Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance

    Authors: Akash R. Wasil, Peter Barnett, Michael Gerovitch, Roman Hauksson, Tom Reed, Jack William Miller

    Abstract: International AI governance agreements and institutions may play an important role in reducing global security risks from advanced AI. To inform the design of such agreements and institutions, we conducted case studies of historical and contemporary international security agreements. We focused specifically on those arrangements around dual-use technologies, examining agreements in nuclear securit… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  5. arXiv:2408.16074  [pdf, other

    cs.CY cs.AI

    Verification methods for international AI agreements

    Authors: Akash R. Wasil, Tom Reed, Jack William Miller, Peter Barnett

    Abstract: What techniques can be used to verify compliance with international agreements about advanced AI development? In this paper, we examine 10 verification methods that could detect two types of potential violations: unauthorized AI training (e.g., training runs above a certain FLOP threshold) and unauthorized data centers. We divide the verification methods into three categories: (a) national technic… ▽ More

    Submitted 4 November, 2024; v1 submitted 28 August, 2024; originally announced August 2024.

  6. arXiv:2405.08784  [pdf, other

    cs.CL cs.SI

    Refinement of an Epilepsy Dictionary through Human Annotation of Health-related posts on Instagram

    Authors: Aehong Min, Xuan Wang, Rion Brattig Correia, Jordan Rozum, Wendy R. Miller, Luis M. Rocha

    Abstract: We used a dictionary built from biomedical terminology extracted from various sources such as DrugBank, MedDRA, MedlinePlus, TCMGeneDIT, to tag more than 8 million Instagram posts by users who have mentioned an epilepsy-relevant drug at least once, between 2010 and early 2016. A random sample of 1,771 posts with 2,947 term matches was evaluated by human annotators to identify false-positives. Open… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  7. arXiv:2405.05229  [pdf, other

    cs.IR cs.DL

    myAURA: Personalized health library for epilepsy management via knowledge graph sparsification and visualization

    Authors: Rion Brattig Correia, Jordan C. Rozum, Leonard Cross, Jack Felag, Michael Gallant, Ziqi Guo, Bruce W. Herr II, Aehong Min, Deborah Stungis Rocha, Xuan Wang, Katy Börner, Wendy Miller, Luis M. Rocha

    Abstract: Objective: We report the development of the patient-centered myAURA application and suite of methods designed to aid epilepsy patients, caregivers, and researchers in making decisions about care and self-management. Materials and Methods: myAURA rests on the federation of an unprecedented collection of heterogeneous data resources relevant to epilepsy, such as biomedical databases, social media,… ▽ More

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  8. arXiv:2311.02019  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Reproducible Parameter Inference Using Bagged Posteriors

    Authors: Jonathan H. Huggins, Jeffrey W. Miller

    Abstract: Under model misspecification, it is known that Bayesian posteriors often do not properly quantify uncertainty about true or pseudo-true parameters. Even more fundamentally, misspecification leads to a lack of reproducibility in the sense that the same model will yield contradictory posteriors on independent data sets from the true distribution. To define a criterion for reproducible uncertainty qu… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:1912.07104

  9. arXiv:2307.16506  [pdf, other

    hep-ph cs.LG hep-ex

    Explainable Equivariant Neural Networks for Particle Physics: PELICAN

    Authors: Alexander Bogatskiy, Timothy Hoffman, David W. Miller, Jan T. Offermann, Xiaoyang Liu

    Abstract: PELICAN is a novel permutation equivariant and Lorentz invariant or covariant aggregator network designed to overcome common limitations found in architectures applied to particle physics problems. Compared to many approaches that use non-specialized architectures that neglect underlying physics principles and require very large numbers of parameters, PELICAN employs a fundamentally symmetry group… ▽ More

    Submitted 23 February, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 52 pages, 34 figures, 12 tables

    Journal ref: J. High Energ. Phys. 2024, 113 (2024)

  10. arXiv:2212.12086  [pdf, other

    cs.LG math.DS

    Eigenvalue initialisation and regularisation for Koopman autoencoders

    Authors: Jack W. Miller, Charles O'Neill, Navid C. Constantinou, Omri Azencot

    Abstract: Regularising the parameter matrices of neural networks is ubiquitous in training deep models. Typical regularisation approaches suggest initialising weights using small random values, and to penalise weights to promote sparsity. However, these widely used techniques may be less effective in certain scenarios. Here, we study the Koopman autoencoder model which includes an encoder, a Koopman operato… ▽ More

    Submitted 25 December, 2022; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: 18 pages

  11. arXiv:2211.00454  [pdf, other

    hep-ph cs.LG hep-ex

    PELICAN: Permutation Equivariant and Lorentz Invariant or Covariant Aggregator Network for Particle Physics

    Authors: Alexander Bogatskiy, Timothy Hoffman, David W. Miller, Jan T. Offermann

    Abstract: Many current approaches to machine learning in particle physics use generic architectures that require large numbers of parameters and disregard underlying physics principles, limiting their applicability as scientific modeling tools. In this work, we present a machine learning architecture that uses a set of inputs maximally reduced with respect to the full 6-dimensional Lorentz symmetry, and is… ▽ More

    Submitted 23 December, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

  12. arXiv:2203.07620  [pdf, ps, other

    hep-ex cs.LG eess.SY physics.ins-det

    Innovations in trigger and data acquisition systems for next-generation physics facilities

    Authors: Rainer Bartoldus, Catrin Bernius, David W. Miller

    Abstract: Data-intensive physics facilities are increasingly reliant on heterogeneous and large-scale data processing and computational systems in order to collect, distribute, process, filter, and analyze the ever increasing huge volumes of data being collected. Moreover, these tasks are often performed in hard real-time or quasi real-time processing pipelines that place extreme constraints on various para… ▽ More

    Submitted 17 March, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Contribution to Snowmass 2021

  13. arXiv:2203.06153  [pdf, other

    cs.LG astro-ph.IM cs.AI hep-ex hep-ph

    Symmetry Group Equivariant Architectures for Physics

    Authors: Alexander Bogatskiy, Sanmay Ganguly, Thomas Kipf, Risi Kondor, David W. Miller, Daniel Murnane, Jan T. Offermann, Mariel Pettee, Phiala Shanahan, Chase Shimmin, Savannah Thais

    Abstract: Physical theories grounded in mathematical symmetries are an essential component of our understanding of a wide range of properties of the universe. Similarly, in the domain of machine learning, an awareness of symmetries such as rotation or permutation invariance has driven impressive performance breakthroughs in computer vision, natural language processing, and other important applications. In t… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: Contribution to Snowmass 2021

  14. arXiv:2203.01547  [pdf, other

    cs.RO

    The RATTLE Motion Planning Algorithm for Robust Online Parametric Model Improvement with On-Orbit Validation

    Authors: Keenan Albee, Monica Ekal, Brian Coltin, Rodrigo Ventura, Richard Linares, David W. Miller

    Abstract: Certain forms of uncertainty that robotic systems encounter can be explicitly learned within the context of a known model, like parametric model uncertainties such as mass and moments of inertia. Quantifying such parametric uncertainty is important for more accurate prediction of the system behavior, leading to safe and precise task execution. In tandem, providing a form of robustness guarantee ag… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: 8 pages, 11 figures, RA-L with IROS 2022 option

  15. arXiv:2201.07552  [pdf, other

    q-bio.QM cs.CY cs.SI stat.CO

    Small Cohort of Epilepsy Patients Showed Increased Activity on Facebook before Sudden Unexpected Death

    Authors: Ian B. Wood, Rion Brattig Correia, Wendy R. Miller, Luis M. Rocha

    Abstract: Sudden Unexpected Death in Epilepsy (SUDEP) remains a leading cause of death in people with epilepsy. Despite the constant risk for patients and bereavement to family members, to date the physiological mechanisms of SUDEP remain unknown. Here we explore the potential to identify putative predictive signals of SUDEP from online digital behavioral data using text and sentiment analysis. Specifically… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Submitted to Epilepsy & Behavior

    MSC Class: 62P10 (Primary) 92D50; 68U15; 92D30 (Secondary) ACM Class: J.3; I.5.4

  16. arXiv:2112.05878  [pdf, other

    cs.RO

    Online Information-Aware Motion Planning with Inertial Parameter Learning for Robotic Free-Flyers

    Authors: Monica Ekal, Keenan Albee, Brian Coltin, Rodrigo Ventura, Richard Linares, David W. Miller

    Abstract: Space free-flyers like the Astrobee robots currently operating aboard the International Space Station must operate with inherent system uncertainties. Parametric uncertainties like mass and moment of inertia are especially important to quantify in these safety-critical space systems and can change in scenarios such as on-orbit cargo movement, where unknown grappled payloads significantly change th… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: 8 pages, 8 figures, IROS 2021 preprint (accepted)

  17. arXiv:2104.06622  [pdf, other

    cs.AI cs.LG hep-ex physics.ins-det

    Towards an Interpretable Data-driven Trigger System for High-throughput Physics Facilities

    Authors: Chinmaya Mahesh, Kristin Dona, David W. Miller, Yuxin Chen

    Abstract: Data-intensive science is increasingly reliant on real-time processing capabilities and machine learning workflows, in order to filter and analyze the extreme volumes of data being collected. This is especially true at the energy and intensity frontiers of particle physics where bandwidths of raw data can exceed 100 Tb/s of heterogeneous, high-dimensional data sourced from hundreds of millions of… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: Appeared in the 3rd Workshop on Machine Learning and the Physical Sciences, NeurIPS 2020

  18. arXiv:2103.01992  [pdf, other

    cs.LG stat.AP stat.ME

    Improving Neural Networks for Time Series Forecasting using Data Augmentation and AutoML

    Authors: Indrajeet Y. Javeri, Mohammadhossein Toutiaee, Ismailcem B. Arpinar, Tom W. Miller, John A. Miller

    Abstract: Statistical methods such as the Box-Jenkins method for time-series forecasting have been prominent since their development in 1970. Many researchers rely on such models as they can be efficiently estimated and also provide interpretability. However, advances in machine learning research indicate that neural networks can be powerful data modeling techniques, as they can give higher accuracy for a p… ▽ More

    Submitted 7 May, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

  19. arXiv:2006.04780  [pdf, other

    hep-ph cs.LG hep-ex physics.comp-ph stat.ML

    Lorentz Group Equivariant Neural Network for Particle Physics

    Authors: Alexander Bogatskiy, Brandon Anderson, Jan T. Offermann, Marwah Roussi, David W. Miller, Risi Kondor

    Abstract: We present a neural network architecture that is fully equivariant with respect to transformations under the Lorentz group, a fundamental symmetry of space and time in physics. The architecture is based on the theory of the finite-dimensional representations of the Lorentz group and the equivariant nonlinearity involves the tensor product. For classification tasks in particle physics, we demonstra… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

  20. arXiv:1501.00039  [pdf

    cs.DC

    Design, Construction, and Use of a Single Board Computer Beowulf Cluster: Application of the Small-Footprint, Low-Cost, InSignal 5420 Octa Board

    Authors: James J. Cusick, William Miller, Nicholas Laurita, Tasha Pitt

    Abstract: In recent years development in the area of Single Board Computing has been advancing rapidly. At Wolters Kluwer's Corporate Legal Services Division a prototyping effort was undertaken to establish the utility of such devices for practical and general computing needs. This paper presents the background of this work, the design and construction of a 64 core 96 GHz cluster, and their possibility of y… ▽ More

    Submitted 5 January, 2015; v1 submitted 30 December, 2014; originally announced January 2015.

    Comments: 9 Figures

  21. arXiv:1310.1118  [pdf, ps, other

    cs.OH

    Evolution of choices over time: The U.S. Presidential election 2012 and the NY City Mayoral Election, 2013

    Authors: Mukkai Krishnamoorthy, Wesley Miller, Raju Krishnamoorthy

    Abstract: We conducted surveys before and after the 2012 U.S. Presidential election and prior to the NY City Mayoral election in 2013. The surveys were done using Amazon Turk. This poster describes the results of our analysis of the surveys and predicts the winner of the NY City Mayoral Election.

    Submitted 8 October, 2013; v1 submitted 3 October, 2013; originally announced October 2013.

  22. arXiv:1304.1104  [pdf

    cs.AI

    A Polynomial Time Algorithm for Finding Bayesian Probabilities from Marginal Constraints

    Authors: J. W. Miller, R. M. Goodman

    Abstract: A method of calculating probability values from a system of marginal constraints is presented. Previous systems for finding the probability of a single attribute have either made an independence assumption concerning the evidence or have required, in the worst case, time exponential in the number of attributes of the system. In this paper a closed form solution to the probability of an attribute g… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-186-193

  23. Reduced Criteria for Degree Sequences

    Authors: Jeffrey W. Miller

    Abstract: For many types of graphs, criteria have been discovered that give necessary and sufficient conditions for an integer sequence to be the degree sequence of such a graph. These criteria tend to take the form of a set of inequalities, and in the case of the Erdős-Gallai criterion (for simple undirected graphs) and the Gale-Ryser criterion (for bipartite graphs), it has been shown that the number of i… ▽ More

    Submitted 12 January, 2013; v1 submitted 11 May, 2012; originally announced May 2012.

    Journal ref: Discrete Mathematics, Volume 313, Issue 4, 28 February 2013, Pages 550-562

  24. arXiv:1104.0323  [pdf, ps, other

    stat.CO cs.DM

    Exact Enumeration and Sampling of Matrices with Specified Margins

    Authors: Jeffrey W. Miller, Matthew T. Harrison

    Abstract: We describe a dynamic programming algorithm for exact counting and exact uniform sampling of matrices with specified row and column sums. The algorithm runs in polynomial time when the column sums are bounded. Binary or non-negative integer matrices are handled. The method is distinguished by applicability to non-regular margins, tractability on large matrices, and the capacity for exact sampling.

    Submitted 2 April, 2011; originally announced April 2011.

  25. arXiv:1003.0931  [pdf, other

    physics.ed-ph cs.DL cs.IR

    A student's guide to searching the literature using online databases

    Authors: Casey W. Miller, Michelle D. Chabot, Troy C. Messina

    Abstract: A method is described to empower students to efficiently perform general and literature searches using online resources. The method was tested on undergraduate and graduate students with varying backgrounds with scientific literature. Students involved in this study showed marked improvement in their awareness of how and where to find accurate scientific information.

    Submitted 3 March, 2010; originally announced March 2010.

    Comments: 16 pages, 5 figures, and 1 table

    Journal ref: Am. J. Phys. 77(12), 1112-1117 (2009)

  26. arXiv:cmp-lg/9606023  [pdf, ps

    cs.CL

    A Robust System for Natural Spoken Dialogue

    Authors: James F. Allen, Bradford W. Miller, Eric K. Ringger, Teresa Sikorski

    Abstract: This paper describes a system that leads us to believe in the feasibility of constructing natural spoken dialogue systems in task-oriented domains. It specifically addresses the issue of robust interpretation of speech in the presence of recognition errors. Robustness is achieved by a combination of statistical error post-correction, syntactically- and semantically-driven robust parsing, and ext… ▽ More

    Submitted 18 June, 1996; originally announced June 1996.

    Comments: uuencoded, gzipped PostScript. Includes extra Appendix

    Journal ref: Proceedings of the 34th Annual Meeting of the ACL