Skip to main content

Showing 1–50 of 167 results for author: Miller, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.02554  [pdf, ps, other

    cs.AI cs.LG

    AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

    Authors: Edan Toledo, Karen Hambardzumyan, Martin Josifoski, Rishi Hazra, Nicolas Baldwin, Alexis Audran-Reiss, Michael Kuchnik, Despoina Magka, Minqi Jiang, Alisia Maria Lupidi, Andrei Lupu, Roberta Raileanu, Kelvin Niu, Tatiana Shavrina, Jean-Christophe Gagnon-Audet, Michael Shvartsman, Shagun Sodhani, Alexander H. Miller, Abhishek Charnalia, Derek Dunfield, Carole-Jean Wu, Pontus Stenetorp, Nicola Cancedda, Jakob Nicolaus Foerster, Yoram Bachrach

    Abstract: AI research agents are demonstrating great potential to accelerate scientific progress by automating the design, implementation, and training of machine learning models. We focus on methods for improving agents' performance on MLE-bench, a challenging benchmark where agents compete in Kaggle competitions to solve real-world machine learning problems. We formalize AI research agents as search polic… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: Code: https://github.com/facebookresearch/aira-dojo

  2. arXiv:2507.01780  [pdf, ps, other

    cs.LO cs.PL

    LeanLTL: A unifying framework for linear temporal logics in Lean

    Authors: Eric Vin, Kyle A. Miller, Daniel J. Fremont

    Abstract: We propose LeanLTL, a unifying framework for linear temporal logics in Lean 4. LeanLTL supports reasoning about traces that represent either infinite or finite linear time. The library allows traditional LTL syntax to be combined with arbitrary Lean expressions, making it straightforward to define properties involving numerical or other types. We prove that standard flavors of LTL can be embedded… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 9 pages, 3 figures; for associated project files see https://github.com/UCSCFormalMethods/LeanLTL; to be published in LIPIcs for ITP '25

    ACM Class: F.3.1; F.4.1; F.3.3

  3. arXiv:2506.22419  [pdf, ps, other

    cs.AI cs.CL cs.LG

    The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements

    Authors: Bingchen Zhao, Despoina Magka, Minqi Jiang, Xian Li, Roberta Raileanu, Tatiana Shavrina, Jean-Christophe Gagnon-Audet, Kelvin Niu, Shagun Sodhani, Michael Shvartsman, Andrei Lupu, Alisia Lupidi, Edan Toledo, Karen Hambardzumyan, Martin Josifoski, Thomas Foster, Lucia Cipolina-Kun, Abhishek Charnalia, Derek Dunfield, Alexander H. Miller, Oisin Mac Aodha, Jakob Foerster, Yoram Bachrach

    Abstract: Rapid advancements in large language models (LLMs) have the potential to assist in scientific progress. A critical capability toward this endeavor is the ability to reproduce existing work. To evaluate the ability of AI agents to reproduce results in an active research area, we introduce the Automated LLM Speedrunning Benchmark, leveraging the research community contributions on the NanoGPT speedr… ▽ More

    Submitted 30 June, 2025; v1 submitted 27 June, 2025; originally announced June 2025.

  4. arXiv:2506.14964  [pdf, ps, other

    cs.CR

    Narrowing the Gap between TEEs Threat Model and Deployment Strategies

    Authors: Filip Rezabek, Jonathan Passerat-Palmbach, Moe Mahhouk, Frieder Erdmann, Andrew Miller

    Abstract: Confidential Virtual Machines (CVMs) provide isolation guarantees for data in use, but their threat model does not include physical level protection and side-channel attacks. Therefore, current deployments rely on trusted cloud providers to host the CVMs' underlying infrastructure. However, TEE attestations do not provide information about the operator hosting a CVM. Without knowing whether a Trus… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  5. arXiv:2506.00076  [pdf

    cs.CY cs.AI cs.CL cs.LG

    Optimizing Storytelling, Improving Audience Retention, and Reducing Waste in the Entertainment Industry

    Authors: Andrew Cornfeld, Ashley Miller, Mercedes Mora-Figueroa, Kurt Samuels, Anthony Palomba

    Abstract: Television networks face high financial risk when making programming decisions, often relying on limited historical data to forecast episodic viewership. This study introduces a machine learning framework that integrates natural language processing (NLP) features from over 25000 television episodes with traditional viewership data to enhance predictive accuracy. By extracting emotional tone, cogni… ▽ More

    Submitted 29 May, 2025; originally announced June 2025.

  6. arXiv:2504.17857  [pdf, ps, other

    cs.LG cs.RO

    High-Performance Reinforcement Learning on Spot: Optimizing Simulation Parameters with Distributional Measures

    Authors: AJ Miller, Fangzhou Yu, Michael Brauckmann, Farbod Farshidian

    Abstract: This work presents an overview of the technical details behind a high performance reinforcement learning policy deployment with the Spot RL Researcher Development Kit for low level motor access on Boston Dynamics Spot. This represents the first public demonstration of an end to end end reinforcement learning policy deployed on Spot hardware with training code publicly available through Nvidia Isaa… ▽ More

    Submitted 3 July, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  7. arXiv:2504.09522  [pdf, other

    cs.CL cs.AI

    How new data permeates LLM knowledge and how to dilute it

    Authors: Chen Sun, Renat Aksitov, Andrey Zhmoginov, Nolan Andrew Miller, Max Vladymyrov, Ulrich Rueckert, Been Kim, Mark Sandler

    Abstract: Large language models learn and continually learn through the accumulation of gradient-based updates, but how individual pieces of new information affect existing knowledge, leading to both beneficial generalization and problematic hallucination, remains poorly understood. We demonstrate that when learning new information, LLMs exhibit a "priming" effect: learning a new fact can cause the model to… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  8. arXiv:2502.15018  [pdf, other

    cs.CL

    Using tournaments to calculate AUROC for zero-shot classification with LLMs

    Authors: Wonjin Yoon, Ian Bulovic, Timothy A. Miller

    Abstract: Large language models perform surprisingly well on many zero-shot classification tasks, but are difficult to fairly compare to supervised classifiers due to the lack of a modifiable decision boundary. In this work, we propose and evaluate a method that converts binary classification tasks into pairwise comparison tasks, obtaining relative rankings from LLMs. Repeated pairwise comparisons can be us… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  9. arXiv:2502.07924  [pdf, ps, other

    econ.TH cs.AI

    NDAI Agreements

    Authors: Matthew Stephenson, Andrew Miller, Xyn Sun, Bhargav Annem, Rohan Parikh

    Abstract: We study a fundamental challenge in the economics of innovation: an inventor must reveal details of a new idea to secure compensation or funding, yet such disclosure risks expropriation. We present a model in which a seller (inventor) and buyer (investor) bargain over an information good under the threat of hold-up. In the classical setting, the seller withholds disclosure to avoid misappropriatio… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 21 pages, 1 figure

  10. arXiv:2501.00578  [pdf, ps, other

    econ.TH cs.LO

    The Limits of Tolerance

    Authors: Alan D. Miller

    Abstract: I propose a model of aggregation of intervals relevant to the study of legal standards of tolerance. Seven axioms: responsiveness, anonymity, continuity, strategyproofness, and three variants of neutrality are then used to prove several important results about a new class of aggregation methods called endpoint rules. The class of endpoint rules includes extreme tolerance (allowing anything permitt… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

  11. arXiv:2412.17542  [pdf, other

    cs.LG cs.CE physics.bio-ph

    Leveraging Cardiovascular Simulations for In-Vivo Prediction of Cardiac Biomarkers

    Authors: Laura Manduchi, Antoine Wehenkel, Jens Behrmann, Luca Pegolotti, Andy C. Miller, Ozan Sener, Marco Cuturi, Guillermo Sapiro, Jörn-Henrik Jacobsen

    Abstract: Whole-body hemodynamics simulators, which model blood flow and pressure waveforms as functions of physiological parameters, are now essential tools for studying cardiovascular systems. However, solving the corresponding inverse problem of mapping observations (e.g., arterial pressure waveforms at specific locations in the arterial network) back to plausible physiological parameters remains challen… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

  12. arXiv:2412.11276  [pdf, other

    cs.LG cs.AI eess.SP

    Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

    Authors: Salar Abbaspourazad, Anshuman Mishra, Joseph Futoma, Andrew C. Miller, Ian Shapiro

    Abstract: Modern wearable devices can conveniently record various biosignals in the many different environments of daily living, enabling a rich view of individual health. However, not all biosignals are the same: high-fidelity biosignals, such as photoplethysmogram (PPG), contain more physiological information, but require optical sensors with a high power footprint. Alternatively, a lower-fidelity biosign… ▽ More

    Submitted 31 January, 2025; v1 submitted 15 December, 2024; originally announced December 2024.

    Comments: updated format

  13. arXiv:2411.11510  [pdf, ps, other

    cs.RO cs.AI cs.ET eess.SY

    Closed-loop multi-step planning with innate physics knowledge

    Authors: Giulia Lafratta, Bernd Porr, Christopher Chandler, Alice Miller

    Abstract: We present a hierarchical framework to solve robot planning as an input control problem. At the lowest level are temporary closed control loops, ("tasks"), each representing a behaviour, contingent on a specific sensory input and therefore temporary. At the highest level, a supervising "Configurator" directs task creation and termination. Here resides "core" knowledge as a physics engine, where se… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  14. arXiv:2411.04962  [pdf, other

    cs.AI cs.CL

    Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability

    Authors: Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A Miller, Danielle Bitterman, Guanhua Chen, Anoop Mayampurath, Matthew Churpek, Majid Afshar

    Abstract: Large language models (LLMs) are being explored for diagnostic decision support, yet their ability to estimate pre-test probabilities, vital for clinical decision-making, remains limited. This study evaluates two LLMs, Mistral-7B and Llama3-70B, using structured electronic health record data on three diagnosis tasks. We examined three current methods of extracting LLM probability estimations and r… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Accepted to GenAI4Health Workshop at NeurIPS 2024

  15. arXiv:2411.03887  [pdf, ps, other

    cs.AI cs.CR

    Reclaiming "Open AI" -- AI Model Serving Can Be Open Access, Yet Monetizable and Loyal

    Authors: Zerui Cheng, Edoardo Contente, Ben Finch, Oleg Golev, Jonathan Hayase, Andrew Miller, Niusha Moshrefi, Anshul Nasery, Sandeep Nailwal, Sewoong Oh, Himanshu Tyagi, Pramod Viswanath

    Abstract: The rapid rise of AI has split model serving between open-weight distribution, which often lacks owner control and monetization, and opaque API-based approaches that risk user privacy and model transparency, forming a dichotomy that hinders an equitable AI ecosystem. This position paper introduces, rigorously formulates, and champions the Open-access, Monetizable, and Loyal (OML) paradigm for AI m… ▽ More

    Submitted 3 June, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: 54 pages

  16. arXiv:2410.21750  [pdf, other

    cs.CL cs.AI

    Learning and Unlearning of Fabricated Knowledge in Language Models

    Authors: Chen Sun, Nolan Andrew Miller, Andrey Zhmoginov, Max Vladymyrov, Mark Sandler

    Abstract: What happens when a new piece of knowledge is introduced into the training data and how long does it last while a large language model (LM) continues to train? We investigate this question by injecting facts into LMs from a new probing dataset, "Outlandish", which is designed to permit the testing of a spectrum of different fact types. When studying how robust these memories are, there appears to… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

    Journal ref: ICML 2024 Workshop on Mechanistic Interpretability

  17. arXiv:2410.19575  [pdf, other

    stat.ML cs.LG

    Considerations for Distribution Shift Robustness of Diagnostic Models in Healthcare

    Authors: Arno Blaas, Adam Goliński, Andrew Miller, Luca Zappella, Jörn-Henrik Jacobsen, Christina Heinze-Deml

    Abstract: We consider robustness to distribution shifts in the context of diagnostic models in healthcare, where the prediction target $Y$, e.g., the presence of a disease, is causally upstream of the observations $X$, e.g., a biomarker. Distribution shifts may occur, for instance, when the training data is collected in a domain with patients having particular demographic characteristics while the model is… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  18. arXiv:2410.14516  [pdf, other

    cs.AI cs.CL

    Do LLMs "know" internally when they follow instructions?

    Authors: Juyeon Heo, Christina Heinze-Deml, Oussama Elachqar, Kwan Ho Ryan Chan, Shirley Ren, Udhay Nallasamy, Andy Miller, Jaya Narain

    Abstract: Instruction-following is crucial for building AI agents with large language models (LLMs), as these models must adhere strictly to user-provided constraints and guidelines. However, LLMs often fail to follow even simple and clear instructions. To improve instruction-following behavior and prevent undesirable outputs, a deeper understanding of how LLMs' internal states relate to these outcomes is r… ▽ More

    Submitted 28 March, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

  19. arXiv:2410.13095  [pdf, other

    cs.SI cs.CE cs.CR cs.CY cs.HC

    Future of Algorithmic Organization: Large-Scale Analysis of Decentralized Autonomous Organizations (DAOs)

    Authors: Tanusree Sharma, Yujin Potter, Kornrapat Pongmala, Henry Wang, Andrew Miller, Dawn Song, Yang Wang

    Abstract: Decentralized Autonomous Organizations (DAOs) resemble early online communities, particularly those centered around open-source projects, and present a potential empirical framework for complex social-computing systems by encoding governance rules within "smart contracts" on the blockchain. A key function of a DAO is collective decision-making, typically carried out through a series of proposals w… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  20. arXiv:2410.09053  [pdf, other

    math.RA cs.MS cs.SC math.NA

    Fast Symbolic Integer-Linear Spectra

    Authors: Jonny Luntzel, Abraham Miller

    Abstract: Here we contribute a fast symbolic eigenvalue solver for matrices whose eigenvalues are $\mathbb{Z}$-linear combinations of their entries, alongside efficient general and stochastic $M^{X}$ generators. Users can interact with a few degrees of freedom to create linear operators, making high-dimensional symbolic analysis feasible for when numerical analyses are insufficient.

    Submitted 12 December, 2024; v1 submitted 18 September, 2024; originally announced October 2024.

  21. arXiv:2409.15163  [pdf, other

    cs.CL cs.IR

    Lessons Learned on Information Retrieval in Electronic Health Records: A Comparison of Embedding Models and Pooling Strategies

    Authors: Skatje Myers, Timothy A. Miller, Yanjun Gao, Matthew M. Churpek, Anoop Mayampurath, Dmitriy Dligach, Majid Afshar

    Abstract: Objective: Applying large language models (LLMs) to the clinical domain is challenging due to the context-heavy nature of processing medical records. Retrieval-augmented generation (RAG) offers a solution by facilitating reasoning over large text sources. However, there are many parameters to optimize in just the retrieval system alone. This paper presents an ablation study exploring how different… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  22. arXiv:2409.06627  [pdf, other

    cs.HC cs.CY cs.ET

    "The struggle is a part of the experience": Engaging Discontents in the Design of Family Meal Technologies

    Authors: Yuxing Wu, Andrew D Miller, Chia-Fang Chung, Elizabeth Kaziunas

    Abstract: Meals are a central (and messy) part of family life. Previous design framings for mealtime technologies have focused on supporting dietary needs or social and celebratory interactions at the dinner table; however, family meals involve the coordination of many activities and complicated family dynamics. In this paper, we report on findings from interviews and design sessions with 18 families from t… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Journal ref: Proc. ACM Hum.-Comput. Interact 8, CSCW2, Article 477 (November 2024), 33 pages

  23. arXiv:2408.11854  [pdf, other

    cs.CL cs.AI cs.LG

    When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?

    Authors: Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A Miller, Danielle Bitterman, Matthew Churpek, Majid Afshar

    Abstract: The introduction of Large Language Models (LLMs) has advanced data representation and analysis, bringing significant progress in their use for medical questions and answering. Despite these advancements, integrating tabular data, especially numerical data pivotal in clinical contexts, into LLM paradigms has not been thoroughly explored. In this study, we examine the effectiveness of vector represe… ▽ More

    Submitted 19 September, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: Accepted to Findings of EMNLP 2024

  24. arXiv:2408.02303  [pdf, other

    cs.CR

    PROF: Protected Order Flow in a Profit-Seeking World

    Authors: Kushal Babel, Nerla Jean-Louis, Yan Ji, Ujval Misra, Mahimna Kelkar, Kosala Yapa Mudiyanselage, Andrew Miller, Ari Juels

    Abstract: Users of decentralized finance (DeFi) applications face significant risks from adversarial actions that manipulate the order of transactions to extract value from users. Such actions -- an adversarial form of what is called maximal-extractable value (MEV) -- impact both individual outcomes and the stability of the DeFi ecosystem. MEV exploitation, moreover, is being institutionalized through an ar… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 21 pages, 14 figures

  25. arXiv:2406.05636  [pdf, other

    quant-ph cs.LG

    What is my quantum computer good for? Quantum capability learning with physics-aware neural networks

    Authors: Daniel Hothem, Ashe Miller, Timothy Proctor

    Abstract: Quantum computers have the potential to revolutionize diverse fields, including quantum chemistry, materials science, and machine learning. However, contemporary quantum computers experience errors that often cause quantum programs run on them to fail. Until quantum computers can reliably execute large quantum programs, stakeholders will need fast and reliable methods for assessing a quantum compu… ▽ More

    Submitted 26 February, 2025; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 24 pages, 4 figures, 4 tables, includes conference checklist

    Journal ref: Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

  26. arXiv:2405.05535  [pdf, other

    cs.DS cs.DM

    Reconfiguration of Multisets with Applications to Bin Packing

    Authors: Jeffrey Kam, Shahin Kamali, Avery Miller, Naomi Nishimura

    Abstract: We use the reconfiguration framework to analyze problems that involve the rearrangement of items among groups. In various applications, a group of items could correspond to the files or jobs assigned to a particular machine, and the goal of rearrangement could be improving efficiency or increasing locality. To cover problems arising in a wide range of application areas, we define the general Rep… ▽ More

    Submitted 28 October, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: A preliminary version of this paper appeared in the proceedings of the 18th International Conference and Workshops on Algorithms and Computation (WALCOM 2024)

  27. arXiv:2404.03044  [pdf

    cs.LG cs.AI

    The Artificial Intelligence Ontology: LLM-assisted construction of AI concept hierarchies

    Authors: Marcin P. Joachimiak, Mark A. Miller, J. Harry Caufield, Ryan Ly, Nomi L. Harris, Andrew Tritt, Christopher J. Mungall, Kristofer E. Bouchard

    Abstract: The Artificial Intelligence Ontology (AIO) is a systematization of artificial intelligence (AI) concepts, methodologies, and their interrelations. Developed via manual curation, with the additional assistance of large language models (LLMs), AIO aims to address the rapidly evolving landscape of AI by providing a comprehensive framework that encompasses both technical and ethical aspects of AI tech… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  28. arXiv:2403.13313  [pdf, other

    cs.AI cs.CL

    Polaris: A Safety-focused LLM Constellation Architecture for Healthcare

    Authors: Subhabrata Mukherjee, Paul Gamble, Markel Sanz Ausin, Neel Kant, Kriti Aggarwal, Neha Manjunath, Debajyoti Datta, Zhengliang Liu, Jiayuan Ding, Sophia Busacca, Cezanne Bianco, Swapnil Sharma, Rae Lasko, Michelle Voisard, Sanchay Harneja, Darya Filippova, Gerry Meixiong, Kevin Cha, Amir Youssefi, Meyhaa Buvanesh, Howard Weingram, Sebastian Bierman-Lytle, Harpreet Singh Mangat, Kim Parikh, Saad Godil , et al. (1 additional authors not shown)

    Abstract: We develop Polaris, the first safety-focused LLM constellation for real-time patient-AI healthcare conversations. Unlike prior LLM works in healthcare focusing on tasks like question answering, our work specifically focuses on long multi-turn voice conversations. Our one-trillion parameter constellation system is composed of several multibillion parameter LLMs as co-operative agents: a stateful pr… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  29. arXiv:2402.15384  [pdf, ps, other

    cs.RO cs.AI eess.SY

    Closed-loop Multi-step Planning

    Authors: Giulia Lafratta, Bernd Porr, Christopher Chandler, Alice Miller

    Abstract: Living organisms interact with their surroundings in a closed-loop fashion, where sensory inputs dictate the initiation and termination of behaviours. Even simple animals are able to develop and execute complex plans, which has not yet been replicated in robotics using pure closed-loop input control. We propose a solution to this problem by defining a set of discrete and temporary closed-loop cont… ▽ More

    Submitted 29 January, 2025; v1 submitted 23 February, 2024; originally announced February 2024.

  30. arXiv:2402.14959  [pdf, other

    stat.AP cs.CY stat.ML

    A Causal Framework to Evaluate Racial Bias in Law Enforcement Systems

    Authors: Jessy Xinyi Han, Andrew Miller, S. Craig Watkins, Christopher Winship, Fotini Christia, Devavrat Shah

    Abstract: We are interested in developing a data-driven method to evaluate race-induced biases in law enforcement systems. While the recent works have addressed this question in the context of police-civilian interactions using police stop data, they have two key limitations. First, bias can only be properly quantified if true criminality is accounted for in addition to race, but it is absent in prior works… ▽ More

    Submitted 20 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  31. arXiv:2401.13912  [pdf, other

    cs.LG

    A Survey of Deep Learning and Foundation Models for Time Series Forecasting

    Authors: John A. Miller, Mohammed Aldosari, Farah Saeed, Nasid Habib Barna, Subas Rana, I. Budak Arpinar, Ninghao Liu

    Abstract: Deep Learning has been successfully applied to many application domains, yet its advantages have been slow to emerge for time series forecasting. For example, in the well-known Makridakis (M) Competitions, hybrids of traditional statistical or machine learning techniques have only recently become the top performers. With the recent architectural advances in deep learning being applied to time seri… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  32. arXiv:2401.04915  [pdf, other

    cs.SI

    From low resource information extraction to identifying influential nodes in knowledge graphs

    Authors: Erica Cai, Olga Simek, Benjamin A. Miller, Danielle Sullivan-Pao, Evan Young, Christopher L. Smith

    Abstract: We propose a pipeline for identifying important entities from intelligence reports that constructs a knowledge graph, where nodes correspond to entities of fine-grained types (e.g. traffickers) extracted from the text and edges correspond to extracted relations between entities (e.g. cartel membership). The important entities in intelligence reports then map to central nodes in the knowledge graph… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 14 pages, 6 figures, to appear at CompleNet 2024

  33. arXiv:2312.16619  [pdf, other

    cs.CR quant-ph

    Evaluating the security of CRYSTALS-Dilithium in the quantum random oracle model

    Authors: Kelsey A. Jackson, Carl A. Miller, Daochen Wang

    Abstract: In the wake of recent progress on quantum computing hardware, the National Institute of Standards and Technology (NIST) is standardizing cryptographic protocols that are resistant to attacks by quantum adversaries. The primary digital signature scheme that NIST has chosen is CRYSTALS-Dilithium. The hardness of this scheme is based on the hardness of three computational problems: Module Learning wi… ▽ More

    Submitted 7 March, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: 23 pages; v2: added description of CRYSTALS-Dilithium, improved analysis of concrete parameters

  34. arXiv:2312.05409  [pdf, other

    cs.LG cs.AI eess.SP

    Large-scale Training of Foundation Models for Wearable Biosignals

    Authors: Salar Abbaspourazad, Oussama Elachqar, Andrew C. Miller, Saba Emrani, Udhyakumar Nallasamy, Ian Shapiro

    Abstract: Tracking biosignals is crucial for monitoring wellness and preempting the development of severe medical conditions. Today, wearable devices can conveniently record various biosignals, creating the opportunity to monitor health status without disruption to one's daily routine. Despite widespread use of wearable devices and existing digital biomarkers, the absence of curated data with annotated medi… ▽ More

    Submitted 6 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Camera ready version for ICLR 2024

  35. arXiv:2311.18259  [pdf, other

    cs.CV cs.AI

    Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

    Authors: Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain , et al. (76 additional authors not shown)

    Abstract: We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). 740 participants from 13 cities worldwide performed these activities in 123 different natural scene contexts, yielding long-form captures from… ▽ More

    Submitted 25 September, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Expanded manuscript (compared to arxiv v1 from Nov 2023 and CVPR 2024 paper from June 2024) for more comprehensive dataset and benchmark presentation, plus new results on v2 data release

  36. arXiv:2311.12976  [pdf, ps, other

    cs.DS cs.DC

    Fast Deterministic Rendezvous in Labeled Lines

    Authors: Avery Miller, Andrzej Pelc

    Abstract: Two mobile agents, starting from different nodes of a network modeled as a graph, and woken up at possibly different times, have to meet at the same node. This problem is known as rendezvous. We consider deterministic distributed rendezvous in the infinite path. Each node has a distinct label which is a positive integer. The time of rendezvous is the number of rounds until meeting, counted from th… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: A preliminary version of this paper appeared in the Proceedings of the 37th International Symposium on Distributed Computing (DISC 2023)

  37. arXiv:2311.09780  [pdf, other

    cs.LO cs.AI cs.RO

    Model Checking for Closed-Loop Robot Reactive Planning

    Authors: Christopher Chandler, Bernd Porr, Alice Miller, Giulia Lafratta

    Abstract: In this paper, we show how model checking can be used to create multi-step plans for a differential drive wheeled robot so that it can avoid immediate danger. Using a small, purpose built model checking algorithm in situ we generate plans in real-time in a way that reflects the egocentric reactive response of simple biological agents. Our approach is based on chaining temporary control systems whi… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: In Proceedings FMAS 2023, arXiv:2311.08987

    Journal ref: EPTCS 395, 2023, pp. 77-94

  38. arXiv:2311.04931  [pdf, other

    cs.CL cs.AI

    GPT4All: An Ecosystem of Open Source Compressed Language Models

    Authors: Yuvanesh Anand, Zach Nussbaum, Adam Treat, Aaron Miller, Richard Guo, Ben Schmidt, GPT4All Community, Brandon Duderstadt, Andriy Mulyar

    Abstract: Large language models (LLMs) have recently achieved human-level performance on a range of professional and academic benchmarks. The accessibility of these models has lagged behind their performance. State-of-the-art LLMs require costly infrastructure; are only accessible via rate-limited, geo-locked, and censored web interfaces; and lack publicly available code and technical reports. In this paper… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at NLP-OSS at EMNLP 2023

  39. arXiv:2310.07980  [pdf, other

    cs.LG

    GRASP: Accelerating Shortest Path Attacks via Graph Attention

    Authors: Zohair Shafi, Benjamin A. Miller, Ayan Chatterjee, Tina Eliassi-Rad, Rajmonda S. Caceres

    Abstract: Recent advances in machine learning (ML) have shown promise in aiding and accelerating classical combinatorial optimization algorithms. ML-based speed ups that aim to learn in an end to end manner (i.e., directly output the solution) tend to trade off run time with solution quality. Therefore, solutions that are able to accelerate existing solvers while maintaining their performance guarantees, ar… ▽ More

    Submitted 23 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  40. arXiv:2310.07979  [pdf, other

    cs.LG cs.DM

    Graph-SCP: Accelerating Set Cover Problems with Graph Neural Networks

    Authors: Zohair Shafi, Benjamin A. Miller, Tina Eliassi-Rad, Rajmonda S. Caceres

    Abstract: Machine learning (ML) approaches are increasingly being used to accelerate combinatorial optimization (CO) problems. We investigate the Set Cover Problem (SCP) and propose Graph-SCP, a graph neural network method that augments existing optimization solvers by learning to identify a much smaller sub-problem that contains the solution space. Graph-SCP uses both supervised learning from prior solved… ▽ More

    Submitted 26 August, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  41. arXiv:2309.13050  [pdf, other

    cs.IR cs.LG

    Decoding the Alphabet Soup of Degrees in the United States Postsecondary Education System Through Hybrid Method: Database and Text Mining

    Authors: Sahar Voghoei, James Byars, John A Miller, Khaled Rasheed, Hamid A Arabnia

    Abstract: This paper proposes a model to predict the levels (e.g., Bachelor, Master, etc.) of postsecondary degree awards that have been ambiguously expressed in the student tracking reports of the National Student Clearinghouse (NSC). The model will be the hybrid of two modules. The first module interprets the relevant abbreviatory elements embedded in NSC reports by referring to a comprehensive database t… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 18 Pages, 8 figures

  42. arXiv:2309.12339  [pdf

    cs.CY cs.AI cs.CL

    Considerations for health care institutions training large language models on electronic health records

    Authors: Weipeng Zhou, Danielle Bitterman, Majid Afshar, Timothy A. Miller

    Abstract: Large language models (LLMs) like ChatGPT have excited scientists across fields; in medicine, one source of excitement is the potential applications of LLMs trained on electronic health record (EHR) data. But there are tough questions we must first answer if health care institutions are interested in having LLMs trained on their own data; should they train an LLM from scratch or fine-tune it from… ▽ More

    Submitted 23 August, 2023; originally announced September 2023.

  43. arXiv:2309.01001  [pdf, other

    math.CO cs.DM

    Cops and Robbers on 1-Planar Graphs

    Authors: Stephane Durocher, Shahin Kamali, Myroslav Kryven, Fengyi Liu, Amirhossein Mashghdoust, Avery Miller, Pouria Zamani Nezhad, Ikaro Penha Costa, Timothy Zapp

    Abstract: Cops and Robbers is a well-studied pursuit-evasion game in which a set of cops seeks to catch a robber in a graph G, where cops and robber move along edges of G. The cop number of G is the minimum number of cops that is sufficient to catch the robber. Every planar graph has cop number at most three, and there are planar graphs for which three cops are necessary [Aigner and Fromme, DAM 1984]. We st… ▽ More

    Submitted 6 September, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: Appears in the Proceedings of the 31st International Symposium on Graph Drawing and Network Visualization (GD 2023)

    MSC Class: 68R10; 91A24

  44. arXiv:2308.06605  [pdf, other

    cs.DC

    Towards Exascale Computation for Turbomachinery Flows

    Authors: Yuhang Fu, Weiqi Shen, Jiahuan Cui, Yao Zheng, Guangwen Yang, Zhao Liu, Jifa Zhang, Tingwei Ji, Fangfang Xie, Xiaojing Lv, Hanyue Liu, Xu Liu, Xiyang Liu, Xiaoyu Song, Guocheng Tao, Yan Yan, Paul Tucker, Steven A. E. Miller, Shirui Luo, Seid Koric, Weimin Zheng

    Abstract: A state-of-the-art large eddy simulation code has been developed to solve compressible flows in turbomachinery. The code has been engineered with a high degree of scalability, enabling it to effectively leverage the many-core architecture of the new Sunway system. A consistent performance of 115.8 DP-PFLOPs has been achieved on a high-pressure turbine cascade consisting of over 1.69 billion mesh e… ▽ More

    Submitted 29 December, 2023; v1 submitted 12 August, 2023; originally announced August 2023.

    Comments: SC23, November, 2023, Denver, CO., USA

  45. arXiv:2308.05498  [pdf, other

    cs.SI

    Complex Network Effects on the Robustness of Graph Convolutional Networks

    Authors: Benjamin A. Miller, Kevin Chan, Tina Eliassi-Rad

    Abstract: Vertex classification -- the problem of identifying the class labels of nodes in a graph -- has applicability in a wide variety of domains. Examples include classifying subject areas of papers in citation networks or roles of machines in a computer network. Vertex classification using graph convolutional networks is susceptible to targeted poisoning attacks, in which both graph structure and node… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: 39 pages, 8 figures. arXiv admin note: text overlap with arXiv:2003.05822

  46. arXiv:2308.03081  [pdf, other

    cs.SI

    Using Overlapping Methods to Counter Adversaries in Community Detection

    Authors: Benjamin A. Miller, Kevin Chan, Tina Eliassi-Rad

    Abstract: When dealing with large graphs, community detection is a useful data triage tool that can identify subsets of the network that a data analyst should investigate. In an adversarial scenario, the graph may be manipulated to avoid scrutiny of certain nodes by the analyst. Robustness to such behavior is an important consideration for data analysts in high-stakes scenarios such as cyber defense and cou… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 28 pages, 10 figures

  47. arXiv:2307.13918  [pdf, other

    stat.ML cs.LG q-bio.QM

    Simulation-based Inference for Cardiovascular Models

    Authors: Antoine Wehenkel, Laura Manduchi, Jens Behrmann, Luca Pegolotti, Andrew C. Miller, Guillermo Sapiro, Ozan Sener, Marco Cuturi, Jörn-Henrik Jacobsen

    Abstract: Over the past decades, hemodynamics simulators have steadily evolved and have become tools of choice for studying cardiovascular systems in-silico. While such tools are routinely used to simulate whole-body hemodynamics from physiological parameters, solving the corresponding inverse problem of mapping waveforms back to plausible physiological parameters remains both promising and challenging. Mot… ▽ More

    Submitted 30 December, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

  48. arXiv:2305.19083  [pdf, other

    cs.SI

    Defense Against Shortest Path Attacks

    Authors: Benjamin A. Miller, Zohair Shafi, Wheeler Ruml, Yevgeniy Vorobeychik, Tina Eliassi-Rad, Scott Alfeld

    Abstract: Identifying shortest paths between nodes in a network is an important task in many applications. Recent work has shown that a malicious actor can manipulate a graph to make traffic between two nodes of interest follow their target path. In this paper, we develop a defense against such attacks by modifying the edge weights that users observe. The defender must balance inhibiting the attacker agains… ▽ More

    Submitted 30 April, 2025; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, to appear at the 2025 SIAM International Conference on Data Mining

  49. arXiv:2305.10989  [pdf, other

    cs.RO

    Reinforcement Learning for Legged Robots: Motion Imitation from Model-Based Optimal Control

    Authors: AJ Miller, Shamel Fahmi, Matthew Chignoli, Sangbae Kim

    Abstract: We propose MIMOC: Motion Imitation from Model-Based Optimal Control. MIMOC is a Reinforcement Learning (RL) controller that learns agile locomotion by imitating reference trajectories from model-based optimal control. MIMOC mitigates challenges faced by other motion imitation RL approaches because the references are dynamically consistent, require no motion retargeting, and include torque referenc… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  50. arXiv:2305.02462  [pdf, other

    physics.optics cs.ET eess.IV

    Scalable Low-latency Optical Phase Sensor Array

    Authors: Zhanghao Sun, Sunil Pai, Carson Valdez, Maziyar Milanizadeh, Andrea Melloni, Francesco Morichetti, David A. B. Miller, Olav Solgaard

    Abstract: Optical phase measurement is critical for many applications and traditional approaches often suffer from mechanical instability, temporal latency, and computational complexity. In this paper, we describe compact phase sensor arrays based on integrated photonics, which enable accurate and scalable reference-free phase sensing in a few measurement steps. This is achieved by connecting multiple two-p… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.