Skip to main content

Showing 1–50 of 20,359 results for author: David

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.19296  [pdf, ps, other

    cs.CV cs.GR

    Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

    Authors: Sherwin Bahmani, Tianchang Shen, Jiawei Ren, Jiahui Huang, Yifeng Jiang, Haithem Turki, Andrea Tagliasacchi, David B. Lindell, Zan Gojcic, Sanja Fidler, Huan Ling, Jun Gao, Xuanchi Ren

    Abstract: The ability to generate virtual environments is crucial for applications ranging from gaming to physical AI domains such as robotics, autonomous driving, and industrial AI. Current learning-based 3D reconstruction methods rely on the availability of captured real-world multi-view data, which is not always readily available. Recent advancements in video diffusion models have shown remarkable imagin… ▽ More

    Submitted 23 September, 2025; originally announced September 2025.

    Comments: Project Page: https://research.nvidia.com/labs/toronto-ai/lyra/

  2. arXiv:2509.19231  [pdf, ps, other

    cs.SD cs.AI cs.CL

    Finding My Voice: Generative Reconstruction of Disordered Speech for Automated Clinical Evaluation

    Authors: Karen Rosero, Eunjung Yeo, David R. Mortensen, Cortney Van't Slot, Rami R. Hallac, Carlos Busso

    Abstract: We present ChiReSSD, a speech reconstruction framework that preserves children speaker's identity while suppressing mispronunciations. Unlike prior approaches trained on healthy adult speech, ChiReSSD adapts to the voices of children with speech sound disorders (SSD), with particular emphasis on pitch and prosody. We evaluate our method on the STAR dataset and report substantial improvements in le… ▽ More

    Submitted 23 September, 2025; originally announced September 2025.

  3. arXiv:2509.18768  [pdf, ps, other

    cs.CY

    Purer than pure: how purity reshapes the upstream materiality of the semiconductor industry

    Authors: Gauthier Roussilhe, Thibault Pirson, David Bol, Srinjoy Mitra

    Abstract: Growing attention is given to the environmental impacts of the digital sector, exacerbated by the increase of digital products and services in our globalized societies. The materiality of the digital sector is often presented through the environmental impacts of mining activities to point out that digitization does not mean dematerialization. Despite its importance, such a narrative is often restr… ▽ More

    Submitted 23 September, 2025; originally announced September 2025.

    Comments: 11 pages, 7 figures

  4. arXiv:2509.18465  [pdf, ps, other

    cs.NI cs.IT

    Using Age of Information for Throughput Optimal Spectrum Sharing

    Authors: Hongjae Nam, Vishrant Tripathi, David J. Love

    Abstract: We consider a spectrum sharing problem where two users attempt to communicate over N channels. The Primary User (PU) has prioritized transmissions and its occupancy on each channel over time can be modeled as a Markov chain. The Secondary User (SU) needs to determine which channels are free at each time-slot and attempt opportunistic transmissions. The goal of the SU is to maximize its own through… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: 16 pages, 10 figures

  5. arXiv:2509.18439  [pdf

    cs.CL cs.AI

    Developing an AI framework to automatically detect shared decision-making in patient-doctor conversations

    Authors: Oscar J. Ponce-Ponte, David Toro-Tobon, Luis F. Figueroa, Michael Gionfriddo, Megan Branda, Victor M. Montori, Saturnino Luz, Juan P. Brito

    Abstract: Shared decision-making (SDM) is necessary to achieve patient-centred care. Currently no methodology exists to automatically measure SDM at scale. This study aimed to develop an automated approach to measure SDM by using language modelling and the conversational alignment (CA) score. A total of 157 video-recorded patient-doctor conversations from a randomized multi-centre trial evaluating SDM decis… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: 53 pages, 1 figure, 4 tables, 5 supplementary figures, 13 supplementary tables

  6. arXiv:2509.18412  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Identifying birdsong syllables without labelled data

    Authors: Mélisande Teng, Julien Boussard, David Rolnick, Hugo Larochelle

    Abstract: Identifying sequences of syllables within birdsongs is key to tackling a wide array of challenges, including bird individual identification and better understanding of animal communication and sensory-motor learning. Recently, machine learning approaches have demonstrated great potential to alleviate the need for experts to label long audio recordings by hand. However, they still typically rely on… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

  7. Proceedings Seventh International Conference on Applied Category Theory 2024

    Authors: Michael Johnson, David Jaz Myers

    Abstract: Proceedings of the Seventh International Conference on Applied Category Theory, held at the University of Oxford on 17 - 21 June 2024. The contributions to ACT 2024 ranged from pure to applied and included contributions in a wide range of disciplines in science and engineering. ACT 2024 included talks in classical mechanics, quantum physics, probability theory, linguistics, decision theory, machin… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Journal ref: EPTCS 429, 2025

  8. arXiv:2509.18030  [pdf, ps, other

    cs.CL

    RadEval: A framework for radiology text evaluation

    Authors: Justin Xu, Xi Zhang, Javid Abderezaei, Julie Bauml, Roger Boodoo, Fatemeh Haghighi, Ali Ganjizadeh, Eric Brattain, Dave Van Veen, Zaiqiao Meng, David Eyre, Jean-Benoit Delbrouck

    Abstract: We introduce RadEval, a unified, open-source framework for evaluating radiology texts. RadEval consolidates a diverse range of metrics, from classic n-gram overlap (BLEU, ROUGE) and contextual measures (BERTScore) to clinical concept-based scores (F1CheXbert, F1RadGraph, RaTEScore, SRR-BERT, TemporalEntityF1) and advanced LLM-based evaluators (GREEN). We refine and standardize implementations, ext… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: Accepted to EMNLP 2025 Demo track - Oral

  9. arXiv:2509.17957  [pdf, ps, other

    cs.AI cs.IT

    On the Variational Costs of Changing Our Minds

    Authors: David Hyland, Mahault Albarracin

    Abstract: The human mind is capable of extraordinary achievements, yet it often appears to work against itself. It actively defends its cherished beliefs even in the face of contradictory evidence, conveniently interprets information to conform to desired narratives, and selectively searches for or avoids information to suit its various purposes. Despite these behaviours deviating from common normative stan… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: Accepted as a full paper at the 6th International Workshop on Active Inference

  10. arXiv:2509.17789  [pdf, ps, other

    cs.CV

    From Restoration to Reconstruction: Rethinking 3D Gaussian Splatting for Underwater Scenes

    Authors: Guoxi Huang, Haoran Wang, Zipeng Qi, Wenjun Lu, David Bull, Nantheera Anantrasirichai

    Abstract: Underwater image degradation poses significant challenges for 3D reconstruction, where simplified physical models often fail in complex scenes. We propose \textbf{R-Splatting}, a unified framework that bridges underwater image restoration (UIR) with 3D Gaussian Splatting (3DGS) to improve both rendering quality and geometric fidelity. Our method integrates multiple enhanced views produced by diver… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

  11. arXiv:2509.17768  [pdf, ps, other

    cs.CL cs.AI cs.LG

    DIVERS-Bench: Evaluating Language Identification Across Domain Shifts and Code-Switching

    Authors: Jessica Ojo, Zina Kamel, David Ifeoluwa Adelani

    Abstract: Language Identification (LID) is a core task in multilingual NLP, yet current systems often overfit to clean, monolingual data. This work introduces DIVERS-BENCH, a comprehensive evaluation of state-of-the-art LID models across diverse domains, including speech transcripts, web text, social media texts, children's stories, and code-switched text. Our findings reveal that while models achieve high… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

  12. arXiv:2509.17726  [pdf, ps, other

    cs.CV cs.LG

    Automated Labeling of Intracranial Arteries with Uncertainty Quantification Using Deep Learning

    Authors: Javier Bisbal, Patrick Winter, Sebastian Jofre, Aaron Ponce, Sameer A. Ansari, Ramez Abdalla, Michael Markl, Oliver Welin Odeback, Sergio Uribe, Cristian Tejos, Julio Sotelo, Susanne Schnell, David Marlevi

    Abstract: Accurate anatomical labeling of intracranial arteries is essential for cerebrovascular diagnosis and hemodynamic analysis but remains time-consuming and subject to interoperator variability. We present a deep learning-based framework for automated artery labeling from 3D Time-of-Flight Magnetic Resonance Angiography (3D ToF-MRA) segmentations (n=35), incorporating uncertainty quantification to enh… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: 16 pages, 6 figures

    MSC Class: I.4.0

  13. arXiv:2509.17661  [pdf, ps, other

    eess.AS cs.SD

    Comparator Loss: An Ordinal Contrastive Loss to Derive a Severity Score for Speech-based Health Monitoring

    Authors: Jacob J Webber, Oliver Watts, Lovisa Wihlborg, David Wheatley, Johnny Tam, Christine Weaver, Suvankar Pal, Siddharthan Chandran, Cassia Valentini-Botinhao

    Abstract: Monitoring the progression of neurodegenerative disease has important applications in the planning of treatment and the evaluation of future medications. Whereas much of the state-of-the-art in health monitoring from speech has been focused on classifying patients versus healthy controls, or predicting real-world health metrics, we propose here a novel measure of disease progression: the severity… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: Submitted to ICASSP 2026. This work is supported by NEURii, a collaborative partnership involving the University of Edinburgh, Gates Ventures, Eisai, LifeArc and Health Data Research UK (HDR UK)

  14. arXiv:2509.17645  [pdf, ps, other

    astro-ph.EP astro-ph.IM cs.LG

    RAVEN: RAnking and Validation of ExoplaNets

    Authors: Andreas Hadjigeorghiou, David J. Armstrong, Kaiming Cui, Marina Lafarga Magro, Luis Agustín Nieto, Rodrigo F. Díaz, Lauren Doyle, Vedad Kunovac

    Abstract: We present RAVEN, a newly developed vetting and validation pipeline for TESS exoplanet candidates. The pipeline employs a Bayesian framework to derive the posterior probability of a candidate being a planet against a set of False Positive (FP) scenarios, through the use of a Gradient Boosted Decision Tree and a Gaussian Process classifier, trained on comprehensive synthetic training sets of simula… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: Submitted to MNRAS. Comments from the community are welcome

  15. arXiv:2509.17601  [pdf, ps, other

    physics.ao-ph cs.LG

    FastNet: Improving the physical consistency of machine-learning weather prediction models through loss function design

    Authors: Tom Dunstan, Oliver Strickson, Thusal Bennett, Jack Bowyer, Matthew Burnand, James Chappell, Alejandro Coca-Castro, Kirstine Ida Dale, Eric G. Daub, Noushin Eftekhari, Manvendra Janmaijaya, Jon Lillis, David Salvador-Jasin, Nathan Simpson, Ryan Sze-Yin Chan, Mohamad Elmasri, Lydia Allegranza France, Sam Madge, Levan Bokeria, Hannah Brown, Tom Dodds, Anna-Louise Ellis, David Llewellyn-Jones, Theo McCaie, Sophia Moreton , et al. (9 additional authors not shown)

    Abstract: Machine learning weather prediction (MLWP) models have demonstrated remarkable potential in delivering accurate forecasts at significantly reduced computational cost compared to traditional numerical weather prediction (NWP) systems. However, challenges remain in ensuring the physical consistency of MLWP outputs, particularly in deterministic settings. This study presents FastNet, a graph neural n… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

  16. LongEval at CLEF 2025: Longitudinal Evaluation of IR Systems on Web and Scientific Data

    Authors: Matteo Cancellieri, Alaa El-Ebshihy, Tobias Fink, Maik Fröbe, Petra Galuščáková, Gabriela Gonzalez-Saez, Lorraine Goeuriot, David Iommi, Jüri Keller, Petr Knoth, Philippe Mulhem, Florina Piroi, David Pride, Philipp Schaer

    Abstract: The LongEval lab focuses on the evaluation of information retrieval systems over time. Two datasets are provided that capture evolving search scenarios with changing documents, queries, and relevance assessments. Systems are assessed from a temporal perspective-that is, evaluating retrieval effectiveness as the data they operate on changes. In its third edition, LongEval featured two retrieval tas… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

  17. arXiv:2509.17405  [pdf, ps, other

    cs.LG

    Efficient Sliced Wasserstein Distance Computation via Adaptive Bayesian Optimization

    Authors: Manish Acharya, David Hyde

    Abstract: The sliced Wasserstein distance (SW) reduces optimal transport on $\mathbb{R}^d$ to a sum of one-dimensional projections, and thanks to this efficiency, it is widely used in geometry, generative modeling, and registration tasks. Recent work shows that quasi-Monte Carlo constructions for computing SW (QSW) yield direction sets with excellent approximation error. This paper presents an alternate, no… ▽ More

    Submitted 23 September, 2025; v1 submitted 22 September, 2025; originally announced September 2025.

    Comments: 19 pages, 11 figures

    MSC Class: 49Q22 (Primary) 90C57; 68Txx (Secondary) ACM Class: G.3; I.2

  18. arXiv:2509.17389  [pdf, ps, other

    cs.RO

    3D Printable Soft Liquid Metal Sensors for Delicate Manipulation Tasks

    Authors: Lois Liow, Jonty Milford, Emre Uygun, Andre Farinha, Vinoth Viswanathan, Josh Pinskier, David Howard

    Abstract: Robotics and automation are key enablers to increase throughput in ongoing conservation efforts across various threatened ecosystems. Cataloguing, digitisation, husbandry, and similar activities require the ability to interact with delicate, fragile samples without damaging them. Additionally, learning-based solutions to these tasks require the ability to safely acquire data to train manipulation… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: 8 pages, 4 figures

  19. arXiv:2509.17351  [pdf, ps, other

    cs.DC

    Institutional Research Computing Capabilities in Australia: 2024

    Authors: Slava Kitaeff, Luc Betbeder-Matibet, Jake Carroll, Stephen Giugni, David Abramson, John Zaitseff, Sarah Walters, David Powell, Chris Bording, Trung Nguyen, Angus Macoustra, Fabien Voisin, Bowen Chen, Jarrod Hurley

    Abstract: Institutional research computing infrastructure plays a vital role in Australia's research ecosystem, complementing and extending national facilities. This paper analyses research computing capabilities across Australian universities and organisations, showing how institutional systems support research excellence through local compute resources, specialised hardware, and cluster solutions. Our stu… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: 9 pages in IEEE Proceedings format, International Conference on eScience 2025, Accepted

  20. arXiv:2509.17286  [pdf, ps, other

    eess.AS cs.SD

    RADE for Land Mobile Radio: A Neural Codec for Transmission of Speech over Baseband FM Radio Channels

    Authors: David Rowe, Tibor Bece

    Abstract: In the 1990s Land Mobile Radio (LMR) systems evolved from analog frequency modulation (FM) to standardised digital systems. Both digital and analog FM systems now co-exist in various services and exhibit similar speech quality. The architecture of many digital radios retains the analog FM modulator and demodulator from legacy analog radios, but driven by a multi-level digital pulse train rather th… ▽ More

    Submitted 21 September, 2025; originally announced September 2025.

    Comments: 6 pages, 9 figures

  21. arXiv:2509.17282  [pdf, ps, other

    cs.CV cs.NI

    Task-Oriented Communications for 3D Scene Representation: Balancing Timeliness and Fidelity

    Authors: Xiangmin Xu, Zhen Meng, Kan Chen, Jiaming Yang, Emma Li, Philip G. Zhao, David Flynn

    Abstract: Real-time Three-dimensional (3D) scene representation is a foundational element that supports a broad spectrum of cutting-edge applications, including digital manufacturing, Virtual, Augmented, and Mixed Reality (VR/AR/MR), and the emerging metaverse. Despite advancements in real-time communication and computing, achieving a balance between timeliness and fidelity in 3D scene representation remain… ▽ More

    Submitted 21 September, 2025; originally announced September 2025.

    Comments: Submitted to IEEE Transactions on Mobile Computing

  22. arXiv:2509.17265  [pdf, ps, other

    cs.IR

    Identifying and Upweighting Power-Niche Users to Mitigate Popularity Bias in Recommendations

    Authors: David Liu, Erik Weis, Moritz Laber, Tina Eliassi-Rad, Brennan Klein

    Abstract: Recommender systems have been shown to exhibit popularity bias by over-recommending popular items and under-recommending relevant niche items. We seek to understand interactions with niche items in benchmark recommendation datasets as a step toward mitigating popularity bias. We find that, compared to mainstream users, niche-preferring users exhibit a longer-tailed activity-level distribution, ind… ▽ More

    Submitted 21 September, 2025; originally announced September 2025.

  23. arXiv:2509.17202  [pdf

    cs.IT cs.HC

    Fundamental Mechanisms of Human Learning

    Authors: Scott E. Allen, A. David Redish, René F. Kizilcec

    Abstract: Learning underlies nearly all human behavior and is central to education and education reform. Although recent advances in neuroscience have revealed the fundamental structure of learning processes, these insights have yet to be integrated into research and practice. Specifically, neuroscience has found that decision-making is governed by a structured process of perception, action-selection, and e… ▽ More

    Submitted 21 September, 2025; originally announced September 2025.

  24. arXiv:2509.17180  [pdf, ps, other

    cs.LG econ.EM stat.ME

    Regularizing Extrapolation in Causal Inference

    Authors: David Arbour, Harsh Parikh, Bijan Niknam, Elizabeth Stuart, Kara Rudolph, Avi Feller

    Abstract: Many common estimators in machine learning and causal inference are linear smoothers, where the prediction is a weighted average of the training outcomes. Some estimators, such as ordinary least squares and kernel ridge regression, allow for arbitrarily negative weights, which improve feature imbalance but often at the cost of increased dependence on parametric modeling assumptions and higher vari… ▽ More

    Submitted 21 September, 2025; originally announced September 2025.

  25. arXiv:2509.16801  [pdf, ps, other

    cs.DS cs.LG quant-ph

    Sublinear Time Quantum Sensitivity Sampling

    Authors: Zhao Song, David P. Woodruff, Lichen Zhang

    Abstract: We present a unified framework for quantum sensitivity sampling, extending the advantages of quantum computing to a broad class of classical approximation problems. Our unified framework provides a streamlined approach for constructing coresets and offers significant runtime improvements in applications such as clustering, regression, and low-rank approximation. Our contributions include: * $k$-… ▽ More

    Submitted 20 September, 2025; originally announced September 2025.

  26. arXiv:2509.16617  [pdf, ps, other

    cs.CV cs.AI

    Detection and Simulation of Urban Heat Islands Using a Fine-Tuned Geospatial Foundation Model

    Authors: David Kreismann

    Abstract: As urbanization and climate change progress, urban heat island effects are becoming more frequent and severe. To formulate effective mitigation plans, cities require detailed air temperature data. However, predictive analytics methods based on conventional machine learning models and limited data infrastructure often provide inaccurate predictions, especially in underserved areas. In this context,… ▽ More

    Submitted 20 September, 2025; originally announced September 2025.

    Comments: 12 pages, 4 figures, to appear in GI LNI (SKILL 2025)

    ACM Class: I.2.6; I.5.4; I.6.8

  27. arXiv:2509.16531  [pdf, ps, other

    cs.CL

    Leveraging Multilingual Training for Authorship Representation: Enhancing Generalization across Languages and Domains

    Authors: Junghwan Kim, Haotian Zhang, David Jurgens

    Abstract: Authorship representation (AR) learning, which models an author's unique writing style, has demonstrated strong performance in authorship attribution tasks. However, prior research has primarily focused on monolingual settings-mostly in English-leaving the potential benefits of multilingual AR models underexplored. We introduce a novel method for multilingual AR learning that incorporates two key… ▽ More

    Submitted 20 September, 2025; originally announced September 2025.

    Comments: Accepted to EMNLP 2025

  28. arXiv:2509.16508  [pdf, ps, other

    cs.LG

    Federated Learning with Ad-hoc Adapter Insertions: The Case of Soft-Embeddings for Training Classifier-as-Retriever

    Authors: Marijan Fofonjka, Shahryar Zehtabi, Alireza Behtash, Tyler Mauer, David Stout

    Abstract: When existing retrieval-augmented generation (RAG) solutions are intended to be used for new knowledge domains, it is necessary to update their encoders, which are taken to be pretrained large language models (LLMs). However, fully finetuning these large models is compute- and memory-intensive, and even infeasible when deployed on resource-constrained edge devices. We propose a novel encoder archi… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: 22 pages, 7 figures, 3 tables

  29. arXiv:2509.16413  [pdf, ps, other

    cs.CL cs.AI

    Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research

    Authors: Richard Diehl Martinez, David Demitri Africa, Yuval Weiss, Suchir Salhan, Ryan Daniels, Paula Buttery

    Abstract: Building language models (LMs), especially small and medium ones, remains more art than science. While large LMs often improve by sheer scale, it is still unclear why many design choices work. For small LMs, this uncertainty is more limiting: tight parameter budgets make each decision critical, yet researchers still lack systematic, scientific ways to test and refine new ideas. We introduce Pico… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  30. arXiv:2509.16378  [pdf

    cs.CY cs.CL

    Longitudinal and Multimodal Recording System to Capture Real-World Patient-Clinician Conversations for AI and Encounter Research: Protocol

    Authors: Misk Al Zahidy, Kerly Guevara Maldonado, Luis Vilatuna Andrango, Ana Cristina Proano, Ana Gabriela Claros, Maria Lizarazo Jimenez, David Toro-Tobon, Oscar J. Ponce-Ponce, Juan P. Brito

    Abstract: The promise of AI in medicine depends on learning from data that reflect what matters to patients and clinicians. Most existing models are trained on electronic health records (EHRs), which capture biological measures but rarely patient-clinician interactions. These relationships, central to care, unfold across voice, text, and video, yet remain absent from datasets. As a result, AI systems traine… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: 23 pages, 2 figures, 2 tables

  31. arXiv:2509.16266  [pdf, ps, other

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    Vibrational Fingerprints of Strained Polymers: A Spectroscopic Pathway to Mechanical State Prediction

    Authors: Julian Konrad, Janina Mittelhaus, David M. Wilkins, Bodo Fiedler, Robert Meißner

    Abstract: The vibrational response of polymer networks under load provides a sensitive probe of molecular deformation and a route to non-destructive diagnostics. Here we show that machine-learned force fields reproduce these spectroscopic fingerprints with quantum-level fidelity in realistic epoxy thermosets. Using MACE-OFF23 molecular dynamics, we capture the experimentally observed redshifts of para-pheny… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

  32. arXiv:2509.16262  [pdf

    cs.CY cs.AI

    Socratic Mind: Impact of a Novel GenAI-Powered Assessment Tool on Student Learning and Higher-Order Thinking

    Authors: Jeonghyun Lee, Jui-Tse Hung, Meryem Yilmaz Soylu, Diana Popescu, Christopher Zhang Cui, Gayane Grigoryan, David A Joyner, Stephen W Harmon

    Abstract: This study examines the impact of Socratic Mind, a Generative Artificial Intelligence (GenAI) powered formative assessment tool that employs Socratic questioning to support student learning in a large, fully online undergraduate-level computing course. Employing a quasi-experimental, mixed-methods design, we investigated participants' engagement patterns, the influence of user experience on engage… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

  33. arXiv:2509.16203  [pdf, ps, other

    cs.LG

    Inverting Trojans in LLMs

    Authors: Zhengxing Li, Guangmingmei Yang, Jayaram Raghuram, David J. Miller, George Kesidis

    Abstract: While effective backdoor detection and inversion schemes have been developed for AIs used e.g. for images, there are challenges in "porting" these methods to LLMs. First, the LLM input space is discrete, which precludes gradient-based search over this space, central to many backdoor inversion methods. Second, there are ~30,000^k k-tuples to consider, k the token-length of a putative trigger. Third… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  34. arXiv:2509.16180  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Query-Efficient Locally Private Hypothesis Selection via the Scheffe Graph

    Authors: Gautam Kamath, Alireza F. Pour, Matthew Regehr, David P. Woodruff

    Abstract: We propose an algorithm with improved query-complexity for the problem of hypothesis selection under local differential privacy constraints. Given a set of $k$ probability distributions $Q$, we describe an algorithm that satisfies local differential privacy, performs $\tilde{O}(k^{3/2})$ non-adaptive queries to individuals who each have samples from a probability distribution $p$, and outputs a pr… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  35. arXiv:2509.16040  [pdf, ps, other

    cs.LG cond-mat.mtrl-sci cs.CE

    Automated Constitutive Model Discovery by Pairing Sparse Regression Algorithms with Model Selection Criteria

    Authors: Jorge-Humberto Urrea-Quintero, David Anton, Laura De Lorenzis, Henning Wessels

    Abstract: The automated discovery of constitutive models from data has recently emerged as a promising alternative to the traditional model calibration paradigm. In this work, we present a fully automated framework for constitutive model discovery that systematically pairs three sparse regression algorithms (Least Absolute Shrinkage and Selection Operator (LASSO), Least Angle Regression (LARS), and Orthogon… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  36. arXiv:2509.16032  [pdf, ps, other

    cs.RO cs.HC

    A Matter of Height: The Impact of a Robotic Object on Human Compliance

    Authors: Michael Faber, Andrey Grishko, Julian Waksberg, David Pardo, Tomer Leivy, Yuval Hazan, Emanuel Talmansky, Benny Megidish, Hadas Erel

    Abstract: Robots come in various forms and have different characteristics that may shape the interaction with them. In human-human interactions, height is a characteristic that shapes human dynamics, with taller people typically perceived as more persuasive. In this work, we aspired to evaluate if the same impact replicates in a human-robot interaction and specifically with a highly non-humanoid robotic obj… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: 8 pages, 6 figures, 1 table, submitted to IEEE RO-MAN 2025

  37. arXiv:2509.16020  [pdf, ps, other

    quant-ph cs.AI cs.LG

    AI Methods for Permutation Circuit Synthesis Across Generic Topologies

    Authors: Victor Villar, Juan Cruz-Benito, Ismael Faro, David Kremer

    Abstract: This paper investigates artificial intelligence (AI) methodologies for the synthesis and transpilation of permutation circuits across generic topologies. Our approach uses Reinforcement Learning (RL) techniques to achieve near-optimal synthesis of permutation circuits up to 25 qubits. Rather than developing specialized models for individual topologies, we train a foundational model on a generic re… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: This paper has been accepted by First AAAI Symposium on Quantum Information & Machine Learning (QIML): Bridging Quantum Computing and Artificial Intelligence at AAAI 2025 Fall Symposium

  38. arXiv:2509.15933  [pdf, ps, other

    cs.LG eess.SY

    Bayesian Physics Informed Neural Networks for Reliable Transformer Prognostics

    Authors: Ibai Ramirez, Jokin Alcibar, Joel Pino, Mikel Sanz, David Pardo, Jose I. Aizpurua

    Abstract: Scientific Machine Learning (SciML) integrates physics and data into the learning process, offering improved generalization compared with purely data-driven models. Despite its potential, applications of SciML in prognostics remain limited, partly due to the complexity of incorporating partial differential equations (PDEs) for ageing physics and the scarcity of robust uncertainty quantification me… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: Submitted to the Annual Prognostics and Health Management (PHM) Society Conference 2025

  39. arXiv:2509.15909  [pdf, ps, other

    cs.CE cs.RO

    A CARLA-based Simulation of Electrically Driven Forklifts

    Authors: David Claus, Christiane Thielemann, Hans-Georg Stark

    Abstract: This paper presents the simulation of the operation of an electric forklift fleet within an intralogistics scenario. For this purpose, the open source simulation tool CARLA is used; according to our knowledge this is a novel approach in the context of logistics simulation. First, CARLA is used to generate and visualize a realistic 3D outdoor warehouse scenario, incorporating a number of randomly m… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  40. arXiv:2509.15905  [pdf, ps, other

    cs.CV

    Deep Feedback Models

    Authors: David Calhas, Arlindo L. Oliveira

    Abstract: Deep Feedback Models (DFMs) are a new class of stateful neural networks that combine bottom up input with high level representations over time. This feedback mechanism introduces dynamics into otherwise static architectures, enabling DFMs to iteratively refine their internal state and mimic aspects of biological decision making. We model this process as a differential equation solved through a rec… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  41. arXiv:2509.15895  [pdf

    cs.LG cs.AI cs.CV

    From Data to Diagnosis: A Large, Comprehensive Bone Marrow Dataset and AI Methods for Childhood Leukemia Prediction

    Authors: Henning Höfener, Farina Kock, Martina Pontones, Tabita Ghete, David Pfrang, Nicholas Dickel, Meik Kunz, Daniela P. Schacherer, David A. Clunie, Andrey Fedorov, Max Westphal, Markus Metzler

    Abstract: Leukemia diagnosis primarily relies on manual microscopic analysis of bone marrow morphology supported by additional laboratory parameters, making it complex and time consuming. While artificial intelligence (AI) solutions have been proposed, most utilize private datasets and only cover parts of the diagnostic pipeline. Therefore, we present a large, high-quality, publicly available leukemia bone… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  42. arXiv:2509.15643  [pdf, ps, other

    cs.IT

    Finite-blocklength Fluid Antenna Systems

    Authors: Zhentian Zhang, Kai-Kit Wong, David Morales-Jimenez, Hao Jiang, Hao Xu, Christos Masouros, Zaichen Zhang, Chan-Byoung Chae

    Abstract: This work introduces and investigates finite blocklength fluid antenna systems (FBL-FASs). To meet the stringent key performance indicators (KPIs) of 6G and beyond networks, including ultra-massive machine-type communications (mMTC), ultra-reliable low-latency communications (URLLC), and enhanced mobile broadband (eMBB), it is necessary to evaluate the performance of FAS under limited channel uses… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

  43. arXiv:2509.15373  [pdf, ps, other

    cs.CL

    Frustratingly Easy Data Augmentation for Low-Resource ASR

    Authors: Katsumi Ibaraki, David Chiang

    Abstract: This paper introduces three self-contained data augmentation methods for low-resource Automatic Speech Recognition (ASR). Our techniques first generate novel text--using gloss-based replacement, random replacement, or an LLM-based approach--and then apply Text-to-Speech (TTS) to produce synthetic audio. We apply these methods, which leverage only the original annotated data, to four languages with… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: 5 pages, 2 figures, 2 tables, submitted to ICASSP 2026

  44. arXiv:2509.15335  [pdf, ps, other

    cs.CL

    PolBiX: Detecting LLMs' Political Bias in Fact-Checking through X-phemisms

    Authors: Charlott Jakob, David Harbecke, Patrick Parschan, Pia Wenzel Neves, Vera Schmitt

    Abstract: Large Language Models are increasingly used in applications requiring objective assessment, which could be compromised by political bias. Many studies found preferences for left-leaning positions in LLMs, but downstream effects on tasks like fact-checking remain underexplored. In this study, we systematically investigate political bias through exchanging words with euphemisms or dysphemisms in Ger… ▽ More

    Submitted 23 September, 2025; v1 submitted 18 September, 2025; originally announced September 2025.

    Comments: Accepted at Findings of EMNLP 2025, camera-ready version

  45. arXiv:2509.15325  [pdf, ps, other

    cs.RO cs.HC

    Measurement and Potential Field-Based Patient Modeling for Model-Mediated Tele-ultrasound

    Authors: Ryan S. Yeung, David G. Black, Septimiu E. Salcudean

    Abstract: Teleoperated ultrasound can improve diagnostic medical imaging access for remote communities. Having accurate force feedback is important for enabling sonographers to apply the appropriate probe contact force to optimize ultrasound image quality. However, large time delays in communication make direct force feedback impractical. Prior work investigated using point cloud-based model-mediated teleop… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

  46. arXiv:2509.15278  [pdf

    q-bio.OT cs.CR cs.CY eess.IV

    Assessing metadata privacy in neuroimaging

    Authors: Emilie Kibsgaard, Anita Sue Jwa, Christopher J Markiewicz, David Rodriguez Gonzalez, Judith Sainz Pardo, Russell A. Poldrack, Cyril R. Pernet

    Abstract: The ethical and legal imperative to share research data without causing harm requires careful attention to privacy risks. While mounting evidence demonstrates that data sharing benefits science, legitimate concerns persist regarding the potential leakage of personal information that could lead to reidentification and subsequent harm. We reviewed metadata accompanying neuroimaging datasets from six… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: 19 pages, 7 tables, 2 figures, original analysis of 6 Open Datasets

  47. arXiv:2509.15273  [pdf, ps, other

    cs.RO

    Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI

    Authors: Fei Ni, Min Zhang, Pengyi Li, Yifu Yuan, Lingfeng Zhang, Yuecheng Liu, Peilong Han, Longxin Kou, Shaojin Ma, Jinbin Qiao, David Gamaliel Arcos Bravo, Yuening Wang, Xiao Hu, Zhanguang Zhang, Xianze Yao, Yutong Li, Zhao Zhang, Ying Wen, Ying-Cong Chen, Xiaodan Liang, Liang Lin, Bin He, Haitham Bou-Ammar, He Wang, Huazhe Xu , et al. (12 additional authors not shown)

    Abstract: Embodied AI development significantly lags behind large foundation models due to three critical challenges: (1) lack of systematic understanding of core capabilities needed for Embodied AI, making research lack clear objectives; (2) absence of unified and standardized evaluation systems, rendering cross-benchmark evaluation infeasible; and (3) underdeveloped automated and scalable acquisition meth… ▽ More

    Submitted 23 September, 2025; v1 submitted 18 September, 2025; originally announced September 2025.

    Comments: 32 pages, 5 figures, Embodied Arena Technical Report

  48. arXiv:2509.15263  [pdf, ps, other

    cs.HC cs.LG

    Subject Matter Expertise vs Professional Management in Collective Sequential Decision Making

    Authors: David Shoresh, Yonatan Loewenstein

    Abstract: Your company's CEO is retiring. You search for a successor. You can promote an employee from the company familiar with the company's operations, or recruit an external professional manager. Who should you prefer? It has not been clear how to address this question, the "subject matter expertise vs. professional manager debate", quantitatively and objectively. We note that a company's success depend… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: Reinforcement Learning and Decision Making (RLDM) 2025. arXiv admin note: substantial text overlap with arXiv:2412.18593

    ACM Class: K.4.3

  49. arXiv:2509.15031  [pdf, ps, other

    cs.CV

    AutoEdit: Automatic Hyperparameter Tuning for Image Editing

    Authors: Chau Pham, Quan Dao, Mahesh Bhosale, Yunjie Tian, Dimitris Metaxas, David Doermann

    Abstract: Recent advances in diffusion models have revolutionized text-guided image editing, yet existing editing methods face critical challenges in hyperparameter identification. To get the reasonable editing performance, these methods often require the user to brute-force tune multiple interdependent hyperparameters, such as inversion timesteps and attention modification, \textit{etc.} This process incur… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

    Comments: Accepted to NeurIPS 2025

  50. arXiv:2509.14816  [pdf, ps, other

    cs.RO cs.LG

    Scalable Multi-Objective Robot Reinforcement Learning through Gradient Conflict Resolution

    Authors: Humphrey Munn, Brendan Tidd, Peter Böhm, Marcus Gallagher, David Howard

    Abstract: Reinforcement Learning (RL) robot controllers usually aggregate many task objectives into one scalar reward. While large-scale proximal policy optimisation (PPO) has enabled impressive results such as robust robot locomotion in the real world, many tasks still require careful reward tuning and are brittle to local optima. Tuning cost and sub-optimality grow with the number of objectives, limiting… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.