Skip to main content

Showing 101–150 of 420,950 results for author: T.

.
  1. arXiv:2506.17748  [pdf, ps, other

    cs.CL cs.AI

    HIDE and Seek: Detecting Hallucinations in Language Models via Decoupled Representations

    Authors: Anwoy Chatterjee, Yash Goel, Tanmoy Chakraborty

    Abstract: Contemporary Language Models (LMs), while impressively fluent, often generate content that is factually incorrect or unfaithful to the input context - a critical issue commonly referred to as 'hallucination'. This tendency of LMs to generate hallucinated content undermines their reliability, especially because these fabrications are often highly convincing and therefore difficult to detect. While… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  2. arXiv:2506.17747  [pdf, ps, other

    physics.geo-ph cs.CE cs.CV cs.LG cs.NE

    Pix2Geomodel: A Next-Generation Reservoir Geomodeling with Property-to-Property Translation

    Authors: Abdulrahman Al-Fakih, Ardiansyah Koeshidayatullah, Nabil A. Saraih, Tapan Mukerji, Rayan Kanfar, Abdulmohsen Alali, SanLinn I. Kaka

    Abstract: Accurate geological modeling is critical for reservoir characterization, yet traditional methods struggle with complex subsurface heterogeneity, and they have problems with conditioning to observed data. This study introduces Pix2Geomodel, a novel conditional generative adversarial network (cGAN) framework based on Pix2Pix, designed to predict reservoir properties (facies, porosity, permeability,… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 34 pages, 13 figures

  3. arXiv:2506.17743  [pdf, ps, other

    cs.DS

    Optimizing Periodic Operations for Efficient Inland Waterway Lock Management

    Authors: Julian Golak, Alexander Grigoriev, Freija van Lent, Tom van der Zanden

    Abstract: In inland waterways, the efficient management of water lock operations impacts the level of congestion and the resulting uncertainty in inland waterway transportation. To achieve reliable and efficient traffic, schedules should be easy to understand and implement, reducing the likelihood of errors. The simplest schedules follow periodic patterns, reducing complexity and facilitating predictable ma… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  4. arXiv:2506.17741  [pdf, ps, other

    cs.CY

    Experimental Evidence for the Propagation and Preservation of Machine Discoveries in Human Populations

    Authors: Levin Brinkmann, Thomas F. Eisenmann, Anne-Marie Nussberger, Maxim Derex, Sara Bonati, Valerii Chirkov, Iyad Rahwan

    Abstract: Intelligent machines with superhuman capabilities have the potential to uncover problem-solving strategies beyond human discovery. Emerging evidence from competitive gameplay, such as Go, demonstrates that AI systems are evolving from mere tools to sources of cultural innovation adopted by humans. However, the conditions under which intelligent machines transition from tools to drivers of persiste… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  5. arXiv:2506.17738  [pdf, ps, other

    math.GT

    The tangle-valued 1-cocycle for knots

    Authors: Thomas Fiedler

    Abstract: This paper contains the strongest and at the same time most calculable knot invariant ever. Let $Θ$ be the topological moduli space of all ordered oriented tangles in 3-space. We construct a non-trivial combinatorial 1-cocycle $\mathbb{L}$ for $Θ$ that takes its values in $H_0(Θ;\mathbb{Z})$. The 1-cocycle $\mathbb{L}$ has a very nice property, called the {\em scan-property}: if we slide a tangl… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 32 pages, 19 figures

    MSC Class: 57M25

  6. arXiv:2506.17719  [pdf, ps, other

    cond-mat.mtrl-sci cs.AI physics.comp-ph

    Resolving the Ti-V Phase Diagram Discrepancy with First-Principles Calculations and Bayesian Learning

    Authors: Timofei Miryashkin, Olga Klimanova, Alexander Shapeev

    Abstract: Conflicting experiments disagree on whether the titanium-vanadium (Ti-V) binary alloy exhibits a body-centred cubic (BCC) miscibility gap or remains completely soluble. A leading hypothesis attributes the miscibility gap to oxygen contamination during alloy preparation. To resolve this controversy, we use an ab initio + machine-learning workflow that couples an actively-trained Moment Tensor Poten… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  7. On sequentially Cohen-Macaulay modules and sequentially generalized Cohen-Macaulay modules

    Authors: Nguyen Xuan Linh, Le Thanh Nhan

    Abstract: We introduce the notions of sequential sequence and sequential f-sequence in order to characterize sequentially Cohen-Macaulay modules and sequentially generalized Cohen-Macaulay modules. Let R be a Noetherian local ring and M a finitely generated R-module. We show that M is sequentially Cohen-Macaulay (resp. sequentially generalized Cohen-Macaulay) if and only if there exists a system of paramete… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 18 pages

    MSC Class: 13E05; 13C14; 13D45

    Journal ref: J. Algebra 678 (2025) 635-653

  8. arXiv:2506.17709  [pdf, ps, other

    cs.LG cs.CR stat.ML

    CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition

    Authors: Zebin Wang, Menghan Lin, Bolin Shen, Ken Anderson, Molei Liu, Tianxi Cai, Yushun Dong

    Abstract: Graph Neural Networks (GNNs) have demonstrated remarkable utility across diverse applications, and their growing complexity has made Machine Learning as a Service (MLaaS) a viable platform for scalable deployment. However, this accessibility also exposes GNN to serious security threats, most notably model extraction attacks (MEAs), in which adversaries strategically query a deployed model to const… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  9. arXiv:2506.17705  [pdf, ps, other

    cs.CV

    DreamJourney: Perpetual View Generation with Video Diffusion Models

    Authors: Bo Pan, Yang Chen, Yingwei Pan, Ting Yao, Wei Chen, Tao Mei

    Abstract: Perpetual view generation aims to synthesize a long-term video corresponding to an arbitrary camera trajectory solely from a single input image. Recent methods commonly utilize a pre-trained text-to-image diffusion model to synthesize new content of previously unseen regions along camera movement. However, the underlying 2D diffusion model lacks 3D awareness and results in distorted artifacts. Mor… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  10. arXiv:2506.17699  [pdf, ps, other

    physics.app-ph

    On the Practicability of Ceramic-Tiled Walls for Sound Absorption by Tuning Cavities

    Authors: Ozgur T. Tugut, Brahim Lemkalli, Qingxiang Ji, Mahmoud Addouche, Benjamin Vial, Sébastien Guenneau, Richard Craster, Claudio Bizzaglia, Bogdan Ungureanu, Muamer Kadic

    Abstract: We present the practicality of structuring ceramic tiles for enhancing sound absorption on rigid walls. The cornerstone of our methodology is to structure walls with cavities so that walls effectively behave as heterogeneous absorbing surfaces over a large frequency bandwidth. Using this approach, ceramic tiled walls are developed by integrating tuned cavity structures based on Helmholtz resonator… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 18 pages, 8 figures

  11. arXiv:2506.17692  [pdf, ps, other

    cs.CL

    Resource-Friendly Dynamic Enhancement Chain for Multi-Hop Question Answering

    Authors: Binquan Ji, Haibo Luo, Yifei Lu, Lei Hei, Jiaqi Wang, Tingjing Liao, Lingyu Wang, Shichao Wang, Feiliang Ren

    Abstract: Knowledge-intensive multi-hop question answering (QA) tasks, which require integrating evidence from multiple sources to address complex queries, often necessitate multiple rounds of retrieval and iterative generation by large language models (LLMs). However, incorporating many documents and extended contexts poses challenges -such as hallucinations and semantic drift-for lightweight LLMs with few… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  12. arXiv:2506.17691  [pdf

    cond-mat.mtrl-sci

    Brightening dark trions in WS2 monolayers via introducing atomic sulfur vacancies

    Authors: Xuguang Cao, Wanggui Ye, Debao Zhang, Ji Zhou, Lei Peng, Changcheng Zheng, Kenji Watanabe, Takashi Taniguchi, Jiqiang Ning, Shijie Xu

    Abstract: Understanding the effects of atomic defects on the optical functionality of two-dimensional (2D) layered materials is critical to develop novel optical and optoelectronic applications of these ultimate materials. Herein, we correlate sulfur vacancies (VS) and luminescence properties of dark trions in monolayer WS2 through introducing VS defects and conducting a systematic optical spectroscopic cha… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  13. arXiv:2506.17690  [pdf, ps, other

    eess.AS

    Low-resource keyword spotting using contrastively trained transformer acoustic word embeddings

    Authors: Julian Herreilers, Christiaan Jacobs, Thomas Niesler

    Abstract: We introduce a new approach, the ContrastiveTransformer, that produces acoustic word embeddings (AWEs) for the purpose of very low-resource keyword spotting. The ContrastiveTransformer, an encoder-only model, directly optimises the embedding space using normalised temperature-scaled cross entropy (NT-Xent) loss. We use this model to perform keyword spotting for radio broadcasts in Luganda and Bamb… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 5 pages, 2 figures

  14. arXiv:2506.17661  [pdf, ps, other

    nucl-ex astro-ph.HE

    New Determination of the $^{14}$C(n, $γ$)$^{15}$C Reaction Rate and Its Astrophysical Implications

    Authors: Yuchen Jiang, Zhenyu He, Yudong Luo, Wenyu Xin, Jie Chen, Xinyue Li, Yangping Shen, Bing Guo, Guo Li, Danyang Pang, Tianli Ma, Weike Nan, Toshitaka Kajino, Weiping Liu

    Abstract: We present a novel experiment to investigate the spectroscopic factor of the $^{15}$C ground state for the first time using single-neutron $removal$ transfer reactions on $^{15}$C. Two consistent spectroscopic factors were derived from the (p, d) and (d, t) reactions, which were subsequently used to deduce the $^{14}$C(n, $γ$)$^{15}$C reaction cross section and the corresponding stellar reaction r… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 20 pages, 8 figures, accepted by "The Astrophysical Journal"

  15. arXiv:2506.17658  [pdf, ps, other

    cs.NI

    Non-Intrusive MLOps-Driven Performance Intelligence in Software Data Planes

    Authors: Qiong Liu, Jianke Lin, Tianzhu Zhang, Leonardo Linguaglossa

    Abstract: The last decade has witnessed the proliferation of network function virtualization (NFV) in the telco industry, thanks to its unparalleled flexibility, scalability, and cost-effectiveness. However, as the NFV infrastructure is shared by virtual network functions (VNFs), sporadic resource contentions are inevitable. Such contention makes it extremely challenging to guarantee the performance of the… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  16. arXiv:2506.17652  [pdf, ps, other

    math.CO

    Entropy Bounds for Perfect Matchings in Bipartite Hypergraphs

    Authors: Tantan Dai, Alexander Divoux, Tom Kelly

    Abstract: A hypergraph is \textit{bipartite with bipartition} $(A, B)$ if every edge has exactly one vertex in $A$, and a matching in such a hypergraph is \textit{$A$-perfect} if it saturates every vertex in $A$. We prove an upper bound on the number of $A$-perfect matchings in uniform hypergraphs with small maximum codegree. Using this result, we prove that there exist order-$n$ Latin squares with at most… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 10 pages, 1 figure

    MSC Class: 05D40; 05B15; 05C15

  17. arXiv:2506.17646  [pdf, ps, other

    cs.IT eess.SP

    Quantizing for Noisy Flash Memory Channels

    Authors: Juyun Oh, Taewoo Park, Jiwoong Im, Yuval Cassuto, Yongjune Kim

    Abstract: Flash memory-based processing-in-memory (flash-based PIM) offers high storage capacity and computational efficiency but faces significant reliability challenges due to noise in high-density multi-level cell (MLC) flash memories. Existing verify level optimization methods are designed for general storage scenarios and fail to address the unique requirements of flash-based PIM systems, where metrics… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  18. arXiv:2506.17632  [pdf, ps, other

    cs.CV

    Optimization-Free Patch Attack on Stereo Depth Estimation

    Authors: Hangcheng Liu, Xu Kuang, Xingshuo Han, Xingwan Wu, Haoran Ou, Shangwei Guo, Xingyi Huang, Tao Xiang, Tianwei Zhang

    Abstract: Stereo Depth Estimation (SDE) is essential for scene understanding in vision-based systems like autonomous driving. However, recent studies show that SDE models are vulnerable to adversarial attacks, which are often limited to unrealistic settings, e.g., digital perturbations on separate stereo views in static scenes, restricting their real-world applicability. This raises a critical question: how… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  19. arXiv:2506.17629  [pdf, ps, other

    cs.CV cs.AI cs.CL

    CLiViS: Unleashing Cognitive Map through Linguistic-Visual Synergy for Embodied Visual Reasoning

    Authors: Kailing Li, Qi'ao Xu, Tianwen Qian, Yuqian Fu, Yang Jiao, Xiaoling Wang

    Abstract: Embodied Visual Reasoning (EVR) seeks to follow complex, free-form instructions based on egocentric video, enabling semantic understanding and spatiotemporal reasoning in dynamic environments. Despite its promising potential, EVR encounters significant challenges stemming from the diversity of complex instructions and the intricate spatiotemporal dynamics in long-term egocentric videos. Prior solu… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  20. arXiv:2506.17611  [pdf, ps, other

    cs.CL

    OpusLM: A Family of Open Unified Speech Language Models

    Authors: Jinchuan Tian, William Chen, Yifan Peng, Jiatong Shi, Siddhant Arora, Shikhar Bharadwaj, Takashi Maekaku, Yusuke Shinohara, Keita Goto, Xiang Yue, Huck Yang, Shinji Watanabe

    Abstract: This paper presents Open Unified Speech Language Models (OpusLMs), a family of open foundational speech language models (SpeechLMs) up to 7B. Initialized from decoder-only text language models, the OpusLMs are continuously pre-trained on 213K hours of speech-text pairs and 292B text-only tokens. We demonstrate our OpusLMs achieve comparable (or even superior) performance with existing SpeechLMs in… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  21. arXiv:2506.17608  [pdf, ps, other

    cs.CV

    HIRE: Lightweight High-Resolution Image Feature Enrichment for Multimodal LLMs

    Authors: Nikitha SR, Aradhya Neeraj Mathur, Tarun Ram Menta, Rishabh Jain, Mausoom Sarkar

    Abstract: The integration of high-resolution image features in modern multimodal large language models has demonstrated significant improvements in fine-grained visual understanding tasks, achieving high performance across multiple benchmarks. Since these features are obtained from large image encoders like ViT, they come with a significant increase in computational costs due to multiple calls to these enco… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: Accepted in CVPR 2025 Workshop on What's Next in Multimodal Foundational Models

  22. arXiv:2506.17606  [pdf, ps, other

    cs.HC

    Full-body WPT: wireless powering with meandered e-textiles

    Authors: Ryo Takahashi, Takashi Sato, Wakako Yukita, Tomoyuki Yokota, Takao Someya, Yoshihiro Kawahara

    Abstract: We present Full-body WPT, wireless power networking around the human body using a meandered textile coil. Unlike traditional inductive systems that emit strong fields into the deep tissue inside the body, the meander coil enables localized generation of strong magnetic field constrained to the skin surface, even when scaled to the size of the human body. Such localized inductive system enhances bo… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  23. The second Hilbert coefficient of modules with almost maximal depth

    Authors: Van Duc Trung

    Abstract: Let $\mathbb{M} = \{ M_n \}$ be a good $\mathfrak{q}$-filtration of a finitely generated $R$-module $M$ of dimension $d$, where $(R,\mathfrak{m})$ is a local ring and $\mathfrak{q}$ is an $\mathfrak{m}$-primary ideal of $R$. In case $depth(M) \geq d-1$, we give an upper bound for the second Hilbert coefficient $e_2(\mathbb{M})$ generalizing results by Huckaba-Marley and Rossi-Valla proved assuming… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 12 pages

    MSC Class: 13H10 ACM Class: F.2.2; I.2.7

    Journal ref: Journal of Algebra and Its Applications, Vol. 18, No. 12, 1950240, 2019

  24. arXiv:2506.17587  [pdf, ps, other

    cs.CV cs.AI cs.LG

    HalluRNN: Mitigating Hallucinations via Recurrent Cross-Layer Reasoning in Large Vision-Language Models

    Authors: Le Yu, Kaishen Wang, Jianlong Xiong, Yue Cao, Tao He

    Abstract: Though Large Vision-Language Models (LVLMs) have achieved remarkable performance across various tasks, they are still prone to hallucinations-generating outputs that are textually plausible but visually ungrounded. While prior approaches generally address this issue through data-centric fine-tuning or innovative decoding strategies, these methods often require substantial resources or task-specifi… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: 6 figures, 9 tables

  25. arXiv:2506.17584  [pdf, ps, other

    astro-ph.GA astro-ph.CO

    Outflowing shocked gas dominates the NIR H$_2$ emission from the dual AGN NGC6240

    Authors: J. Carlsen, C. Cicone, B. Hagedorn, K. Rubinur, P. Andreani, K. Dasyra, P. Severgnini, C. Vignali, R. Morganti, T. Oosterloo, A. Lasrado, E. Lopez-Rodriguez, S. Shen

    Abstract: [Abridged] We present a multi-line study of the kinematics of the molecular and ionised gas phases in the central 2 kpc of NGC6240, based on JWST/NIRSpec and ALMA observations. We devised a new spectral-line fitting approach to de-blend rotating and non-rotating gas components, which is better tailored to the extreme feedback mechanisms at work in NGC6240. We find that ~65% of the Pa$α$, H$_2$, an… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: Submitted to A&A. Comments welcome

  26. arXiv:2506.17564  [pdf, ps, other

    cs.LG cs.AI cs.RO

    Accelerating Residual Reinforcement Learning with Uncertainty Estimation

    Authors: Lakshita Dodeja, Karl Schmeckpeper, Shivam Vats, Thomas Weng, Mingxi Jia, George Konidaris, Stefanie Tellex

    Abstract: Residual Reinforcement Learning (RL) is a popular approach for adapting pretrained policies by learning a lightweight residual policy that provides corrective actions. While Residual RL is more sample-efficient than finetuning the entire base policy, existing methods struggle with sparse rewards and are designed for deterministic base policies. We propose two improvements to Residual RL that furth… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  27. arXiv:2506.17554  [pdf, ps, other

    astro-ph.GA

    Dynamics of Multiphase Carbon in the Turbulent Circumgalactic Medium

    Authors: Yue Hu, Evan Scannapieco, Edward Buie II, Siyao Xu, Samuel T Sebastian, Om Biswal

    Abstract: The circumgalactic medium (CGM) plays a crucial role in regulating material and energy exchange between galaxies and their environments. The best means of observing this medium is through absorption-line spectroscopy, but we have yet to develop a consistent physical model that fully explains these results. Here we investigate the impact of turbulence and non-equilibrium chemistry on the properties… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 15 pages, 10 figures, accepted for publication in ApJ

  28. arXiv:2506.17553  [pdf

    cond-mat.mes-hall physics.app-ph

    Physisorption on Nanomechanical Resonators: The Overlooked Influence of Trace Moisture

    Authors: Hemant Kumar Verma, Suman Kumar Mandal, Darkasha Khan, Faizan Tariq Beigh, Manoj Kandpal, Jaspreet Singh, Sushobhan Avasthi, Srinivasan Raghavan, Akshay Naik

    Abstract: Short gas pulses introduced in a vacuum chamber have long been utilized to showcase the ultra-low mass resolutions achievable with nanomechanical resonators. The resonance frequency shifts are used as evidence of gas adsorption. However, there is very little clarity as to what exactly is adsorbing on to the resonators. We demonstrate that the physisorption of gases on cantilevers is predominantly… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  29. arXiv:2506.17552  [pdf

    cs.LG cs.CV

    DRIMV_TSK: An Interpretable Surgical Evaluation Model for Incomplete Multi-View Rectal Cancer Data

    Authors: Wei Zhang, Zi Wang, Hanwen Zhou, Zhaohong Deng, Weiping Ding, Yuxi Ge, Te Zhang, Yuanpeng Zhang, Kup-Sze Choi, Shitong Wang, Shudong Hu

    Abstract: A reliable evaluation of surgical difficulty can improve the success of the treatment for rectal cancer and the current evaluation method is based on clinical data. However, more data about rectal cancer can be collected with the development of technology. Meanwhile, with the development of artificial intelligence, its application in rectal cancer treatment is becoming possible. In this paper, a m… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  30. arXiv:2506.17540  [pdf, ps, other

    eess.IV cs.CV cs.LG

    MTSIC: Multi-stage Transformer-based GAN for Spectral Infrared Image Colorization

    Authors: Tingting Liu, Yuan Liu, Jinhui Tang, Liyin Yuan, Chengyu Liu, Chunlai Li, Xiubao Sui, Qian Chen

    Abstract: Thermal infrared (TIR) images, acquired through thermal radiation imaging, are unaffected by variations in lighting conditions and atmospheric haze. However, TIR images inherently lack color and texture information, limiting downstream tasks and potentially causing visual fatigue. Existing colorization methods primarily rely on single-band images with limited spectral information and insufficient… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  31. arXiv:2506.17533  [pdf, ps, other

    cs.CL

    DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning

    Authors: Yuanhao Wu, Juntong Song, Hanning Zhang, Tong Zhang, Cheng Niu

    Abstract: In this paper, we propose DuaShepherd, a novel reward modeling framework that integrates two complementary reward signals, correctness and potential, to enhance the mathematical reasoning capabilities of Large Language Models (LLMs). While correctness-based signals emphasize identification of stepwise errors, potential-based signals focus on the likelihood of reaching the correct final answer. We… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  32. arXiv:2506.17532  [pdf, ps, other

    cond-mat.soft

    Interfacial instability of confined 3D active droplets

    Authors: Bennett C. Sessa, Federico Cao, Robert A. Pelcovits, Thomas R. Powers, Guillaume Duclos

    Abstract: Instabilities of fluid-fluid interfaces are ubiquitous in passive soft matter. Adding activity to the interface or either fluid can dramatically change the stability of the interface. Using experiment and theory, we investigate the interfacial instability of a deformable 3D active nematic liquid crystal droplet in the isotropic phase surrounded by a passive fluid and confined between two parallel… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  33. arXiv:2506.17525  [pdf, ps, other

    cs.CL cs.AI

    Data Quality Issues in Multilingual Speech Datasets: The Need for Sociolinguistic Awareness and Proactive Language Planning

    Authors: Mingfei Lau, Qian Chen, Yeming Fang, Tingting Xu, Tongzhou Chen, Pavel Golik

    Abstract: Our quality audit for three widely used public multilingual speech datasets - Mozilla Common Voice 17.0, FLEURS, and VoxPopuli - shows that in some languages, these datasets suffer from significant quality issues. We believe addressing these issues will make these datasets more useful as training and evaluation sets, and improve downstream models. We divide these quality issues into two categories… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: Accepted by ACL 2025 Main Conference

  34. arXiv:2506.17524  [pdf, ps, other

    math.NA math.DS

    Operator Splitting Methods: Numerical Solutions of Ordinary Differential Equations via Separation of Variables

    Authors: A. Banjara, I. AlJabea, T. Papamarkou, F. Neubrander

    Abstract: This paper applies the concept of linear semigroups induced by nonlinear flows, originally developed by Dorroh and Neuberger in the 1990s, to the approximation of uniquely solvable initial value problems for nonlinear ordinary differential equations. Building on a framework rooted in the earlier works of Lie, Kowalewski, and Groebner, we analyze nonlinear systems through the lens of the Koopman-Li… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    MSC Class: 65L05; 65L20; 47D03; 37M20

  35. arXiv:2506.17522  [pdf, ps, other

    hep-ex

    Radio emission from airplanes as observed with RNO-G

    Authors: RNO-G Collaboration, :, S. Agarwal, J. A. Aguilar, N. Alden, S. Ali, P. Allison, M. Betts, D. Besson, A. Bishop, O. Botner, S. Bouma, S. Buitink, R. Camphyn, J. Chan, S. Chiche, B. A. Clark, A. Coleman, K. Couberly, S. de Kockere, K. D. de Vries, C. Deaconu, P. Giri, C. Glaser, T. Glüsenkamp , et al. (58 additional authors not shown)

    Abstract: This paper describes how intentional and unintentional radio emission from airplanes is recorded with the Radio Neutrino Observatory Greenland (RNO-G). We characterize the received signals and define a procedure to extract a clean set of impulsive signals. These signals are highly suitable for instrument calibration, also for future experiments. A set of signals is used to probe the timing precisi… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  36. arXiv:2506.17514  [pdf, ps, other

    cs.AI

    Kaleidoscopic Teaming in Multi Agent Simulations

    Authors: Ninareh Mehrabi, Tharindu Kumarage, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

    Abstract: Warning: This paper contains content that may be inappropriate or offensive. AI agents have gained significant recent attention due to their autonomous tool usage capabilities and their integration in various real-world applications. This autonomy poses novel challenges for the safety of such systems, both in single- and multi-agent scenarios. We argue that existing red teaming or safety evaluat… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  37. arXiv:2506.17510  [pdf, ps, other

    cs.CY cs.DC physics.soc-ph

    A Grassroots Network and Community Roadmap for Interconnected Autonomous Science Laboratories for Accelerated Discovery

    Authors: Rafael Ferreira da Silva, Milad Abolhasani, Dionysios A. Antonopoulos, Laura Biven, Ryan Coffee, Ian T. Foster, Leslie Hamilton, Shantenu Jha, Theresa Mayer, Benjamin Mintz, Robert G. Moore, Salahudin Nimer, Noah Paulson, Woong Shin, Frederic Suter, Mitra Taheri, Michela Taufer, Newell R. Washburn

    Abstract: Scientific discovery is being revolutionized by AI and autonomous systems, yet current autonomous laboratories remain isolated islands unable to collaborate across institutions. We present the Autonomous Interconnected Science Lab Ecosystem (AISLE), a grassroots network transforming fragmented capabilities into a unified system that shorten the path from ideation to innovation to impact and accele… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  38. arXiv:2506.17507  [pdf, ps, other

    cs.CG cs.DC

    Optimal Parallel Algorithms for Convex Hulls in 2D and 3D under Noisy Primitive Operations

    Authors: Michael T. Goodrich, Vinesh Sridhar

    Abstract: In the noisy primitives model, each primitive comparison performed by an algorithm, e.g., testing whether one value is greater than another, returns the incorrect answer with random, independent probability p < 1/2 and otherwise returns a correct answer. This model was first applied in the context of sorting and searching, and recent work by Eppstein, Goodrich, and Sridhar extends this model to se… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 17 pages, 3 figures. Accepted at the 37th Canadian Conference on Computational Geometry, 2025

  39. arXiv:2506.17490  [pdf, ps, other

    econ.GN

    Social Group Bias in AI Finance

    Authors: Thomas R. Cook, Sophia Kazinnik

    Abstract: Financial institutions increasingly rely on large language models (LLMs) for high-stakes decision-making. However, these models risk perpetuating harmful biases if deployed without careful oversight. This paper investigates racial bias in LLMs specifically through the lens of credit decision-making tasks, operating on the premise that biases identified here are indicative of broader concerns acros… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  40. arXiv:2506.17480  [pdf

    physics.optics physics.geo-ph

    FINCH EYE: The Optical and Optomechanical Design of a GRISM-based SWIR Hyperspectral Imaging Payload for a 3U CubeSat

    Authors: Iliya Shofman, Mario Ghio Neto, Theaswanth Ganesh, Kenya He, Aidan Armstrong, Ksenya Narkevich

    Abstract: Crop residue is an important metric used for agricultural land-use monitoring and climate science research. Estimating crop residue coverage is essential to sustainable agricultural practices. The University of Toronto Aerospace Team is developing FINCH EYE, the optical payload for the upcoming FINCH 3U CubeSat, to measure crop residue cover. We conceived of a novel ultra-compact push-broom archit… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 8 pages, 10 figures, submitted to Small Satellite Conference 2025 (preprint)

  41. arXiv:2506.17479  [pdf, ps, other

    physics.ins-det astro-ph.IM gr-qc

    An array of bulk-acoustic-wave sensors as a high-frequency antenna for gravitational waves

    Authors: G. Albani, M. Borghesi, L. Canonica, R. Carobene, F. De Guio, M. Faverzani, E. Ferri, R. Gerosa, A. Ghezzi, A. Giachero, C. Gotti, D. Labranca, L. Mariani, A. Nucciotti, G. Pessina, D. Rozza, T. Tabarelli de Fatis

    Abstract: In their simplest form, bulk acoustic wave (BAW) devices consist of a piezoelectric crystal between two electrodes that transduce the material's vibrations into electrical signals. They are adopted in frequency control and metrology, with well-established standards at frequencies of 5~MHz and above. Their use as a resonant-mass strain antenna for high-frequency gravitational waves has been recentl… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  42. arXiv:2506.17477  [pdf, ps, other

    physics.chem-ph

    Mixed Planewave and Localized Orbital Basis for Sparse-Stochastic Hybrid TDDFT

    Authors: Kyle Chen, Barry Y. Li, Tucker Allen, Daniel Neuhauser

    Abstract: We present a mixed basis-set approach to obtain optical absorption spectra within a generalized Kohn-Sham time-dependent density functional theory framework. All occupied valence molecular orbitals (MOs) are expanded in a plane-wave (PW) basis, while unoccupied MOs are derived primarily from localized atomic basis functions. The method accelerates spectral convergence when compared to fully PW-bas… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  43. arXiv:2506.17475  [pdf, ps, other

    cs.LG

    A geometric framework for momentum-based optimizers for low-rank training

    Authors: Steffen Schotthöfer, Timon Klein, Jonas Kusch

    Abstract: Low-rank pre-training and fine-tuning have recently emerged as promising techniques for reducing the computational and storage costs of large neural networks. Training low-rank parameterizations typically relies on conventional optimizers such as heavy ball momentum methods or Adam. In this work, we identify and analyze potential difficulties that these training methods encounter when used to trai… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  44. arXiv:2506.17469  [pdf

    cs.CV

    Photogranulometry -- Dataset of soil images with corresponding particle size distributions

    Authors: Thomas Plante St-Cyr, François Duhaime, Jean-Sébastien Dubé, Simon Grenier

    Abstract: Traditional particle size distribution (PSD) analyses create significant downtime and are expensive in labor and maintenance. These drawbacks could be alleviated using optical grain size analysis integrated into routine geotechnical laboratory workflow. This paper presents a high-resolution dataset of 12,714 images of 321 different soil samples collected in the Montreal, Quebec region, alongside t… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 8 pages, 10 figures, conference

    ACM Class: I.5.4; I.2.10

  45. arXiv:2506.17460  [pdf, ps, other

    cs.FL cs.LO

    Automata on $S$-adic words

    Authors: Valérie Berthé, Toghrul Karimov, Mihir Vahanwala

    Abstract: A fundamental question in logic and verification is the following: for which unary predicates $P_1, \ldots, P_k$ is the monadic second-order theory of $\langle \mathbb{N}; <, P_1, \ldots, P_k \rangle$ decidable? Equivalently, for which infinite words $α$ can we decide whether a given Büchi automaton $A$ accepts $α$? Carton and Thomas showed decidability in case $α$ is a fixed point of a letter-to-… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  46. arXiv:2506.17455  [pdf, ps, other

    cs.CV

    AQUA20: A Benchmark Dataset for Underwater Species Classification under Challenging Conditions

    Authors: Taufikur Rahman Fuad, Sabbir Ahmed, Shahriar Ivan

    Abstract: Robust visual recognition in underwater environments remains a significant challenge due to complex distortions such as turbidity, low illumination, and occlusion, which severely degrade the performance of standard vision systems. This paper introduces AQUA20, a comprehensive benchmark dataset comprising 8,171 underwater images across 20 marine species reflecting real-world environmental challenge… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: Submitted to AJSE Springer

  47. arXiv:2506.17451  [pdf, ps, other

    cs.DB

    Transient Concepts in Streaming Graphs

    Authors: Aida Sheshbolouki, M. Tamer Ozsu

    Abstract: Concept Drift (CD) occurs when a change in a hidden context can induce changes in a target concept. CD is a natural phenomenon in non-stationary settings such as data streams. Understanding, detection, and adaptation to CD in streaming data is (i) vital for effective and efficient analytics as reliable output depends on adaptation to fresh input, (ii) challenging as it requires efficient operation… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  48. arXiv:2506.17444  [pdf, ps, other

    math.PR

    The extinction of the contact process in a one-dimensional random environment with long-range interactions

    Authors: Pablo A. Gomes, Marcelo R. Hilário, Bernardo N. B. de Lima, Thomas Mountford

    Abstract: We study the contact process on the long-range percolation cluster on $\mathbb{Z}$ where each edge $\langle i,j \rangle$ is open with probability $|i-j|^{-s}$ for $s> 2$. Using a renormalization procedure we apply Peierls-type argument to prove that the contact process dies out if the transmission rate is smaller than a critical threshold. Our methods involve the control of crossing probabilities… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    MSC Class: 60K35; 82B43

  49. arXiv:2506.17434  [pdf, ps, other

    cs.AI

    Resource Rational Contractualism Should Guide AI Alignment

    Authors: Sydney Levine, Matija Franklin, Tan Zhi-Xuan, Secil Yanik Guyot, Lionel Wong, Daniel Kilov, Yejin Choi, Joshua B. Tenenbaum, Noah Goodman, Seth Lazar, Iason Gabriel

    Abstract: AI systems will soon have to navigate human environments and make decisions that affect people and other AI agents whose goals and values diverge. Contractualist alignment proposes grounding those decisions in agreements that diverse stakeholders would endorse under the right conditions, yet securing such agreement at scale remains costly and slow -- even for advanced AI. We therefore propose Reso… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 24 pages, 10 figures

  50. arXiv:2506.17423  [pdf, ps, other

    physics.atom-ph physics.optics

    A Liquid-Nitrogen-Cooled Ca+ Ion Optical Clock with a Systematic Uncertainty of 4.6E-19

    Authors: Baolin Zhang, Zixiao Ma, Yao Huang, Huili Han, Ruming Hu, Yuzhuo Wang, Huaqing Zhang, Liyan Tang, Tingyun Shi, Hua Guan, Kein Gao

    Abstract: We report a single-ion optical clock based on the 4S_1/2-3D_5/2 transition of the 40Ca+ ion, operated in a liquid nitrogen cryogenic environment,achieving a total systematic uncertainty of 4.6E-19. We employ a refined temperature evaluation scheme to reduce the frequency uncertainty due to blackbody radiation (BBR), and the 3D sideband cooling has been implemented to minimize the second-order Dopp… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 6 pages, 3 figures