Machine Learning

Authors and titles for recent submissions

Total of 781 entries

Tue, 26 Aug 2025 (continued, showing last 276 of 277 entries)

[376] arXiv:2508.18244 [pdf, other]
Title: Type-Compliant Adaptation Cascades: Adapting Programmatic LM Workflows to Data
Chu-Cheng Lin, Daiyi Peng, Yifeng Lu, Ming Zhang, Eugene Ie
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[377] arXiv:2508.18225 [pdf, html, other]
Title: Deep Learning and Matrix Completion-aided IoT Network Localization in the Outlier Scenarios
Sunwoo Kim
Comments: 4 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[378] arXiv:2508.18196 [pdf, html, other]
Title: HypER: Hyperbolic Echo State Networks for Capturing Stretch-and-Fold Dynamics in Chaotic Flows
Pradeep Singh, Sutirtha Ghosh, Ashutosh Kumar, Hrishit B P, Balasubramanian Raman
Comments: 8 pages, accepted in ECAI 2025
Subjects: Machine Learning (cs.LG)
[379] arXiv:2508.18182 [pdf, html, other]
Title: AdLoCo: adaptive batching significantly improves communications efficiency and convergence for Large Language Models
Nikolay Kutuzov, Makar Baderko, Stepan Kulibaba, Artem Dzhalilov, Daniel Bobrov, Maxim Mashtaler, Alexander Gasnikov
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[380] arXiv:2508.18175 [pdf, html, other]
Title: Amortized Sampling with Transferable Normalizing Flows
Charlie B. Tan, Majdi Hassan, Leon Klein, Saifuddin Syed, Dominique Beaini, Michael M. Bronstein, Alexander Tong, Kirill Neklyudov
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[381] arXiv:2508.18173 [pdf, html, other]
Title: Unveiling the Actual Performance of Neural-based Models for Equation Discovery on Graph Dynamical Systems
Riccardo Cappi, Paolo Frazzetto, Nicolò Navarin, Alessandro Sperduti
Comments: Preprint. Under Review
Subjects: Machine Learning (cs.LG)
[382] arXiv:2508.18130 [pdf, html, other]
Title: Frozen in Time: Parameter-Efficient Time Series Transformers via Reservoir-Induced Feature Expansion and Fixed Random Dynamics
Pradeep Singh, Mehak Sharma, Anupriya Dey, Balasubramanian Raman
Comments: 8 pages, 5 tables, 3 figures, accepted at ECAI 2025
Subjects: Machine Learning (cs.LG)
[383] arXiv:2508.18124 [pdf, other]
Title: CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics
Weida Wang, Dongchen Huang, Jiatong Li, Tengchao Yang, Ziyang Zheng, Di Zhang, Dong Han, Benteng Chen, Binzhao Luo, Zhiyu Liu, Kunling Liu, Zhiyuan Gao, Shiqi Geng, Wei Ma, Jiaming Su, Xin Li, Shuchen Pu, Yuhan Shui, Qianjia Cheng, Zhihao Dou, Dongfei Cui, Changyong He, Jin Zeng, Zeke Xie, Mao Su, Dongzhan Zhou, Yuqiang Li, Wanli Ouyang, Yunqi Cai, Xi Dai, Shufei Zhang, Lei Bai, Jinguang Cheng, Zhong Fang, Hongming Weng
Comments: 29 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[384] arXiv:2508.18122 [pdf, html, other]
Title: Provable Mixed-Noise Learning with Flow-Matching
Paul Hagemann, Robert Gruhlke, Bernhard Stankewitz, Claudia Schillings, Gabriele Steidl
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[385] arXiv:2508.18085 [pdf, other]
Title: Quantum-Classical Hybrid Framework for Zero-Day Time-Push GNSS Spoofing Detection
Abyad Enan, Mashrur Chowdhury, Sagar Dasgupta, Mizanur Rahman
Comments: This work has been submitted to the IEEE Internet of Things Journal for possible publication
Subjects: Machine Learning (cs.LG)
[386] arXiv:2508.18060 [pdf, html, other]
Title: FedGreed: A Byzantine-Robust Loss-Based Aggregation Method for Federated Learning
Emmanouil Kritharakis, Antonios Makris, Dusan Jakovetic, Konstantinos Tserpes
Comments: 8 pages, 4 figures
Subjects: Machine Learning (cs.LG)
[387] arXiv:2508.18052 [pdf, html, other]
Title: Weisfeiler-Lehman meets Events: An Expressivity Analysis for Continuous-Time Dynamic Graph Neural Networks
Silvia Beddar-Wiesing, Alice Moallemy-Oureh
Subjects: Machine Learning (cs.LG)
[388] arXiv:2508.18051 [pdf, html, other]
Title: Training Transformers for Mesh-Based Simulations
Paul Garnier, Vincent Lannelongue, Jonathan Viquerat, Elie Hachem
Subjects: Machine Learning (cs.LG)
[389] arXiv:2508.18045 [pdf, html, other]
Title: Riemannian Change Point Detection on Manifolds with Robust Centroid Estimation
Xiuheng Wang, Ricardo Borsoi, Arnaud Breloy, Cédric Richard
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[390] arXiv:2508.18037 [pdf, html, other]
Title: Enhancing Differentially Private Linear Regression via Public Second-Moment
Zilong Cao (1), Hai Zhang (1) ((1) The School of Mathematics, Northwest University)
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[391] arXiv:2508.18025 [pdf, html, other]
Title: AQ-PCDSys: An Adaptive Quantized Planetary Crater Detection System for Autonomous Space Exploration
Aditri Paul, Archan Paul
Comments: 17 pages, 6 figures. A research paper on a novel deep learning framework for planetary crater detection
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Systems and Control (eess.SY)
[392] arXiv:2508.18019 [pdf, html, other]
Title: Does simple trump complex? Comparing strategies for adversarial robustness in DNNs
William Brooks, Marelie H. Davel, Coenraad Mouton
Subjects: Machine Learning (cs.LG)
[393] arXiv:2508.18001 [pdf, other]
Title: A Novel Framework for Uncertainty Quantification via Proper Scores for Classification and Beyond
Sebastian G. Gruber
Comments: PhD Thesis (cumulative, spanning 6 peer-reviewed publications)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[394] arXiv:2508.17995 [pdf, html, other]
Title: Topology Aware Neural Interpolation of Scalar Fields
Mohamed Kissi, Keanu Sisouk, Joshua A. Levine, Julien Tierny
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[395] arXiv:2508.17957 [pdf, html, other]
Title: Generative Feature Imputing -- A Technique for Error-resilient Semantic Communication
Jianhao Huang, Qunsong Zeng, Hongyang Du, Kaibin Huang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[396] arXiv:2508.17954 [pdf, html, other]
Title: Choice Outweighs Effort: Facilitating Complementary Knowledge Fusion in Federated Learning via Re-calibration and Merit-discrimination
Ming Yang, Dongrun Li, Xin Wang, Xiaoyang Yu, Xiaoming Wu, Shibo He
Subjects: Machine Learning (cs.LG)
[397] arXiv:2508.17930 [pdf, other]
Title: Learning to Detect Label Errors by Making Them: A Method for Segmentation and Object Detection Datasets
Sarina Penquitt, Tobias Riedlinger, Timo Heller, Markus Reischl, Matthias Rottmann
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[398] arXiv:2508.17901 [pdf, html, other]
Title: Riemannian Optimization for LoRA on the Stiefel Manifold
Juneyoung Park, Minjae Kang, Seongbae Lee, Haegang Lee, Seongwan Kim, Jaeho Lee
Comments: EMNLP 2025 Findings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[399] arXiv:2508.17872 [pdf, html, other]
Title: Spectrum Prediction in the Fractional Fourier Domain with Adaptive Filtering
Yanghao Qin, Bo Zhou, Guangliang Pan, Qihui Wu, Meixia Tao
Comments: Accepted by IEEE Wireless Communications Letters
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT)
[400] arXiv:2508.17867 [pdf, html, other]
Title: Ada-TransGNN: An Air Quality Prediction Model Based On Adaptive Graph Convolutional Networks
Dan Wang, Feng Jiang, Zhanquan Wang
Comments: 15 pages, 4 figures, 3 tables. This paper is accepted by ICONIP 2025 but not published
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[401] arXiv:2508.17850 [pdf, html, other]
Title: Group Expectation Policy Optimization for Stable Heterogeneous Reinforcement Learning in LLMs
Han Zhang, Ruibin Zheng, Zexuan Yi, Hanyang Peng, Hui Wang, Yue Yu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[402] arXiv:2508.17822 [pdf, html, other]
Title: Limits of message passing for node classification: How class-bottlenecks restrict signal-to-noise ratio
Jonathan Rubin, Sahil Loomba, Nick S. Jones
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)
[403] arXiv:2508.17821 [pdf, html, other]
Title: Limitations of Normalization in Attention Mechanism
Timur Mudarisov, Mikhail Burtsev, Tatiana Petrova, Radu State
Comments: 10 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[404] arXiv:2508.17815 [pdf, html, other]
Title: Multi-domain Distribution Learning for De Novo Drug Design
Arne Schneuing, Ilia Igashov, Adrian W. Dobbelstein, Thomas Castiglione, Michael Bronstein, Bruno Correia
Journal-ref: ICLR 2025: https://openreview.net/forum?id=g3VCIM94ke
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[405] arXiv:2508.17784 [pdf, html, other]
Title: Proximal Supervised Fine-Tuning
Wenhong Zhu, Ruobing Xie, Rui Wang, Xingwu Sun, Di Wang, Pengfei Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[406] arXiv:2508.17764 [pdf, html, other]
Title: Puzzle: Scheduling Multiple Deep Learning Models on Mobile Device with Heterogeneous Processors
Duseok Kang, Yunseong Lee, Junghoon Kim
Subjects: Machine Learning (cs.LG); Operating Systems (cs.OS)
[407] arXiv:2508.17761 [pdf, html, other]
Title: Evaluating the Quality of the Quantified Uncertainty for (Re)Calibration of Data-Driven Regression Models
Jelke Wibbeke, Nico Schönfisch, Sebastian Rohjans, Andreas Rauh
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[408] arXiv:2508.17756 [pdf, html, other]
Title: SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling
Fanjiang Ye, Zepeng Zhao, Yi Mu, Jucheng Shen, Renjie Li, Kaijian Wang, Desen Sun, Saurabh Agarwal, Myungjin Lee, Triston Cao, Aditya Akella, Arvind Krishnamurthy, T.S. Eugene Ng, Zhengzhong Tu, Yuke Wang
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[409] arXiv:2508.17751 [pdf, html, other]
Title: Multi-layer Abstraction for Nested Generation of Options (MANGO) in Hierarchical Reinforcement Learning
Alessio Arcudi, Davide Sartor, Alberto Sinigaglia, Vincent François-Lavet, Gian Antonio Susto
Subjects: Machine Learning (cs.LG)
[410] arXiv:2508.17744 [pdf, html, other]
Title: Randomly Removing 50% of Dimensions in Text Embeddings has Minimal Impact on Retrieval and Classification Tasks
Sotaro Takeshita, Yurina Takeshita, Daniel Ruffinelli, Simone Paolo Ponzetto
Comments: Accepted to EMNLP 2025 Main Conference, submitted version
Subjects: Machine Learning (cs.LG)
[411] arXiv:2508.17739 [pdf, html, other]
Title: Speculative Safety-Aware Decoding
Xuekang Wang, Shengyu Zhu, Xueqi Cheng
Comments: EMNLP'2025 main conference; more experiments will be added to the coming camera-ready version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[412] arXiv:2508.17702 [pdf, html, other]
Title: Copyright Protection for 3D Molecular Structures with Watermarking
Runwen Hu, Peilin Chen, Keyan Ding, Shiqi Wang
Subjects: Machine Learning (cs.LG)
[413] arXiv:2508.17700 [pdf, other]
Title: Adaptive Ensemble Learning with Gaussian Copula for Load Forecasting
Junying Yang, Gang Lu, Xiaoqing Yan, Peng Xia, Di Wu
Subjects: Machine Learning (cs.LG)
[414] arXiv:2508.17697 [pdf, html, other]
Title: Rethinking Federated Learning Over the Air: The Blessing of Scaling Up
Jiaqi Zhu, Bikramjit Das, Yong Xie, Nikolaos Pappas, Howard H. Yang
Subjects: Machine Learning (cs.LG)
[415] arXiv:2508.17689 [pdf, other]
Title: On the Edge of Memorization in Diffusion Models
Sam Buchanan, Druv Pai, Yi Ma, Valentin De Bortoli
Comments: 10 main body pages, 43 total pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[416] arXiv:2508.17681 [pdf, html, other]
Title: Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery
Robert Yang
Comments: 6 pages. NeurIPS 2025 AI4Science Workshop submission
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[417] arXiv:2508.17680 [pdf, html, other]
Title: Robustness Feature Adapter for Efficient Adversarial Training
Quanwei Wu, Jun Guo, Wei Wang, Yi Wang
Comments: The paper has been accepted for presentation at ECAI 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[418] arXiv:2508.17679 [pdf, html, other]
Title: Characterizing the Behavior of Training Mamba-based State Space Models on GPUs
Trinayan Baruah, Kaustubh Shivdikar, Sara Prescott, David Kaeli
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[419] arXiv:2508.17677 [pdf, html, other]
Title: TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
Yifan Wang, Binbin Liu, Fengze Liu, Yuanfan Guo, Jiyao Deng, Xuecheng Wu, Weidong Zhou, Xiaohuan Zhou, Taifeng Wang
Subjects: Machine Learning (cs.LG)
[420] arXiv:2508.17675 [pdf, html, other]
Title: Towards Synthesizing Normative Data for Cognitive Assessments Using Generative Multimodal Large Language Models
Victoria Yan, Honor Chotkowski, Fengran Wang, Alex Fedorov
Comments: Preprint
Subjects: Machine Learning (cs.LG)
[421] arXiv:2508.17663 [pdf, html, other]
Title: Heterogeneous co-occurrence embedding for visual information exploration
Takuro Ishida, Tetsuo Furukawa
Comments: 36 pages, 9 figures, Accepted to International Journal of Innovative Computing, Information and Control (IJICIC), 2025
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR)
[422] arXiv:2508.17649 [pdf, html, other]
Title: Longitudinal Progression Prediction of Alzheimer's Disease with Tabular Foundation Model
Yilang Ding, Jiawen Ren, Jiaying Lu, Gloria Hyunjung Kwak, Armin Iraji, Alex Fedorov
Subjects: Machine Learning (cs.LG)
[423] arXiv:2508.17631 [pdf, html, other]
Title: ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion
Nima Kondori, Hanwen Liang, Hooman Vaseli, Bingyu Xie, Christina Luong, Purang Abolmaesumi, Teresa Tsang, Renjie Liao
Comments: Data Curation and Augmentation in Medical Imaging CVPR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[424] arXiv:2508.17630 [pdf, html, other]
Title: Quantum Graph Attention Network: A Novel Quantum Multi-Head Attention Mechanism for Graph Learning
An Ning, Tai Yue Li, Nan Yow Chen
Subjects: Machine Learning (cs.LG)
[425] arXiv:2508.17609 [pdf, other]
Title: A Proportional-Integral Controller-Incorporated SGD Algorithm for High Efficient Latent Factor Analysis
Jinli Li, Shiyu Long, Minglian Han
Subjects: Machine Learning (cs.LG)
[426] arXiv:2508.17608 [pdf, html, other]
Title: ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning
Wentao Tan, Qiong Cao, Chao Xue, Yibing Zhan, Changxing Ding, Xiaodong He
Subjects: Machine Learning (cs.LG)
[427] arXiv:2508.17586 [pdf, html, other]
Title: Exploring Efficient Learning of Small BERT Networks with LoRA and DoRA
Daniel Frees, Aditri Bhagirath, Moritz Bolling
Subjects: Machine Learning (cs.LG)
[428] arXiv:2508.17554 [pdf, html, other]
Title: Bridging Graph and State-Space Modeling for Intensive Care Unit Length of Stay Prediction
Shuqi Zi, Haitz Sáez de Ocáriz Borde, Emma Rocheteau, Pietro Lio'
Subjects: Machine Learning (cs.LG)
[429] arXiv:2508.17550 [pdf, html, other]
Title: In-Context Algorithm Emulation in Fixed-Weight Transformers
Jerry Yao-Chieh Hu, Hude Liu, Jennifer Yuntong Zhang, Han Liu
Comments: Code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[430] arXiv:2508.17540 [pdf, other]
Title: Activation Transport Operators
Andrzej Szablewski, Marek Masiak
Comments: 4 pages, 4 figures, references and appendices
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[431] arXiv:2508.17531 [pdf, html, other]
Title: Gumbel-MPNN: Graph Rewiring with Gumbel-Softmax
Marcel Hoffmann, Lukas Galke, Ansgar Scherp
Subjects: Machine Learning (cs.LG)
[432] arXiv:2508.17521 [pdf, html, other]
Title: Modeling Irregular Astronomical Time Series with Neural Stochastic Delay Differential Equations
YongKyung Oh, Seungsu Kam, Dong-Young Lim, Sungil Kim
Subjects: Machine Learning (cs.LG)
[433] arXiv:2508.17519 [pdf, html, other]
Title: TANDEM: Temporal Attention-guided Neural Differential Equations for Missingness in Time Series Classification
YongKyung Oh, Dong-Young Lim, Sungil Kim, Alex Bui
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[434] arXiv:2508.17515 [pdf, html, other]
Title: GateTS: Versatile and Efficient Forecasting via Attention-Inspired routed Mixture-of-Experts
Kyrylo Yemets, Mykola Lukashchuk, Ivan Izonin
Subjects: Machine Learning (cs.LG)
[435] arXiv:2508.17512 [pdf, html, other]
Title: Learning Interpretable Differentiable Logic Networks for Time-Series Classification
Chang Yue, Niraj K. Jha
Subjects: Machine Learning (cs.LG)
[436] arXiv:2508.17497 [pdf, html, other]
Title: Multimodal Representation Learning Conditioned on Semantic Relations
Yang Qiao, Yuntong Hu, Liang Zhao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[437] arXiv:2508.17477 [pdf, html, other]
Title: A Human-In-The-Loop Approach for Improving Fairness in Predictive Business Process Monitoring
Martin Käppel, Julian Neuberger, Felix Möhrlein, Sven Weinzierl, Martin Matzner, Stefan Jablonski
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[438] arXiv:2508.17467 [pdf, html, other]
Title: MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Krishna Teja Chitty-Venkata, Sylvia Howland, Golara Azar, Daria Soboleva, Natalia Vassilieva, Siddhisanket Raskar, Murali Emani, Venkatram Vishwanath
Comments: Preprint
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[439] arXiv:2508.17456 [pdf, html, other]
Title: Adversarial Examples Are Not Bugs, They Are Superposition
Liv Gorton, Owen Lewis
Subjects: Machine Learning (cs.LG)
[440] arXiv:2508.17455 [pdf, other]
Title: A Systematic Literature Review on Multi-label Data Stream Classification
H. Freire-Oliveira, E. R. F. Paiva, J. Gama, L. Khan, R. Cerri
Comments: 48 pages, 12 figures
Subjects: Machine Learning (cs.LG)
[441] arXiv:2508.17452 [pdf, html, other]
Title: ReviBranch: Deep Reinforcement Learning for Branch-and-Bound with Revived Trajectories
Dou Jiabao, Nie Jiayi, Yihang Cheng, Jinwei Liu, Yingrui Ji, Canran Xiao, Feixiang Du, Jiaping Xiao
Comments: conference
Subjects: Machine Learning (cs.LG)
[442] arXiv:2508.17448 [pdf, html, other]
Title: Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Shaocong Ma, Ziyi Chen, Yi Zhou, Heng Huang
Subjects: Machine Learning (cs.LG)
[443] arXiv:2508.17445 [pdf, html, other]
Title: TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
Yizhi Li, Qingshui Gu, Zhoufutu Wen, Ziniu Li, Tianshun Xing, Shuyue Guo, Tianyu Zheng, Xin Zhou, Xingwei Qu, Wangchunshu Zhou, Zheng Zhang, Wei Shen, Qian Liu, Chenghua Lin, Jian Yang, Ge Zhang, Wenhao Huang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[444] arXiv:2508.17426 [pdf, html, other]
Title: Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling
Haochen You, Baojing Liu, Hongyang He
Comments: Accepted as a conference paper at PRCV 2025
Subjects: Machine Learning (cs.LG)
[445] arXiv:2508.17412 [pdf, html, other]
Title: Convergence and Generalization of Anti-Regularization for Parametric Models
Dongseok Kim, Wonjun Jeong, Gisung Oh
Comments: 39 pages, 1 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[446] arXiv:2508.17405 [pdf, html, other]
Title: FRAME : Comprehensive Risk Assessment Framework for Adversarial Machine Learning Threats
Avishag Shapira, Simon Shigol, Asaf Shabtai
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[447] arXiv:2508.17403 [pdf, html, other]
Title: Mutual Information Surprise: Rethinking Unexpectedness in Autonomous Systems
Yinsong Wang, Xiao Liu, Quan Zeng, Yu Ding
Comments: Pre-Submission Version
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[448] arXiv:2508.17400 [pdf, html, other]
Title: Retrieval Capabilities of Large Language Models Scale with Pretraining FLOPs
Jacob Portes, Connor Jennings, Erica Ji Yuen, Sasha Doubov, Michael Carbin
Comments: 15 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[449] arXiv:2508.17388 [pdf, other]
Title: Effective Clustering for Large Multi-Relational Graphs
Xiaoyang Lin, Runhao Jiang, Renchi Yang
Comments: 23 pages. The technical report for the paper titled "Effective Clustering for Large Multi-Relational Graphs" in SIGMOD 2026
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Social and Information Networks (cs.SI)
[450] arXiv:2508.17387 [pdf, other]
Title: Graph-R1: Incentivizing the Zero-Shot Graph Learning Capability in LLMs via Explicit Reasoning
Yicong Wu, Guangyue Lu, Yuan Zuo, Huarong Zhang, Junjie Wu
Comments: Accepted at EMNLP 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[451] arXiv:2508.17381 [pdf, html, other]
Title: FedERL: Federated Efficient and Robust Learning for Common Corruptions
Omar Bekdache, Naresh Shanbhag
Subjects: Machine Learning (cs.LG)
[452] arXiv:2508.17376 [pdf, html, other]
Title: ShaLa: Multimodal Shared Latent Space Modelling
Jiali Cui, Yan-Ying Chen, Yanxia Zhang, Matthew Klenk
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[453] arXiv:2508.17361 [pdf, other]
Title: Trust Me, I Know This Function: Hijacking LLM Static Analysis using Bias
Shir Bernstein, David Beste, Daniel Ayzenshteyn, Lea Schonherr, Yisroel Mirsky
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[454] arXiv:2508.17345 [pdf, html, other]
Title: ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation
Yuxuan Song, Zhe Zhang, Yu Pei, Jingjing Gong, Qiying Yu, Zheng Zhang, Mingxuan Wang, Hao Zhou, Jingjing Liu, Wei-Ying Ma
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN)
[455] arXiv:2508.17341 [pdf, html, other]
Title: MetaFed: Advancing Privacy, Performance, and Sustainability in Federated Metaverse Systems
Muhammet Anil Yagiz, Zeynep Sude Cengiz, Polat Goktas
Comments: 2025 IEEE International Symposium on Emerging Metaverse (ISEMV)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY); Distributed, Parallel, and Cluster Computing (cs.DC); Emerging Technologies (cs.ET)
[456] arXiv:2508.17323 [pdf, html, other]
Title: Is the Frequency Principle always valid?
Qijia Zhai
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[457] arXiv:2508.17320 [pdf, html, other]
Title: AdaptiveK Sparse Autoencoders: Dynamic Sparsity Allocation for Interpretable LLM Representations
Yifei Yao, Mengnan Du
Subjects: Machine Learning (cs.LG)
[458] arXiv:2508.17303 [pdf, other]
Title: Physics-informed neural network for fatigue life prediction of irradiated austenitic and ferritic/martensitic steels
Dhiraj S Kori, Abhinav Chandraker, Syed Abdur Rahman, Punit Rathore, Ankur Chauhan
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[459] arXiv:2508.17294 [pdf, html, other]
Title: Explainable AI (XAI) for Arrhythmia detection from electrocardiograms
Joschka Beck, Arlene John
Subjects: Machine Learning (cs.LG)
[460] arXiv:2508.17278 [pdf, other]
Title: DeepCFD: Efficient near-ground airfoil lift coefficient approximation with deep convolutional neural networks
Mohammad Amin Esabat, Saeed Jaamei, Fatemeh Asadi
Subjects: Machine Learning (cs.LG)
[461] arXiv:2508.17256 [pdf, html, other]
Title: Provable Generalization in Overparameterized Neural Nets
Aviral Dhingra
Comments: 8 Pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[462] arXiv:2508.17233 [pdf, html, other]
Title: Module-Aware Parameter-Efficient Machine Unlearning on Transformers
Wenjie Bao, Jian Lou, Yuke Hu, Xiaochen Li, Zhihao Liu, Jiaqi Liu, Zhan Qin, Kui Ren
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[463] arXiv:2508.17232 [pdf, html, other]
Title: Curvature Learning for Generalization of Hyperbolic Neural Networks
Xiaomeng Fan, Yuwei Wu, Zhi Gao, Mehrtash Harandi, Yunde Jia
Comments: Accepted by International Journal of Computer Vision (IJCV)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[464] arXiv:2508.17218 [pdf, html, other]
Title: GPG-HT: Generalized Policy Gradient with History-Aware Decision Transformer for Probabilistic Path Planning
Xing Wei, Yuqi Ouyang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[465] arXiv:2508.17215 [pdf, html, other]
Title: How to make Medical AI Systems safer? Simulating Vulnerabilities, and Threats in Multimodal Medical RAG System
Kaiwen Zuo, Zelin Liu, Raman Dutt, Ziyang Wang, Zhongtian Sun, Yeming Wang, Fan Mo, Pietro Liò
Comments: Submitted to 2025 AAAI main track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[466] arXiv:2508.17196 [pdf, html, other]
Title: BudgetThinker: Empowering Budget-aware LLM Reasoning with Control Tokens
Hao Wen, Xinrui Wu, Yi Sun, Feifei Zhang, Liye Chen, Jie Wang, Yunxin Liu, Ya-Qin Zhang, Yuanchun Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[467] arXiv:2508.17182 [pdf, html, other]
Title: LLM Assertiveness can be Mechanistically Decomposed into Emotional and Logical Components
Hikaru Tsujimura, Arush Tagade
Comments: This preprint is under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[468] arXiv:2508.17175 [pdf, html, other]
Title: Scaling Graph Transformers: A Comparative Study of Sparse and Dense Attention
Leon Dimitrov
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[469] arXiv:2508.17174 [pdf, html, other]
Title: Sharpness-Aware Geometric Defense for Robust Out-Of-Distribution Detection
Jeng-Lin Li, Ming-Ching Chang, Wei-Chao Chen
Comments: under review
Subjects: Machine Learning (cs.LG)
[470] arXiv:2508.17169 [pdf, html, other]
Title: ONG: Orthogonal Natural Gradient Descent
Yajat Yadav, Jathin Korrapati, Patrick Mendoza
Comments: Code at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[471] arXiv:2508.17158 [pdf, other]
Title: Towards Safeguarding LLM Fine-tuning APIs against Cipher Attacks
Jack Youstra, Mohammed Mahfoud, Yang Yan, Henry Sleight, Ethan Perez, Mrinank Sharma
Subjects: Machine Learning (cs.LG)
[472] arXiv:2508.17150 [pdf, html, other]
Title: SACA: Selective Attention-Based Clustering Algorithm
Meysam Shirdel Bilehsavar, Razieh Ghaedi, Samira Seyed Taheri, Xinqi Fan, Christian O'Reilly
Comments: 22 pages, 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[473] arXiv:2508.17144 [pdf, html, other]
Title: Stochastic Gradient Descent with Strategic Querying
Nanfei Jiang, Hoi-To Wai, Mahnoosh Alizadeh
Comments: 18 pages, 2 figures. Accepted to IEEE Conference on Decision and Control (CDC) 2025. Includes appendix and supplementary discussion
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[474] arXiv:2508.17137 [pdf, html, other]
Title: MoE-Beyond: Learning-Based Expert Activation Prediction on Edge Devices
Nishant Gavhane, Arush Mehrotra, Rohit Chawla, Peter Proenca
Subjects: Machine Learning (cs.LG)
[475] arXiv:2508.17129 [pdf, other]
Title: Reconciling Communication Compression and Byzantine-Robustness in Distributed Learning
Diksha Gupta, Nirupam Gupta, Chuan Xu, Giovanni Neglia
Comments: 78 Pages, 1 figure
Subjects: Machine Learning (cs.LG)
[476] arXiv:2508.17097 [pdf, html, other]
Title: Two Birds with One Stone: Enhancing Uncertainty Quantification and Interpretability with Graph Functional Neural Process
Lingkai Kong, Haotian Sun, Yuchen Zhuang, Haorui Wang, Wenhao Mu, Chao Zhang
Comments: AISTATS'25
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[477] arXiv:2508.17096 [pdf, html, other]
Title: Convolutional Neural Networks for Accurate Measurement of Train Speed
Haitao Tian, Argyrios Zolotas, Miguel Arana-Catania
Comments: 15 pages, 12 figures, 2 tables. Proceedings of the Institution of Mechanical Engineers, Part F: Journal of Rail and Rapid Transit
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[478] arXiv:2508.17083 [pdf, html, other]
Title: Learning ON Large Datasets Using Bit-String Trees
Prashant Gupta
Comments: PhD thesis
Subjects: Machine Learning (cs.LG)
[479] arXiv:2508.17056 [pdf, html, other]
Title: TabResFlow: A Normalizing Spline Flow Model for Probabilistic Univariate Tabular Regression
Kiran Madhusudhanan, Vijaya Krishna Yalavarthi, Jonas Sonntag, Maximilian Stubbemann, Lars Schmidt-Thieme
Comments: To be published in The European Conference on Artificial Intelligence, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[480] arXiv:2508.17032 [pdf, other]
Title: Learned Structure in CARTRIDGES: Keys as Shareable Routers in Self-Studied Representations
Maurizio Diaz
Subjects: Machine Learning (cs.LG)
[481] arXiv:2508.16992 [pdf, html, other]
Title: Online Learning for Approximately-Convex Functions with Long-term Adversarial Constraints
Dhruv Sarkar, Samrat Mukhopadhyay, Abhishek Sinha
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[482] arXiv:2508.16989 [pdf, other]
Title: Unveiling the Latent Directions of Reflection in Large Language Models
Fu-Chieh Chang, Yu-Ting Lee, Pei-Yuan Wu
Subjects: Machine Learning (cs.LG)
[483] arXiv:2508.16950 [pdf, html, other]
Title: Disentangling Polysemantic Neurons with a Null-Calibrated Polysemanticity Index and Causal Patch Interventions
Manan Gupta, Dhruv Kumar
Comments: Under review. 13 pages
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[484] arXiv:2508.16949 [pdf, other]
Title: Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
Yang Zhou, Sunzhu Li, Shunyu Liu, Wenkai Fang, Jiale Zhao, Jingwen Yang, Jianwei Lv, Kongcheng Zhang, Yihe Zhou, Hengtong Lu, Wei Chen, Yan Xie, Mingli Song
Comments: This work is still in progress
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[485] arXiv:2508.16939 [pdf, html, other]
Title: Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter
Lei Jiang, Wen Ge, Niels Cariou-Kotlarek, Mingxuan Yi, Po-Yu Chen, Lingyi Yang, Francois Buet-Golfouse, Gaurav Mittal, Hao Ni
Subjects: Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[486] arXiv:2508.16931 [pdf, html, other]
Title: Degree of Staleness-Aware Data Updating in Federated Learning
Tao Liu, Xuehe Wang
Comments: Accepted by the European Conference on Artificial Intelligence
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[487] arXiv:2508.16929 [pdf, html, other]
Title: Attention Layers Add Into Low-Dimensional Residual Subspaces
Junxuan Wang, Xuyang Ge, Wentao Shu, Zhengfu He, Xipeng Qiu
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[488] arXiv:2508.16915 [pdf, html, other]
Title: Reinforcement-Guided Hyper-Heuristic Hyperparameter Optimization for Fair and Explainable Spiking Neural Network-Based Financial Fraud Detection
Sadman Mohammad Nasif, Md Abrar Jahin, M. F. Mridha
Subjects: Machine Learning (cs.LG)
[489] arXiv:2508.16905 [pdf, html, other]
Title: Tri-Accel: Curvature-Aware Precision-Adaptive and Memory-Elastic Optimization for Efficient GPU Usage
Mohsen Sheibanian, Pouya Shaeri, Alimohammad Beigi, Ryan T. Woo, Aryan Keluskar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[490] arXiv:2508.16891 [pdf, html, other]
Title: Quantifying Out-of-Training Uncertainty of Neural-Network based Turbulence Closures
Cody Grogan, Som Dhulipala, Mauricio Tano, Izabela Gutowska, Som Dutta
Subjects: Machine Learning (cs.LG); Fluid Dynamics (physics.flu-dyn)
[491] arXiv:2508.16874 [pdf, html, other]
Title: UM3: Unsupervised Map to Map Matching
Chaolong Ying, Yinan Zhang, Lei Zhang, Jiazhuang Wang, Shujun Jia, Tianshu Yu
Comments: Accepted by ACM SIGSPATIAL 2025
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[492] arXiv:2508.16857 [pdf, html, other]
Title: Neural Contrast Expansion for Explainable Structure-Property Prediction and Random Microstructure Design
Guangyu Nie, Yang Jiao, Yi Ren
Subjects: Machine Learning (cs.LG)
[493] arXiv:2508.16836 [pdf, html, other]
Title: Physics-Inspired Spatial Temporal Graph Neural Networks for Predicting Industrial Chain Resilience
Bicheng Wang, Junping Wang, Yibo Xue
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE)
[494] arXiv:2508.16832 [pdf, html, other]
Title: Out of Distribution Detection for Efficient Continual Learning in Quality Prediction for Arc Welding
Yannik Hahn, Jan Voets, Antonin Koenigsfeld, Hasan Tercan, Tobias Meisen
Comments: Accepted at CIKM 2025 (Applied Research Papers)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[495] arXiv:2508.16829 [pdf, html, other]
Title: Understanding and Tackling Over-Dilution in Graph Neural Networks
Junhyun Lee, Veronika Thost, Bumsoo Kim, Jaewoo Kang, Tengfei Ma
Comments: Extended version of KDD '25 paper. 22 pages including appendix. Conference version: KDD '25 (Toronto, Aug 3-7, 2025), pp. 1253-1261. Code: this https URL
Journal-ref: Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2025), Toronto, Canada, Aug 3-7, 2025, pp. 1253-1261
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[496] arXiv:2508.16815 [pdf, other]
Title: Uncertainty Propagation Networks for Neural Ordinary Differential Equations
Hadi Jahanshahi, Zheng H. Zhu
Subjects: Machine Learning (cs.LG)
[497] arXiv:2508.16802 [pdf, html, other]
Title: Anchor-MoE: A Mean-Anchored Mixture of Experts For Probabilistic Regression
Baozhuo Su, Zhengxian Qu
Subjects: Machine Learning (cs.LG)
[498] arXiv:2508.16785 [pdf, html, other]
Title: Interpreting the Effects of Quantization on LLMs
Manpreet Singh, Hassan Sajjad
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[499] arXiv:2508.16776 [pdf, html, other]
Title: Latent Graph Learning in Generative Models of Neural Signals
Nathan X. Kodama, Kenneth A. Loparo
Subjects: Machine Learning (cs.LG)
[500] arXiv:2508.16769 [pdf, html, other]
Title: DR-CircuitGNN: Training Acceleration of Heterogeneous Circuit Graph Neural Network on GPUs
Yuebo Luo, Shiyang Li, Junran Tao, Kiran Thorat, Xi Xie, Hongwu Peng, Nuo Xu, Caiwen Ding, Shaoyi Huang
Journal-ref: In Proceedings of the 39th ACM International Conference on Supercomputing, 2025, 221-235
Subjects: Machine Learning (cs.LG)
[501] arXiv:2508.16748 [pdf, html, other]
Title: FAIRWELL: Fair Multimodal Self-Supervised Learning for Wellbeing Prediction
Jiaee Cheong, Abtin Mogharabin, Paul Liang, Hatice Gunes, Sinan Kalkan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[502] arXiv:2508.16745 [pdf, html, other]
Title: Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling
Ivan Rodkin, Daniil Orel, Konstantin Smirnov, Arman Bolatov, Bilal Elbouardi, Besher Hassan, Yuri Kuratov, Aydar Bulatov, Preslav Nakov, Timothy Baldwin, Artem Shelmanov, Mikhail Burtsev
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[503] arXiv:2508.16744 [pdf, html, other]
Title: Hyperbolic Multimodal Representation Learning for Biological Taxonomies
ZeMing Gong, Chuanqi Tang, Xiaoliang Huo, Nicholas Pellegrino, Austin T. Wang, Graham W. Taylor, Angel X. Chang, Scott C. Lowe, Joakim Bruslund Haurum
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[504] arXiv:2508.16741 [pdf, html, other]
Title: WST: Weak-to-Strong Knowledge Transfer via Reinforcement Learning
Haosen Ge, Shuo Li, Lianghuan Huang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[505] arXiv:2508.16737 [pdf, html, other]
Title: Deep Learning for Markov Chains: Lyapunov Functions, Poisson's Equation, and Stationary Distributions
Yanlin Qu, Jose Blanchet, Peter Glynn
Subjects: Machine Learning (cs.LG); Probability (math.PR)
[506] arXiv:2508.16734 [pdf, other]
Title: Aligning Distributionally Robust Optimization with Practical Deep Learning Needs
Dmitrii Feoktistov, Igor Ignashin, Andrey Veprikov, Nikita Borovko, Alexander Bogdanov, Savelii Chezhegov, Aleksandr Beznosikov
Comments: 13 pages, 1 table, 4 figures
Subjects: Machine Learning (cs.LG)
[507] arXiv:2508.16702 [pdf, html, other]
Title: A novel auxiliary equation neural networks method for exactly explicit solutions of nonlinear partial differential equations
Shanhao Yuan, Yanqin Liu, Runfa Zhang, Limei Yan, Shunjun Wu, Libo Feng
Subjects: Machine Learning (cs.LG)
[508] arXiv:2508.16687 [pdf, html, other]
Title: Native Logical and Hierarchical Representations with Subspace Embeddings
Gabriel Moreira, Zita Marinho, Manuel Marques, João Paulo Costeira, Chenyan Xiong
Subjects: Machine Learning (cs.LG)
[509] arXiv:2508.16686 [pdf, html, other]
Title: Multidimensional Distributional Neural Network Output Demonstrated in Super-Resolution of Surface Wind Speed
Harrison J. Goldwyn, Mitchell Krock, Johann Rudi, Daniel Getter, Julie Bessac
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[510] arXiv:2508.16685 [pdf, html, other]
Title: STGAtt: A Spatial-Temporal Unified Graph Attention Network for Traffic Flow Forecasting
Zhuding Liang, Jianxun Cui, Qingshuang Zeng, Feng Liu, Nenad Filipovic, Tijana Geroski
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[511] arXiv:2508.16680 [pdf, html, other]
Title: CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression
Muchammad Daniyal Kautsar, Afra Majida Hariono, Widyawan, Syukron Abu Ishaq Alfarozi, Kuntpong Woraratpanya
Comments: Submitted to IEEE Transactions on Artificial Intelligence. This is the preprint version, not peer-reviewed. The final version may differ after peer review. (11 pages, 3 figures)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[512] arXiv:2508.16677 [pdf, html, other]
Title: Recall-Extend Dynamics: Enhancing Small Language Models through Controlled Exploration and Refined Offline Integration
Zhong Guan, Likang Wu, Hongke Zhao, Jiahui Wang, Le Wu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[513] arXiv:2508.16676 [pdf, html, other]
Title: WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling
Jiacheng Li, Jianchao Tan, Zhidong Yang, Pingwei Sun, Feiye Huo, Jiayu Qin, Yerui Sun, Yuchen Xie, Xunliang Cai, Xiangyu Zhang, Maoxin He, Guangming Tan, Weile Jia, Tong Zhao
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[514] arXiv:2508.16656 [pdf, html, other]
Title: OASIS: Open-world Adaptive Self-supervised and Imbalanced-aware System
Miru Kim, Mugon Joe, Minhae Kwon
Comments: Accepted at the 34th ACM International Conference on Information and Knowledge Management (CIKM 2025)
Subjects: Machine Learning (cs.LG)
[515] arXiv:2508.16655 [pdf, other]
Title: A Laplace diffusion-based transformer model for heart rate forecasting within daily activity context
Andrei Mateescu, Ioana Hadarau, Ionut Anghel, Tudor Cioara, Ovidiu Anchidin, Ancuta Nemes
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[516] arXiv:2508.16651 [pdf, html, other]
Title: HiCL: Hippocampal-Inspired Continual Learning
Kushal Kapoor, Wyatt Mackey, Yiannis Aloimonos, Xiaomin Lin
Comments: Submitted to AAAI
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[517] arXiv:2508.16648 [pdf, html, other]
Title: LatentFlow: Cross-Frequency Experimental Flow Reconstruction from Sparse Pressure via Latent Mapping
Junle Liu, Chang Liu, Yanyu Ke, Qiuxiang Huang, Jiachen Zhao, Wenliang Chen, K.T. Tse, Gang Hu
Comments: The paper is submitted to IAAI26. Total 9 pages with 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Fluid Dynamics (physics.flu-dyn)
[518] arXiv:2508.16647 [pdf, html, other]
Title: AdapSNE: Adaptive Fireworks-Optimized and Entropy-Guided Dataset Sampling for Edge DNN Training
Boran Zhao, Hetian Liu, Zihang Yuan, Li Zhu, Fan Yang, Lina Xie, Tian Xia, Wenzhe Zhao, Pengju Ren
Subjects: Machine Learning (cs.LG)
[519] arXiv:2508.16643 [pdf, html, other]
Title: From Classical Probabilistic Latent Variable Models to Modern Generative AI: A Unified Perspective
Tianhua Chen
Comments: This is a substantially improved and expanded version of an earlier manuscript hosted on SSRN: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[520] arXiv:2508.16641 [pdf, html, other]
Title: Enhancing Transformer-Based Foundation Models for Time Series Forecasting via Bagging, Boosting and Statistical Ensembles
Dhruv D. Modi, Rong Pan
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[521] arXiv:2508.16634 [pdf, html, other]
Title: Few-shot Class-incremental Fault Diagnosis by Preserving Class-Agnostic Knowledge with Dual-Granularity Representations
Zhendong Yang, Jie Wang, Liansong Zong, Xiaorong Liu, Quan Qian, Shiqian Chen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[522] arXiv:2508.16633 [pdf, html, other]
Title: A Novel Unified Extended Matrix for Graph Signal Processing: Theory and Application
Yunyan Zheng, Zhichao Zhang, Wei Yao
Subjects: Machine Learning (cs.LG)
[523] arXiv:2508.16632 [pdf, html, other]
Title: Adaptive Variance-Penalized Continual Learning with Fisher Regularization
Krisanu Sarkar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[524] arXiv:2508.16631 [pdf, html, other]
Title: Recurrent Transformer U-Net Surrogate for Flow Modeling and Data Assimilation in Subsurface Formations with Faults
Yifu Han, Louis J. Durlofsky
Subjects: Machine Learning (cs.LG)
[525] arXiv:2508.16629 [pdf, html, other]
Title: Learn to Memorize: Optimizing LLM-based Agents with Adaptive Memory Framework
Zeyu Zhang, Quanyu Dai, Rui Li, Xiaohe Bo, Xu Chen, Zhenhua Dong
Comments: 17 pages, 4 figures, 5 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[526] arXiv:2508.16623 [pdf, html, other]
Title: A Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction
Weilin Ruan, Xilin Dang, Ziyu Zhou, Sisuo Lyu, Yuxuan Liang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[527] arXiv:2508.16620 [pdf, html, other]
Title: STRelay: A Universal Spatio-Temporal Relaying Framework for Location Prediction with Future Spatiotemporal Contexts
Bangchao Deng, Lianhua Ji, Chunhua Chen, Xin Jing, Ling Ding, Bingqing QU, Pengyang Wang, Dingqi Yang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[528] arXiv:2508.16617 [pdf, html, other]
Title: Leveraging the Christoffel Function for Outlier Detection in Data Streams
Kévin Ducharlet, Louise Travé-Massuyès, Jean-Bernard Lasserre, Marie-Véronique Le Lann, Youssef Miloudi
Journal-ref: International Journal of Data Science and Analytics, 2024, p. 1-17
Subjects: Machine Learning (cs.LG)
[529] arXiv:2508.16614 [pdf, html, other]
Title: CrystalDiT: A Diffusion Transformer for Crystal Generation
Xiaohan Yi, Guikun Xu, Xi Xiao, Zhong Zhang, Liu Liu, Yatao Bian, Peilin Zhao
Comments: 18 pages, 18 figures. Code available at this https URL. Updated to remove copyright notice
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[530] arXiv:2508.16611 [pdf, html, other]
Title: Quantum-Inspired DRL Approach with LSTM and OU Noise for Cut Order Planning Optimization
Yulison Herry Chrisnanto, Julian Evan Chrisnanto
Comments: 14 pages, 3 figures, 4 tables
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[531] arXiv:2508.18224 (cross-list from cs.DC) [pdf, html, other]
Title: Flash Sparse Attention: An Alternative Efficient Implementation of Native Sparse Attention Kernel
Ran Yan, Youhe Jiang, Binhang Yuan
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[532] arXiv:2508.18211 (cross-list from q-bio.BM) [pdf, html, other]
Title: Flexibility-Conditioned Protein Structure Design with Flow Matching
Vsevolod Viliuga, Leif Seute, Nicolas Wolf, Simon Wagner, Arne Elofsson, Jan Stühmer, Frauke Gräter
Comments: ICML 2025
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[533] arXiv:2508.18207 (cross-list from stat.ML) [pdf, other]
Title: Clinical characteristics, complications and outcomes of critically ill patients with Dengue in Brazil, 2012-2024: a nationwide, multicentre cohort study
Igor Tona Peres, Otavio T. Ranzani, Leonardo S.L. Bastos, Silvio Hamacher, Tom Edinburgh, Esteban Garcia-Gallo, Fernando Augusto Bozza
Journal-ref: Peres et al. "Clinical characteristics, complications and outcomes of critically ill patients with Dengue in Brazil, 2012-2024: a nationwide, multicentre cohort study." International Journal of Infectious Diseases (2025): 108023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[534] arXiv:2508.18206 (cross-list from cs.DC) [pdf, html, other]
Title: Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud Accelerators
Ritvik Chaturvedi
Comments: 10 pages, 5 figures
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[535] arXiv:2508.18192 (cross-list from cs.AI) [pdf, html, other]
Title: Unraveling the cognitive patterns of Large Language Models through module communities
Kushal Raj Bhandari, Pin-Yu Chen, Jianxi Gao
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[536] arXiv:2508.18186 (cross-list from cs.CV) [pdf, html, other]
Title: Emerging Semantic Segmentation from Positive and Negative Coarse Label Learning
Le Zhang, Fuping Wu, Arun Thirunavukarasu, Kevin Bronik, Thomas Nichols, Bartlomiej W. Papiez
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[537] arXiv:2508.18178 (cross-list from math.NA) [pdf, other]
Title: Introduction to Regularization and Learning Methods for Inverse Problems
Danielle Bednarski, Tim Roith
Comments: These lecture notes are based on a lecture taught by the authors in the winter semester 2024/2025 at the University of Hamburg
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[538] arXiv:2508.18177 (cross-list from cs.CV) [pdf, html, other]
Title: Scene-Aware Vectorized Memory Multi-Agent Framework with Cross-Modal Differentiated Quantization VLMs for Visually Impaired Assistance
Xiangxiang Wang, Xuanyu Wang, YiJia Luo, Yongbin Yu, Manping Fan, Jingtao Zhang, Liyong Ren
Comments: 28 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[539] arXiv:2508.18166 (cross-list from cs.IR) [pdf, html, other]
Title: PCR-CA: Parallel Codebook Representations with Contrastive Alignment for Multiple-Category App Recommendation
Bin Tan, Wangyao Ge, Yidi Wang, Xin Liu, Jeff Burtoft, Hao Fan, Hui Wang
Comments: 9 pages, 4 figures, conference
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[540] arXiv:2508.18162 (cross-list from cs.LO) [pdf, html, other]
Title: The Computational Complexity of Satisfiability in State Space Models
Eric Alsmann, Martin Lange
Comments: Accepted at ECAI 25
Subjects: Logic in Computer Science (cs.LO); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Machine Learning (cs.LG)
[541] arXiv:2508.18161 (cross-list from quant-ph) [pdf, html, other]
Title: Hybrid Quantum-Classical Learning for Multiclass Image Classification
Shuchismita Anwar, Sowmitra Das, Muhammad Iqbal Hossain, Jishnu Mahmud
Comments: 13 pages, 8 figures
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[542] arXiv:2508.18159 (cross-list from cs.CV) [pdf, html, other]
Title: SpotEdit: Evaluating Visually-Guided Image Editing Methods
Sara Ghazanfari, Wei-An Lin, Haitong Tian, Ersin Yumer
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[543] arXiv:2508.18154 (cross-list from cs.CV) [pdf, html, other]
Title: Assessing the Noise Robustness of Class Activation Maps: A Framework for Reliable Model Interpretability
Syamantak Sarkar, Revoti P. Bora, Bhupender Kaushal, Sudhish N George, Kiran Raja
Comments: Image and Vision Computing (2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[544] arXiv:2508.18136 (cross-list from cs.CV) [pdf, html, other]
Title: BirdRecorder's AI on Sky: Safeguarding birds of prey by detection and classification of tiny objects around wind turbines
Nico Klar, Nizam Gifary, Felix P. G. Ziegler, Frank Sehnke, Anton Kaifel, Eric Price, Aamir Ahmad
Comments: 18 pages, 1 figure, to appear in Proceedings of the 19th International Conference on Intelligent Autonomous Systems (IAS-19), Genoa, Italy, 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
[545] arXiv:2508.18132 (cross-list from cs.IR) [pdf, html, other]
Title: Test-Time Scaling Strategies for Generative Retrieval in Multimodal Conversational Recommendations
Hung-Chun Hsu, Yuan-Ching Kuo, Chao-Han Huck Yang, Szu-Wei Fu, Hanrong Ye, Hongxu Yin, Yu-Chiang Frank Wang, Ming-Feng Tsai, Chuan-Ju Wang
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[546] arXiv:2508.18113 (cross-list from cs.AI) [pdf, html, other]
Title: The AI Data Scientist
Farkhad Akimov, Munachiso Samuel Nwadike, Zangir Iklassov, Martin Takáč
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[547] arXiv:2508.18098 (cross-list from cs.CL) [pdf, html, other]
Title: Detecting and Characterizing Planning in Language Models
Jatin Nainani, Sankaran Vaidyanathan, Connor Watts, Andre N. Assis, Alice Rigg
Comments: 9 pages, 4 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[548] arXiv:2508.18095 (cross-list from cs.CV) [pdf, html, other]
Title: Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem
Zhicong Tang, Tiankai Hang, Shuyang Gu, Dong Chen, Baining Guo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[549] arXiv:2508.18088 (cross-list from cs.CL) [pdf, other]
Title: How Quantization Shapes Bias in Large Language Models
Federico Marcuzzi, Xuefei Ning, Roy Schwartz, Iryna Gurevych
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[550] arXiv:2508.18066 (cross-list from cs.RO) [pdf, html, other]
Title: Arnold: a generalist muscle transformer policy
Alberto Silvio Chiappa, Boshi An, Merkourios Simos, Chengkun Li, Alexander Mathis
Comments: A.S.C. and B.A. contributed equally. Code is available at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[551] arXiv:2508.18012 (cross-list from cs.CV) [pdf, other]
Title: Development of a Neural Network Model for Currency Detection to aid visually impaired people in Nigeria
Sochukwuma Nwokoye, Desmond Moru
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[552] arXiv:2508.18006 (cross-list from eess.AS) [pdf, html, other]
Title: Unseen Speaker and Language Adaptation for Lightweight Text-To-Speech with Adapters
Alessio Falai, Ziyao Zhang, Akos Gangoly
Comments: Accepted at IEEE MLSP 2025
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[553] arXiv:2508.17988 (cross-list from cs.SE) [pdf, html, other]
Title: DesCartes Builder: A Tool to Develop Machine-Learning Based Digital Twins
Eduardo de Conto, Blaise Genest, Arvind Easwaran, Nicholas Ng, Shweta Menon
Comments: 5 pages, 4 figures. Accepted at EDTconf 2025
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
[554] arXiv:2508.17953 (cross-list from cs.CL) [pdf, html, other]
Title: Understanding Subword Compositionality of Large Language Models
Qiwei Peng, Yekun Chai, Anders Søgaard
Comments: EMNLP 2025 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[555] arXiv:2508.17948 (cross-list from cs.CL) [pdf, html, other]
Title: Debiasing Multilingual LLMs in Cross-lingual Latent Space
Qiwei Peng, Guimin Hu, Yekun Chai, Anders Søgaard
Comments: EMNLP 2025 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[556] arXiv:2508.17909 (cross-list from quant-ph) [pdf, html, other]
Title: Entanglement Detection with Quantum-inspired Kernels and SVMs
Ana Martínez-Sabiote, Michalis Skotiniotis, Jara J. Bermejo-Vega, Daniel Manzano, Carlos Cano
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[557] arXiv:2508.17907 (cross-list from cs.GT) [pdf, html, other]
Title: WOMAC: A Mechanism For Prediction Competitions
Siddarth Srinivasan, Tao Lin, Connacher Murphy, Anish Thilagar, Yiling Chen, Ezra Karger
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[558] arXiv:2508.17892 (cross-list from cs.CL) [pdf, html, other]
Title: ILRe: Intermediate Layer Retrieval for Context Compression in Causal Language Models
Manlai Liang, Mandi Liu, Jiangzhou Ji, Huaijun Li, Haobo Yang, Yaohan He, Jinlong Li
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[559] arXiv:2508.17874 (cross-list from cs.SD) [pdf, html, other]
Title: Vocoder-Projected Feature Discriminator
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Yuto Kondo
Comments: Accepted to Interspeech 2025. Project page: this https URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[560] arXiv:2508.17868 (cross-list from cs.SD) [pdf, html, other]
Title: FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Yuto Kondo
Comments: Accepted to Interspeech 2025. Project page: this https URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[561] arXiv:2508.17846 (cross-list from cs.CV) [pdf, html, other]
Title: Alternating Training-based Label Smoothing Enhances Prompt Generalization
Yang Chen, Yanbin Wei, Ke Jin, Yi Kong, James Kwok, Yu Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[562] arXiv:2508.17844 (cross-list from cs.CV) [pdf, html, other]
Title: Diffusion-Based Data Augmentation for Medical Image Segmentation
Maham Nazir, Muhammad Aqeel, Francesco Setti
Comments: Accepted to CVAMD Workshop at ICCV 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[563] arXiv:2508.17827 (cross-list from cs.CV) [pdf, html, other]
Title: A Contrastive Learning-Guided Confident Meta-learning for Zero Shot Anomaly Detection
Muhammad Aqeel, Danijel Skocaj, Marco Cristani, Francesco Setti
Comments: Accepted to VISION Workshop at ICCV 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[564] arXiv:2508.17811 (cross-list from cs.GR) [pdf, html, other]
Title: MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting
Hanzhi Chang, Ruijie Zhu, Wenjie Chang, Mulin Yu, Yanzhe Liang, Jiahao Lu, Zhuoyuan Li, Tianzhu Zhang
Comments: 17 pages, 15 figures, 5 tables
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[565] arXiv:2508.17789 (cross-list from cs.CV) [pdf, html, other]
Title: Robust Anomaly Detection in Industrial Environments via Meta-Learning
Muhammad Aqeel, Shakiba Sharifi, Marco Cristani, Francesco Setti
Comments: Accepted to VISION Workshop at ICCV 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[566] arXiv:2508.17786 (cross-list from cs.AI) [pdf, other]
Title: Interpretable Early Failure Detection via Machine Learning and Trace Checking-based Monitoring
Andrea Brunello, Luca Geatti, Angelo Montanari, Nicola Saccomanno
Comments: Full version of the paper accepted for publication at the 28th European Conference on Artificial Intelligence (ECAI 2025)
Subjects: Artificial Intelligence (cs.AI); Formal Languages and Automata Theory (cs.FL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[567] arXiv:2508.17783 (cross-list from stat.ML) [pdf, html, other]
Title: Algebraic Approach to Ridge-Regularized Mean Squared Error Minimization in Minimal ReLU Neural Network
Ryoya Fukasaku, Yutaro Kabata, Akifumi Okuno
Comments: 44 pages, 5 figures
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Computation (stat.CO)
[568] arXiv:2508.17767 (cross-list from cs.CL) [pdf, html, other]
Title: ISACL: Internal State Analyzer for Copyrighted Training Data Leakage
Guangwei Zhang, Qisheng Su, Jiateng Liu, Cheng Qian, Yanzhou Pan, Yanjie Fu, Denghui Zhang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[569] arXiv:2508.17728 (cross-list from cs.CV) [pdf, other]
Title: Segmentation and Classification of Pap Smear Images for Cervical Cancer Detection Using Deep Learning
Nisreen Albzour, Sarah S. Lam
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[570] arXiv:2508.17690 (cross-list from cs.CL) [pdf, html, other]
Title: Text Meets Topology: Rethinking Out-of-distribution Detection in Text-Rich Networks
Danny Wang, Ruihong Qiu, Guangdong Bai, Zi Huang
Comments: EMNLP 2025 Main
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[571] arXiv:2508.17674 (cross-list from cs.CR) [pdf, html, other]
Title: Attacking LLMs and AI Agents: Advertisement Embedding Attacks Against Large Language Models
Qiming Guo, Jinwen Tang, Xingran Huang
Comments: 7 pages, 2 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[572] arXiv:2508.17661 (cross-list from cs.AI) [pdf, other]
Title: Spacer: Towards Engineered Scientific Inspiration
Minhyeong Lee, Suyoung Hwang, Seunghyun Moon, Geonho Nah, Donghyun Koh, Youngjun Cho, Johyun Park, Hojin Yoo, Jiho Park, Haneul Choi, Sungbin Moon, Taehoon Hwang, Seungwon Kim, Jaeyeong Kim, Seongjun Kim, Juneau Jung
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[573] arXiv:2508.17648 (cross-list from cs.CY) [pdf, other]
Title: Citizen Centered Climate Intelligence: Operationalizing Open Tree Data for Urban Cooling and Eco-Routing in Indian Cities
Kaushik Ravi, Andreas Brück
Comments: Forthcoming book chapter, currently under review for the "HackYourDistrict" initiative at TU Berlin. 20 pages, 9 figures, 1 table
Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[574] arXiv:2508.17622 (cross-list from stat.ML) [pdf, html, other]
Title: The Statistical Fairness-Accuracy Frontier
Alireza Fallah, Michael I. Jordan, Annie Ulichney
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Theoretical Economics (econ.TH); Optimization and Control (math.OC)
[575] arXiv:2508.17600 (cross-list from cs.RO) [pdf, html, other]
Title: GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Guanxing Lu, Baoxiong Jia, Puhao Li, Yixin Chen, Ziwei Wang, Yansong Tang, Siyuan Huang
Comments: Published at ICCV 2025. Project page: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[576] arXiv:2508.17580 (cross-list from cs.CL) [pdf, other]
Title: UQ: Assessing Language Models on Unsolved Questions
Fan Nie, Ken Ziyu Liu, Zihao Wang, Rui Sun, Wei Liu, Weijia Shi, Huaxiu Yao, Linjun Zhang, Andrew Y. Ng, James Zou, Sanmi Koyejo, Yejin Choi, Percy Liang, Niklas Muennighoff
Comments: FN, KZL, and NM are project co-leads and contributed equally. Project website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[577] arXiv:2508.17576 (cross-list from cs.CL) [pdf, html, other]
Title: CausalSent: Interpretable Sentiment Classification with RieszNet
Daniel Frees, Martin Pollack
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[578] arXiv:2508.17568 (cross-list from cs.CV) [pdf, html, other]
Title: MetaGen: A DSL, Database, and Benchmark for VLM-Assisted Metamaterial Generation
Liane Makatura, Benjamin Jones, Siyuan Bian, Wojciech Matusik
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Programming Languages (cs.PL)
[579] arXiv:2508.17567 (cross-list from cs.CV) [pdf, html, other]
Title: Towards Optimal Convolutional Transfer Learning Architectures for Breast Lesion Classification and ACL Tear Detection
Daniel Frees, Moritz Bolling, Aditri Bhagirath
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[580] arXiv:2508.17561 (cross-list from cs.AI) [pdf, html, other]
Title: Consciousness as a Functor
Sridhar Mahadevan
Comments: 31 pages
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[581] arXiv:2508.17555 (cross-list from q-bio.BM) [pdf, html, other]
Title: Boltzina: Efficient and Accurate Virtual Screening via Docking-Guided Binding Prediction with Boltz-2
Kairi Furui, Masahito Ohue
Subjects: Biomolecules (q-bio.BM); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[582] arXiv:2508.17547 (cross-list from cs.RO) [pdf, html, other]
Title: LodeStar: Long-horizon Dexterity via Synthetic Data Augmentation from Human Demonstrations
Weikang Wan, Jiawei Fu, Xiaodi Yuan, Yifeng Zhu, Hao Su
Comments: CoRL 2025
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[583] arXiv:2508.17545 (cross-list from stat.ML) [pdf, html, other]
Title: High-Order Langevin Monte Carlo Algorithms
Thanh Dang, Mert Gurbuzbalaban, Mohammad Rafiqul Islam, Nian Yao, Lingjiong Zhu
Comments: 73 pages, 3 figures, 1 table
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
[584] arXiv:2508.17527 (cross-list from cs.AI) [pdf, html, other]
Title: Evaluating Retrieval-Augmented Generation Strategies for Large Language Models in Travel Mode Choice Prediction
Yiming Xu, Junfeng Jiao
Subjects: Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[585] arXiv:2508.17490 (cross-list from cs.CL) [pdf, html, other]
Title: Efficient Zero-Shot Long Document Classification by Reducing Context Through Sentence Ranking
Prathamesh Kokate, Mitali Sarnaik, Manavi Khopade, Mukta Takalikar, Raviraj Joshi
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[586] arXiv:2508.17468 (cross-list from cs.CV) [pdf, html, other]
Title: A Synthetic Dataset for Manometry Recognition in Robotic Applications
Pedro Antonio Rabelo Saraiva, Enzo Ferreira de Souza, Joao Manoel Herrera Pinheiro, Thiago H. Segreto, Ricardo V. Godoy, Marcelo Becker
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[587] arXiv:2508.17466 (cross-list from cs.RO) [pdf, html, other]
Title: Optimizing Grasping in Legged Robots: A Deep Learning Approach to Loco-Manipulation
Dilermando Almeida, Guilherme Lazzarini, Juliano Negri, Thiago H. Segreto, Ricardo V. Godoy, Marcelo Becker
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[588] arXiv:2508.17444 (cross-list from cs.CL) [pdf, html, other]
Title: MahaParaphrase: A Marathi Paraphrase Detection Corpus and BERT-based Models
Suramya Jadhav, Abhay Shanbhag, Amogh Thakurdesai, Ridhima Sinare, Ananya Joshi, Raviraj Joshi
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[589] arXiv:2508.17440 (cross-list from physics.optics) [pdf, html, other]
Title: Programmable k-local Ising Machines and all-optical Kolmogorov-Arnold Networks on Photonic Platforms
Nikita Stroev, Natalia G. Berloff
Comments: 16 pages, 6 figures
Subjects: Optics (physics.optics); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[590] arXiv:2508.17431 (cross-list from cs.CV) [pdf, html, other]
Title: FedKLPR: Personalized Federated Learning for Person Re-Identification with Adaptive Pruning
Po-Hsien Yu, Yu-Syuan Tseng, Shao-Yi Chien
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[591] arXiv:2508.17353 (cross-list from cs.CY) [pdf, html, other]
Title: Detecting Struggling Student Programmers using Proficiency Taxonomies
Noga Schwartz, Roy Fairstein, Avi Segal, Kobi Gal
Comments: appears at ECAI 2025
Subjects: Computers and Society (cs.CY); Machine Learning (cs.LG)
[592] arXiv:2508.17344 (cross-list from cs.SE) [pdf, html, other]
Title: Who Wins the Race? (R Vs Python) - An Exploratory Study on Energy Consumption of Machine Learning Algorithms
Rajrupa Chattaraj, Sridhar Chimalakonda, Vibhu Saujanya Sharma, Vikrant Kaulgud
Comments: 18 pages including references, 5 figures
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG); Performance (cs.PF); Programming Languages (cs.PL)
[593] arXiv:2508.17337 (cross-list from cs.CL) [pdf, html, other]
Title: DropLoRA: Sparse Low-Rank Adaptation for Parameter-Efficient Fine-Tuning
Haojie Zhang
Comments: 8 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[594] arXiv:2508.17334 (cross-list from cs.CV) [pdf, html, other]
Title: Mind the (Language) Gap: Towards Probing Numerical and Cross-Lingual Limits of LVLMs
Somraj Gautam, Abhirama Subramanyam Penamakuri, Abhishek Bhandari, Gaurav Harit
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[595] arXiv:2508.17290 (cross-list from cs.AI) [pdf, html, other]
Title: MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment
Omid Ghahroodi, Arshia Hemmat, Marzia Nouri, Seyed Mohammad Hadi Hosseini, Doratossadat Dastgheib, Mohammad Vali Sanian, Alireza Sahebi, Reihaneh Zohrabi, Mohammad Hossein Rohban, Ehsaneddin Asgari, Mahdieh Soleymani Baghshah
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[596] arXiv:2508.17283 (cross-list from cs.CV) [pdf, html, other]
Title: Quickly Tuning Foundation Models for Image Segmentation
Breenda Das, Lennart Purucker, Timur Carstensen, Frank Hutter
Comments: Accepted as a short paper at the non-archival content track of AutoML 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[597] arXiv:2508.17261 (cross-list from cs.CV) [pdf, html, other]
Title: CLIFF: Continual Learning for Incremental Flake Features in 2D Material Identification
Sankalp Pandey, Xuan Bac Nguyen, Nicholas Borys, Hugh Churchill, Khoa Luu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[598] arXiv:2508.17236 (cross-list from cs.SI) [pdf, html, other]
Title: Learning Short-Term and Long-Term Patterns of High-Order Dynamics in Real-World Networks
Yunyong Ko, Da Eun Lee, Song Kyung Yu, Sang-Wook Kim
Comments: 5 pages, 4 figures, 2 tables, ACM International Conference on Information and Knowledge Management (CIKM) 2025
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG)
[599] arXiv:2508.17229 (cross-list from cs.SD) [pdf, html, other]
Title: Multi-Metric Preference Alignment for Generative Speech Restoration
Junan Zhang, Xueyao Zhang, Jing Yang, Yuancheng Wang, Fan Fan, Zhizheng Wu
Comments: 16 pages, 10 figures. demopage: this https URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[600] arXiv:2508.17221 (cross-list from cs.AI) [pdf, html, other]
Title: MC3G: Model Agnostic Causally Constrained Counterfactual Generation
Sopam Dasgupta, Sadaf MD Halim, Joaquín Arias, Elmer Salazar, Gopal Gupta
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[601] arXiv:2508.17219 (cross-list from cs.DC) [pdf, html, other]
Title: TokenLake: A Unified Segment-level Prefix Cache Pool for Fine-grained Elastic Long-Context LLM Serving
Bingyang Wu, Zili Zhang, Yinmin Zhong, Guanzhe Huang, Yibo Zhu, Xuanzhe Liu, Xin Jin
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[602] arXiv:2508.17216 (cross-list from cs.CV) [pdf, html, other]
Title: Deep Learning with Self-Attention and Enhanced Preprocessing for Precise Diagnosis of Acute Lymphoblastic Leukemia from Bone Marrow Smears in Hemato-Oncology
Md. Maruf, Md. Mahbubul Haque, Bishowjit Paul
Comments: 26 pages, 15 figures, 8 tables. VGG19+MHSA with Focal Loss; test accuracy 99.25%
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[603] arXiv:2508.17180 (cross-list from cs.AI) [pdf, html, other]
Title: MaRVL-QA: A Benchmark for Mathematical Reasoning over Visual Landscapes
Nilay Pande, Sahiti Yerramilli, Jayant Sravan Tamarapalli, Rynaa Grover
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[604] arXiv:2508.17172 (cross-list from cs.CV) [pdf, html, other]
Title: VROOM - Visual Reconstruction over Onboard Multiview
Yajat Yadav, Varun Bharadwaj, Jathin Korrapati, Tanish Baranwal
Comments: Project page with videos and interactive 4D visualizations: this https URL, Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[605] arXiv:2508.17152 (cross-list from stat.ML) [pdf, html, other]
Title: On the sample complexity of semi-supervised multi-objective learning
Tobias Wegel, Geelon So, Junhyung Park, Fanny Yang
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[606] arXiv:2508.17151 (cross-list from econ.GN) [pdf, other]
Title: Integrative Experiments Identify How Punishment Impacts Welfare in Public Goods Games
Mohammed Alsobay, David G. Rand, Duncan J. Watts, Abdullah Almaatouq
Subjects: General Economics (econ.GN); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[607] arXiv:2508.17142 (cross-list from eess.SY) [pdf, html, other]
Title: Frequency Response Identification of Low-Order Systems: Finite-Sample Analysis
Arya Honarpisheh, Mario Sznaier
Comments: 15 pages, Submitted to IEEE Transactions on Automatic Control
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Machine Learning (stat.ML)
[608] arXiv:2508.17136 (cross-list from stat.ML) [pdf, html, other]
Title: Factor Informed Double Deep Learning For Average Treatment Effect Estimation
Jianqing Fan, Soham Jana, Sanjeev Kulkarni, Qishuo Yin
Comments: 41 pages, 3 figures, 4 tables
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[609] arXiv:2508.17135 (cross-list from stat.ML) [pdf, html, other]
Title: Rao Differential Privacy
Carlos Soto
Comments: 13 pages
Subjects: Machine Learning (stat.ML); Cryptography and Security (cs.CR); Information Theory (cs.IT); Machine Learning (cs.LG)
[610] arXiv:2508.17126 (cross-list from cs.CL) [pdf, other]
Title: Token Homogenization under Positional Bias
Viacheslav Yusupov, Danil Maksimov, Ameliia Alaeva, Tatiana Zaitceva, Antipina Anna, Anna Vasileva, Chenlin Liu, Rayuth Chheng, Danil Sazanakov, Andrey Chetvergov, Alina Ermilova, Egor Shvetsov
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[611] arXiv:2508.17122 (cross-list from math.OC) [pdf, html, other]
Title: HV Metric For Time-Domain Full Waveform Inversion
Matej Neumann, Yunan Yang
Comments: 30 Pages
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[612] arXiv:2508.17117 (cross-list from cs.CV) [pdf, html, other]
Title: PlantVillageVQA: A Visual Question Answering Dataset for Benchmarking Vision-Language Models in Plant Science
Syed Nazmus Sakib, Nafiul Haque, Mohammad Zabed Hossain, Shifat E. Arman
Comments: 17 pages, 15 figures, submitted to Nature Scientific Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[613] arXiv:2508.17107 (cross-list from cs.CV) [pdf, html, other]
Title: SugarcaneShuffleNet: A Very Fast, Lightweight Convolutional Neural Network for Diagnosis of 15 Sugarcane Leaf Diseases
Shifat E. Arman, Hasan Muhammad Abdullah, Syed Nazmus Sakib, RM Saiem, Shamima Nasrin Asha, Md Mehedi Hasan, Shahrear Bin Amin, S M Mahin Abrar
Comments: 18 pages, 19 figures, submitted to Computers and Electronics in Agriculture
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[614] arXiv:2508.17092 (cross-list from cs.CY) [pdf, html, other]
Title: Enhancing Knowledge Tracing through Leakage-Free and Recency-Aware Embeddings
Yahya Badran, Christine Preisach
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[615] arXiv:2508.17090 (cross-list from stat.ML) [pdf, html, other]
Title: Neural Stochastic Differential Equations on Compact State-Spaces
Yue-Jane Liu, Malinda Lu, Matthew K. Nock, Yaniv Yacoby
Comments: Accepted at Methods and Opportunities at Small Scale (MOSS), ICML 2025, Vancouver, Canada
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[616] arXiv:2508.17086 (cross-list from q-fin.CP) [pdf, html, other]
Title: A Decoupled LOB Representation Framework for Multilevel Manipulation Detection with Supervised Contrastive Learning
Yushi Lin, Peng Yang
Subjects: Computational Finance (q-fin.CP); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Trading and Market Microstructure (q-fin.TR)
[617] arXiv:2508.17077 (cross-list from stat.ML) [pdf, html, other]
Title: CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference
Luben M. C. Cabezas, Vagner S. Santos, Thiago R. Ramos, Pedro L. C. Rodrigues, Rafael Izbicki
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[618] arXiv:2508.17057 (cross-list from cs.CL) [pdf, html, other]
Title: GRAID: Synthetic Data Generation with Geometric Constraints and Multi-Agentic Reflection for Harmful Content Detection
Melissa Kazemi Rad, Alberto Purpura, Himanshu Kumar, Emily Chen, Mohammad Shahed Sorower
Comments: 19 pages, 12 figures
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[619] arXiv:2508.17018 (cross-list from stat.ML) [pdf, html, other]
Title: Limitations of refinement methods for weak to strong generalization
Seamus Somerstep, Ya'acov Ritov, Mikhail Yurochkin, Subha Maity, Yuekai Sun
Comments: COLM 2025
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[620] arXiv:2508.17008 (cross-list from cs.CL) [pdf, html, other]
Title: EduRABSA: An Education Review Dataset for Aspect-based Sentiment Analysis Tasks
Yan Cathy Hua, Paul Denny, Jörg Wicker, Katerina Taskova
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[621] arXiv:2508.17000 (cross-list from cs.CL) [pdf, html, other]
Title: KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF
Jason R Brown, Lennie Wells, Edward James Young, Sergio Bacallado
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[622] arXiv:2508.16995 (cross-list from stat.ML) [pdf, html, other]
Title: GraphPPD: Posterior Predictive Modelling for Graph-Level Inference
Soumyasundar Pal, Liheng Ma, Amine Natik, Yingxue Zhang, Mark Coates
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[623] arXiv:2508.16976 (cross-list from cs.CV) [pdf, html, other]
Title: Preserving Domain Generalization in Fine-Tuning via Joint Parameter Selection
Bin Pan, Shiyu Shen, Zongbin Wang, Zhenwei Shi, Xia Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[624] arXiv:2508.16916 (cross-list from physics.flu-dyn) [pdf, html, other]
Title: The compressible Neural Particle Method for Simulating Compressible Viscous Fluid Flows
Masato Shibukawa, Naoya Ozaki, Maximilien Berthet
Comments: 13 pages, 5 figures, submitted to PASJ
Subjects: Fluid Dynamics (physics.flu-dyn); Machine Learning (cs.LG)
[625] arXiv:2508.16860 (cross-list from cs.SE) [pdf, other]
Title: TriagerX: Dual Transformers for Bug Triaging Tasks with Content and Interaction Based Rankings
Md Afif Al Mamun, Gias Uddin, Lan Xia, Longyu Zhang
Comments: This work is currently under review at IEEE Transactions on Software Engineering. The replication package will be made publicly available upon acceptance
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[626] arXiv:2508.16845 (cross-list from cs.CV) [pdf, html, other]
Title: NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows
Denis Tarasov, Alexander Nikulin, Ilya Zisman, Albina Klepach, Nikita Lyubaykin, Andrei Polubarov, Alexander Derevyagin, Vladislav Kurenkov
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[627] arXiv:2508.16821 (cross-list from cs.AI) [pdf, html, other]
Title: PuzzleJAX: A Benchmark for Reasoning and Learning
Sam Earle, Graham Todd, Yuchen Li, Ahmed Khalifa, Muhammad Umair Nasir, Zehua Jiang, Andrzej Banburski-Fahey, Julian Togelius
Comments: 25 pages, 11 figures, 2 tables
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[628] arXiv:2508.16817 (cross-list from math.OC) [pdf, html, other]
Title: Predictability Enables Parallelization of Nonlinear State Space Models
Xavier Gonzalez, Leo Kozachkov, David M. Zoltowski, Kenneth L. Clarkson, Scott W. Linderman
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY); Dynamical Systems (math.DS); Machine Learning (stat.ML)
[629] arXiv:2508.16807 (cross-list from cs.RO) [pdf, html, other]
Title: Autonomous UAV Flight Navigation in Confined Spaces: A Reinforcement Learning Approach
Marco S. Tayar, Lucas K. de Oliveira, Juliano D. Negri, Thiago H. Segreto, Ricardo V. Godoy, Marcelo Becker
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY)
[630] arXiv:2508.16793 (cross-list from cs.IR) [pdf, html, other]
Title: Bootstrapping Conditional Retrieval for User-to-Item Recommendations
Hongtao Lin, Haoyu Chen, Jaewon Jang, Jiajing Xu
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[631] arXiv:2508.16790 (cross-list from cs.SD) [pdf, other]
Title: TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
Yuancheng Wang, Dekun Chen, Xueyao Zhang, Junan Zhang, Jiaqi Li, Zhizheng Wu
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[632] arXiv:2508.16767 (cross-list from math.NA) [pdf, html, other]
Title: Walk-on-Interfaces: A Monte Carlo Estimator for an Elliptic Interface Problem with Nonhomogeneous Flux Jump Conditions and a Neumann Boundary Condition
Xinwen Ding, Adam R Stinchcombe
Comments: 49 pages, 14 figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[633] arXiv:2508.16747 (cross-list from cs.AI) [pdf, other]
Title: Explainable AI for Predicting and Understanding Mathematics Achievement: A Cross-National Analysis of PISA 2018
Liu Liu, Rui Dai
Subjects: Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[634] arXiv:2508.16742 (cross-list from cs.CV) [pdf, other]
Title: CellEcoNet: Decoding the Cellular Language of Pathology with Deep Learning for Invasive Lung Adenocarcinoma Recurrence Prediction
Abdul Rehman Akbar, Usama Sajjad, Ziyu Su, Wencheng Li, Fei Xing, Jimmy Ruiz, Wei Chen, Muhammad Khalid Khan Niazi
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[635] arXiv:2508.16730 (cross-list from eess.IV) [pdf, html, other]
Title: Analysis of Transferability Estimation Metrics for Surgical Phase Recognition
Prabhant Singh, Yiping Li, Yasmina Al Khalil
Comments: Accepted at DEMI workshop MICCAI 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[636] arXiv:2508.16712 (cross-list from cs.PF) [pdf, html, other]
Title: Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Tianyao Shi, Yi Ding
Comments: 14 pages, 10 figures, 4 tables
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[637] arXiv:2508.16707 (cross-list from cs.CL) [pdf, html, other]
Title: Sparse and Dense Retrievers Learn Better Together: Joint Sparse-Dense Optimization for Text-Image Retrieval
Jonghyun Song, Youngjune Lee, Gyu-Hwung Cho, Ilhyeon Song, Saehun Kim, Yohan Jo
Comments: accepted to CIKM 2025 short research paper track
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[638] arXiv:2508.16703 (cross-list from cs.PF) [pdf, html, other]
Title: Dynamic Sparse Attention on Mobile SoCs
Wangsong Yin, Daliang Xu, Mengwei Xu, Gang Huang, Xuanzhe Liu
Comments: Technical Report
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[639] arXiv:2508.16697 (cross-list from cs.CL) [pdf, html, other]
Title: QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting
Nicole Cho, William Watson, Alec Koppel, Sumitra Ganesh, Manuela Veloso
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[640] arXiv:2508.16670 (cross-list from cs.CV) [pdf, other]
Title: COVID19 Prediction Based On CT Scans Of Lungs Using DenseNet Architecture
Deborup Sanyal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[641] arXiv:2508.16663 (cross-list from cs.CV) [pdf, html, other]
Title: The Loupe: A Plug-and-Play Attention Module for Amplifying Discriminative Features in Vision Transformers
Naren Sengodan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[642] arXiv:2508.16640 (cross-list from physics.geo-ph) [pdf, html, other]
Title: Generative Latent Diffusion Model for Inverse Modeling and Uncertainty Analysis in Geological Carbon Sequestration
Zhao Feng, Xin-Yang Liu, Meet Hemant Parikh, Junyi Guo, Pan Du, Bicheng Yan, Jian-Xun Wang
Subjects: Geophysics (physics.geo-ph); Machine Learning (cs.LG)
[643] arXiv:2508.16606 (cross-list from cs.HC) [pdf, html, other]
Title: Multimodal Appearance based Gaze-Controlled Virtual Keyboard with Synchronous Asynchronous Interaction for Low-Resource Settings
Yogesh Kumar Meena, Manish Salvi
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[644] arXiv:2508.16604 (cross-list from cs.HC) [pdf, html, other]
Title: WHAR Datasets: An Open Source Library for Wearable Human Activity Recognition
Maximilian Burzer, Tobias King, Till Riedel, Michael Beigl, Tobias Röddiger
Comments: 6 pages, 7 figures, to appear in Companion of the 2025 ACM International Joint Conference on Pervasive and Ubiquitous Computing (UbiComp), OpenWearables Workshop (accepted paper)
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[645] arXiv:2508.16603 (cross-list from cs.CL) [pdf, html, other]
Title: GreenTEA: Gradient Descent with Topic-modeling and Evolutionary Auto-prompting
Zheng Dong, Luming Shang, Gabriela Olinto
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[646] arXiv:2508.16597 (cross-list from q-bio.NC) [pdf, html, other]
Title: Bridging Foundation Models and Efficient Architectures: A Modular Brain Imaging Framework with Local Masking and Pretrained Representation Learning
Yanwen Wang, Xinglin Zhao, Yijin Song, Xiaobo Liu, Yanrong Hao, Rui Cao, Xin Wen
Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[647] arXiv:2508.16587 (cross-list from q-bio.BM) [pdf, html, other]
Title: HemePLM-Diffuse: A Scalable Generative Framework for Protein-Ligand Dynamics in Large Biomolecular System
Rakesh Thakur, Riya Gupta
Comments: 7 pages, 9 figures and 1 table
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
[648] arXiv:2508.16582 (cross-list from cs.HC) [pdf, html, other]
Title: Predicting User Grasp Intentions in Virtual Reality
Linghao Zeng
Comments: 45 pages, 24 figures. This is a Master's thesis submitted as part of the M2 IASD (Artificial Intelligence, Systems, Data) program at Université PSL
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[649] arXiv:2508.16581 (cross-list from cs.HC) [pdf, html, other]
Title: Increasing Interaction Fidelity: Training Routines for Biomechanical Models in HCI
Michał Patryk Miazga, Patrick Ebel
Journal-ref: The 38th Annual ACM Symposium on User Interface Software and Technology (UIST Adjunct 2025)
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[650] arXiv:2508.15371 (cross-list from cs.CL) [pdf, other]
Title: Confidence-Modulated Speculative Decoding for Large Language Models
Jaydip Sen, Subhasis Dasgupta, Hetvi Waghela
Comments: This is the preprint of the paper, which has been accepted for oral presentation and publication in the proceedings of IEEE INDISCON 2025. The conference will be organized at the National Institute of Technology, Rourkela, India, from August 21 to 23, 2025. The paper is 10 pages long, and it contains 2 figures and 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[651] arXiv:2508.16576 [pdf, html, other]
Title: Benchmarking Training Paradigms, Dataset Composition, and Model Scaling for Child ASR in ESPnet
Anyu Ying, Natarajan Balaji Shankar, Chyi-Jiunn Lin, Mohan Shi, Pu Wang, Hye-jin Shim, Siddhant Arora, Hugo Van hamme, Abeer Alwan, Shinji Watanabe
Comments: 5 pages, 3 figures, presented at WOCCI 2025 (Workshop on Child Computer Interaction), satellite workshop of Interspeech 2025
Subjects: Machine Learning (cs.LG)

Mon, 25 Aug 2025 (showing 129 of 130 entries)

[652] arXiv:2508.16568 [pdf, html, other]
Title: Closer to Reality: Practical Semi-Supervised Federated Learning for Foundation Model Adaptation
Guangyu Sun, Jingtao Li, Weiming Zhuang, Chen Chen, Chen Chen, Lingjuan Lyu
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[653] arXiv:2508.16560 [pdf, html, other]
Title: Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders
David Chanin, Adrià Garriga-Alonso
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[654] arXiv:2508.16553 [pdf, html, other]
Title: TinyML Towards Industry 4.0: Resource-Efficient Process Monitoring of a Milling Machine
Tim Langer, Matthias Widra, Volkhard Beyer
Comments: 10 pages, 5 figures, 1 table
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Signal Processing (eess.SP); Systems and Control (eess.SY)
[655] arXiv:2508.16546 [pdf, html, other]
Title: RL Is Neither a Panacea Nor a Mirage: Understanding Supervised vs. Reinforcement Learning Fine-Tuning for LLMs
Hangzhan Jin, Sicheng Lv, Sifan Wu, Mohammad Hamdaqa
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[656] arXiv:2508.16543 [pdf, html, other]
Title: Explainable AI in Deep Learning-Based Prediction of Solar Storms
Adam O. Rawashdeh, Jason T. L. Wang, Katherine G. Herbert
Comments: 6 pages, 8 figures
Subjects: Machine Learning (cs.LG)
[657] arXiv:2508.16540 [pdf, html, other]
Title: Escaping Saddle Points via Curvature-Calibrated Perturbations: A Complete Analysis with Explicit Constants and Empirical Validation
Faruk Alpay, Hamdi Alakkad
Comments: 16 pages. Perturbed gradient descent with fully explicit constants for escaping saddle points, validated empirically
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[658] arXiv:2508.16521 [pdf, html, other]
Title: Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation
Zhijian Zhou, Junyi An, Zongkai Liu, Yunfei Shi, Xuan Zhang, Fenglei Cao, Chao Qu, Yuan Qi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[659] arXiv:2508.16514 [pdf, html, other]
Title: FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
Parker Seegmiller, Kartik Mehta, Soumya Saha, Chenyang Tao, Shereen Oraby, Arpit Gupta, Tagyoung Chung, Mohit Bansal, Nanyun Peng
Comments: To appear at EMNLP 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[660] arXiv:2508.16503 [pdf, html, other]
Title: MuST2-Learn: Multi-view Spatial-Temporal-Type Learning for Heterogeneous Municipal Service Time Estimation
Nadia Asif, Zhiqing Hong, Shaogang Ren, Xiaonan Zhang, Xiaojun Shang, Yukun Yuan
Comments: Accepted to SIGSPATIAL 2025
Subjects: Machine Learning (cs.LG)
[661] arXiv:2508.16496 [pdf, html, other]
Title: On Zero-Shot Reinforcement Learning
Scott Jeen
Comments: PhD thesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[662] arXiv:2508.16495 [pdf, html, other]
Title: Post Hoc Regression Refinement via Pairwise Rankings
Kevin Tirta Wijaya, Michael Sun, Minghao Guo, Hans-Peter Seidel, Wojciech Matusik, Vahid Babaei
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[663] arXiv:2508.16487 [pdf, html, other]
Title: FraPPE: Fast and Efficient Preference-based Pure Exploration
Udvas Das, Apurv Shukla, Debabrota Basu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
[664] arXiv:2508.16481 [pdf, html, other]
Title: Benchmarking the Robustness of Agentic Systems to Adversarially-Induced Harms
Jonathan Nöther, Adish Singla, Goran Radanovic
Comments: 52 Pages
Subjects: Machine Learning (cs.LG)
[665] arXiv:2508.16476 [pdf, html, other]
Title: NOSTRA: A noise-resilient and sparse data framework for trust region based multi objective Bayesian optimization
Maryam Ghasemzadeh, Anton van Beek
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[666] arXiv:2508.16447 [pdf, html, other]
Title: Boardwalk: Towards a Framework for Creating Board Games with LLMs
Álvaro Guglielmin Becker, Gabriel Bauer de Oliveira, Lana Bertoldo Rossato, Anderson Rocha Tavares
Comments: Accepted at SBGames 2025
Subjects: Machine Learning (cs.LG)
[667] arXiv:2508.16420 [pdf, html, other]
Title: Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning
Yue Pei, Hongming Zhang, Chao Gao, Martin Müller, Mengxiao Zhu, Hao Sheng, Haogang Zhu, Liang Lin
Subjects: Machine Learning (cs.LG)
[668] arXiv:2508.16403 [pdf, html, other]
Title: Fast and Accurate RFIC Performance Prediction via Pin Level Graph Neural Networks and Probabilistic Flow
Anahita Asadi, Leonid Popryho, Inna Partin-Vaisband
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Machine Learning (cs.LG)
[669] arXiv:2508.16386 [pdf, html, other]
Title: Sequential Cohort Selection
Hortence Phalonne Nana, Christos Dimitrakakis
Comments: 9 pages, 7 figures
Subjects: Machine Learning (cs.LG)
[670] arXiv:2508.16377 [pdf, html, other]
Title: Applications and Challenges of Fairness APIs in Machine Learning Software
Ajoy Das, Gias Uddin, Shaiful Chowdhury, Mostafijur Rahman Akhond, Hadi Hemmati
Subjects: Machine Learning (cs.LG); Software Engineering (cs.SE)
[671] arXiv:2508.16359 [pdf, html, other]
Title: RotaTouille: Rotation Equivariant Deep Learning for Contours
Odin Hoff Gardaa, Nello Blaser
Comments: 12 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[672] arXiv:2508.16355 [pdf, html, other]
Title: Probabilistic Pretraining for Neural Regression
Boris N. Oreshkin, Shiv Tavker, Dmitry Efimov
Subjects: Machine Learning (cs.LG)
[673] arXiv:2508.16336 [pdf, html, other]
Title: Unsupervised Online Detection of Pipe Blockages and Leakages in Water Distribution Networks
Jin Li, Kleanthis Malialis, Stelios G. Vrachimis, Marios M. Polycarpou
Comments: This paper is accepted by the 6th International Conference on Control and Fault-Tolerant Systems (SysTol)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[674] arXiv:2508.16315 [pdf, html, other]
Title: OwkinZero: Accelerating Biological Discovery with AI
Nathan Bigaud, Vincent Cabeli, Meltem Gürel, Arthur Pignet, John Klein, Gilles Wainrib, Eric Durand
Comments: Preprint
Subjects: Machine Learning (cs.LG)
[675] arXiv:2508.16314 [pdf, html, other]
Title: Cyber Physical Awareness via Intent-Driven Threat Assessment: Enhanced Space Networks with Intershell Links
Selen Gecgel Cetin, Tolga Ovatman, Gunes Karabulut Kurt
Comments: in IEEE Wireless Communications Letters, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[676] arXiv:2508.16313 [pdf, html, other]
Title: Retrieval Enhanced Feedback via In-context Neural Error-book
Jongyeop Hyun, Bumsoo Kim
Comments: Accepted at EMNLP 2025 main conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[677] arXiv:2508.16269 [pdf, html, other]
Title: Representation Learning of Auxiliary Concepts for Improved Student Modeling and Exercise Recommendation
Yahya Badran, Christine Preisach
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[678] arXiv:2508.16261 [pdf, html, other]
Title: On the Evolution of Federated Post-Training Large Language Models: A Model Accessibility View
Tao Guo, Junxiao Wang, Fushuo Huo, Laizhong Cui, Song Guo, Jie Gui, Dacheng Tao
Subjects: Machine Learning (cs.LG)
[679] arXiv:2508.16255 [pdf, html, other]
Title: Chunked Data Shapley: A Scalable Dataset Quality Assessment for Machine Learning
Andreas Loizou, Dimitrios Tsoumakos
Journal-ref: Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM '25), November 10-14, 2025, Seoul, Republic of Korea
Subjects: Machine Learning (cs.LG)
[680] arXiv:2508.16254 [pdf, html, other]
Title: FEST: A Unified Framework for Evaluating Synthetic Tabular Data
Weijie Niu, Alberto Huertas Celdran, Karoline Siarsky, Burkhard Stiller
Comments: 11 pages, International Conference on Information Systems Security and Privacy
Subjects: Machine Learning (cs.LG)
[681] arXiv:2508.16244 [pdf, other]
Title: When Simpler Wins: Facebooks Prophet vs LSTM for Air Pollution Forecasting in Data-Constrained Northern Nigeria
Habeeb Balogun, Yahaya Zakari
Subjects: Machine Learning (cs.LG)
[682] arXiv:2508.16237 [pdf, html, other]
Title: A XAI-based Framework for Frequency Subband Characterization of Cough Spectrograms in Chronic Respiratory Disease
Patricia Amado-Caballero, Luis M. San-José-Revuelta, Xinheng Wang, José Ramón Garmendia-Leiza, Carlos Alberola-López, Pablo Casaseca-de-la-Higuera
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[683] arXiv:2508.16235 [pdf, html, other]
Title: PIANO: Physics Informed Autoregressive Network
Mayank Nagda, Jephte Abijuru, Phil Ostheimer, Marius Kloft, Sophie Fellenz
Subjects: Machine Learning (cs.LG)
[684] arXiv:2508.16227 [pdf, html, other]
Title: UMATO: Bridging Local and Global Structures for Reliable Visual Analytics with Dimensionality Reduction
Hyeon Jeon, Kwon Ko, Soohyun Lee, Jake Hyun, Taehyun Yang, Gyehun Go, Jaemin Jo, Jinwook Seo
Comments: IEEE Transactions on Visualization and Computer Graphics
Subjects: Machine Learning (cs.LG)
[685] arXiv:2508.16191 [pdf, html, other]
Title: GEM: A Scale-Aware and Distribution-Sensitive Sparse Fine-Tuning Framework for Effective Downstream Adaptation
Sungmin Kang, Jisoo Kim, Salman Avestimehr, Sunwoo Lee
Subjects: Machine Learning (cs.LG)
[686] arXiv:2508.16179 [pdf, other]
Title: Motor Imagery EEG Signal Classification Using Minimally Random Convolutional Kernel Transform and Hybrid Deep Learning
Jamal Hwaidi, Mohamed Chahine Ghanem
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[687] arXiv:2508.16171 [pdf, html, other]
Title: SPL-LNS: Sampling-Enhanced Large Neighborhood Search for Solving Integer Linear Programs
Shengyu Feng, Zhiqing Sun, Yiming Yang
Subjects: Machine Learning (cs.LG)
[688] arXiv:2508.16161 [pdf, html, other]
Title: STA-GANN: A Valid and Generalizable Spatio-Temporal Kriging Approach
Yujie Li, Zezhi Shao, Chengqing Yu, Tangwen Qian, Zhao Zhang, Yifan Du, Shaoming He, Fei Wang, Yongjun Xu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[689] arXiv:2508.16154 [pdf, html, other]
Title: On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
Yi Zhang, Zhenyu Liao, Jingfeng Wu, Difan Zou
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[690] arXiv:2508.16153 [pdf, html, other]
Title: Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Huichi Zhou, Yihang Chen, Siyuan Guo, Xue Yan, Kin Hei Lee, Zihan Wang, Ka Yiu Lee, Guchun Zhang, Kun Shao, Linyi Yang, Jun Wang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[691] arXiv:2508.16135 [pdf, html, other]
Title: Machine Learning in Micromobility: A Systematic Review of Datasets, Techniques, and Applications
Sen Yan, Chinmaya Kaundanya, Noel E. O'Connor, Suzanne Little, Mingming Liu
Comments: 14 pages, 3 tables, and 4 figures, submitted to IEEE Transactions on Intelligent Vehicles
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Image and Video Processing (eess.IV)
[692] arXiv:2508.16134 [pdf, html, other]
Title: CommonKV: Compressing KV Cache with Cross-layer Parameter Sharing
Yixuan Wang, Haoyu Qiao, Lujun Li, Qingfu Zhu, Wanxiang Che
Comments: 10 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[693] arXiv:2508.16097 [pdf, html, other]
Title: Machine Learning for Medicine Must Be Interpretable, Shareable, Reproducible and Accountable by Design
Ayyüce Begüm Bektaş, Mithat Gönen
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[694] arXiv:2508.16090 [pdf, html, other]
Title: GPLight+: A Genetic Programming Method for Learning Symmetric Traffic Signal Control Policy
Xiao-Cheng Liao, Yi Mei, Mengjie Zhang
Journal-ref: IEEE Transactions on Evolutionary Computation, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[695] arXiv:2508.16082 [pdf, other]
Title: On Task Vectors and Gradients
Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Giuseppe Alessio D'Inverno, Fabrizio Silvestri, Emanuele Rodolà
Comments: 9 pages of main paper, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[696] arXiv:2508.16073 [pdf, html, other]
Title: A State-Space Approach to Nonstationary Discriminant Analysis
Shuilian Xie, Mahdi Imani, Edward R. Dougherty, Ulisses M. Braga-Neto
Subjects: Machine Learning (cs.LG)
[697] arXiv:2508.16037 [pdf, html, other]
Title: Pareto Actor-Critic for Communication and Computation Co-Optimization in Non-Cooperative Federated Learning Services
Renxuan Tan, Rongpeng Li, Xiaoxue Yu, Xianfu Chen, Xing Xu, Zhifeng Zhao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[698] arXiv:2508.16015 [pdf, html, other]
Title: Tessellation Groups, Harmonic Analysis on Non-compact Symmetric Spaces and the Heat Kernel in view of Cartan Convolutional Neural Networks
Pietro Fré, Federico Milanesio, Marcelo Oyarzo, Matteo Santoro, Mario Trigiante
Comments: 82 pages + appendices
Subjects: Machine Learning (cs.LG); High Energy Physics - Theory (hep-th); Differential Geometry (math.DG)
[699] arXiv:2508.15998 [pdf, html, other]
Title: Quantum Federated Learning: A Comprehensive Survey
Dinh C. Nguyen, Md Raihan Uddin, Shaba Shaon, Ratun Rahman, Octavia Dobre, Dusit Niyato
Comments: 37 pages, under revision at IEEE Communications Surveys & Tutorials
Subjects: Machine Learning (cs.LG)
[700] arXiv:2508.15989 [pdf, html, other]
Title: Scalable Equilibrium Propagation via Intermediate Error Signals for Deep Convolutional CRNNs
Jiaqi Lin, Malyaban Bal, Abhronil Sengupta
Subjects: Machine Learning (cs.LG); Emerging Technologies (cs.ET)
[701] arXiv:2508.15966 [pdf, html, other]
Title: Vector preference-based contextual bandits under distributional shifts
Apurv Shukla, P.R. Kumar
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Probability (math.PR); Machine Learning (stat.ML)
[702] arXiv:2508.15963 [pdf, other]
Title: Advancing rail safety: An onboard measurement system of rolling stock wheel flange wear based on dynamic machine learning algorithms
Celestin Nkundineza, James Ndodana Njaji, Samrawit Abubeker, Omar Gatera, Damien Hanyurwimfura
Comments: Journal article published in Transportation Research Record: The Journal of Transportation Research Board
Journal-ref: Transportation Research Record, 2679(7), 791-810 (2025)
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Signal Processing (eess.SP); Systems and Control (eess.SY); Instrumentation and Detectors (physics.ins-det)
[703] arXiv:2508.15949 [pdf, html, other]
Title: An Efficient Hybridization of Graph Representation Learning and Metaheuristics for the Constrained Incremental Graph Drawing Problem
Bruna C. B. Charytitsch, María C. V. Nascimento
Comments: The paper has been accepted for publication in the European Journal of Operational Research. Supplementary material will be available on the journal website or upon request
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[704] arXiv:2508.15929 [pdf, html, other]
Title: Low-dimensional embeddings of high-dimensional data
Cyril de Bodt, Alex Diaz-Papkovich, Michael Bleher, Kerstin Bunte, Corinna Coupette, Sebastian Damrich, Enrique Fita Sanmartin, Fred A. Hamprecht, Emőke-Ágnes Horvát, Dhruv Kohli, Smita Krishnaswamy, John A. Lee, Boudewijn P. F. Lelieveldt, Leland McInnes, Ian T. Nabney, Maximilian Noichl, Pavlin G. Poličar, Bastian Rieck, Guy Wolf, Gal Mishne, Dmitry Kobak
Comments: This work was the result of Dagstuhl Seminar 24122
Subjects: Machine Learning (cs.LG)
[705] arXiv:2508.15928 [pdf, html, other]
Title: Transforming Causality: Transformer-Based Temporal Causal Discovery with Prior Knowledge Integration
Jihua Huang, Yi Yao, Ajay Divakaran
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[706] arXiv:2508.15881 [pdf, html, other]
Title: TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
Xiaojuan Tang, Fanxu Meng, Pingzhi Tang, Yuxuan Wang, Di Yin, Xing Sun, Muhan Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[707] arXiv:2508.15872 [pdf, other]
Title: Physics-Based Explainable AI for ECG Segmentation: A Lightweight Model
Muhammad Fathur Rohman Sidiq, Abdurrouf, Didik Rahadi Santoso
Comments: 16 pages
Subjects: Machine Learning (cs.LG)
[708] arXiv:2508.15852 [pdf, other]
Title: PGF-Net: A Progressive Gated-Fusion Framework for Efficient Multimodal Sentiment Analysis
Bin Wen, Tien-Ping Tan
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[709] arXiv:2508.15828 [pdf, html, other]
Title: Z-Pruner: Post-Training Pruning of Large Language Models for Efficiency without Retraining
Samiul Basir Bhuiyan, Md. Sazzad Hossain Adib, Mohammed Aman Bhuiyan, Muhammad Rafsan Kabir, Moshiur Farazi, Shafin Rahman, Nabeel Mohammed
Comments: Accepted at AICCSA 2025
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[710] arXiv:2508.16555 (cross-list from cs.CL) [pdf, html, other]
Title: Transfer Learning via Lexical Relatedness: A Sarcasm and Hate Speech Case Study
Angelly Cabrera, Linus Lei, Antonio Ortega
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[711] arXiv:2508.16554 (cross-list from cond-mat.mtrl-sci) [pdf, html, other]
Title: Machine Learning Time Propagators for Time-Dependent Density Functional Theory Simulations
Karan Shah, Attila Cangi
Comments: 20 pages, 5 figures
Subjects: Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[712] arXiv:2508.16544 (cross-list from eess.SP) [pdf, html, other]
Title: Parameter-Free Logit Distillation via Sorting Mechanism
Stephen Ekaputra Limantoro
Comments: Accepted in IEEE Signal Processing Letters 2025
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[713] arXiv:2508.16531 (cross-list from cs.DS) [pdf, html, other]
Title: Quality control in sublinear time: a case study via random graphs
Cassandra Marcussen, Ronitt Rubinfeld, Madhu Sudan
Comments: 70 pages
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Combinatorics (math.CO); Probability (math.PR)
[714] arXiv:2508.16509 (cross-list from physics.bio-ph) [pdf, html, other]
Title: ML-PWS: Estimating the Mutual Information Between Experimental Time Series Using Neural Networks
Manuel Reinhardt, Gašper Tkačik, Pieter Rein ten Wolde
Comments: 9 pages, 2 figures
Subjects: Biological Physics (physics.bio-ph); Statistical Mechanics (cond-mat.stat-mech); Information Theory (cs.IT); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[715] arXiv:2508.16489 (cross-list from physics.ao-ph) [pdf, html, other]
Title: Ensembles of Neural Surrogates for Parametric Sensitivity in Ocean Modeling
Yixuan Sun, Romain Egele, Sri Hari Krishna Narayanan, Luke Van Roekel, Carmelo Gonzales, Steven Brus, Balu Nadiga, Sandeep Madireddy, Prasanna Balaprakash
Comments: 12 pages, 7 figures
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (cs.LG)
[716] arXiv:2508.16485 (cross-list from stat.ML) [pdf, html, other]
Title: Underdamped Langevin MCMC with third order convergence
Maximilian Scott, Dáire O'Kane, Andraž Jelinčič, James Foster
Comments: 62 pages, 7 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA); Probability (math.PR); Statistics Theory (math.ST)
[717] arXiv:2508.16474 (cross-list from eess.SY) [pdf, html, other]
Title: Reinforcement Learning-based Control via Y-wise Affine Neural Networks (YANNs)
Austin Braniff, Yuhe Tian
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC)
[718] arXiv:2508.16465 (cross-list from cs.CV) [pdf, html, other]
Title: HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images
Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Jean-Sébastien Franco, Grégory Rogez
Comments: 12 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Robotics (cs.RO)
[719] arXiv:2508.16453 (cross-list from cs.SI) [pdf, html, other]
Title: Anti-establishment sentiment on TikTok: Implications for understanding influence(rs) and expertise on social media
Tianliang Xu, Ariel Hasell, Sabina Tomkins
Comments: 10 pages excluding references; 14 pages in total; 4 figures; Accepted by the AAAI Conference on Web and Social Media (ICWSM-2026)
Subjects: Social and Information Networks (cs.SI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[720] arXiv:2508.16448 (cross-list from cs.MM) [pdf, html, other]
Title: Beyond Interpretability: Exploring the Comprehensibility of Adaptive Video Streaming through Large Language Models
Lianchen Jia, Chaoyang Li, Ziqi Yuan, Jiahui Chen, Tianchi Huang, Jiangchuan Liu, Lifeng Sun
Comments: ACM Multimedia2025
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[721] arXiv:2508.16440 (cross-list from cs.MA) [pdf, other]
Title: Integrated Noise and Safety Management in UAM via A Unified Reinforcement Learning Framework
Surya Murthy, Zhenyu Gao, John-Paul Clarke, Ufuk Topcu
Subjects: Multiagent Systems (cs.MA); Machine Learning (cs.LG)
[722] arXiv:2508.16434 (cross-list from stat.ML) [pdf, html, other]
Title: Deep Intrinsic Coregionalization Multi-Output Gaussian Process Surrogate with Active Learning
Chun-Yi Chang, Chih-Li Sung
Comments: 41 pages, 12 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[723] arXiv:2508.16419 (cross-list from cs.SE) [pdf, html, other]
Title: LLM-GUARD: Large Language Model-Based Detection and Repair of Bugs and Security Vulnerabilities in C++ and Python
Akshay Mhatre, Noujoud Nader, Patrick Diehl, Deepti Gupta
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
[724] arXiv:2508.16401 (cross-list from cs.GR) [pdf, html, other]
Title: Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars
NVIDIA: Chaeyeon Chung, Ilya Fedorov, Michael Huang, Aleksey Karmanov, Dmitry Korobchenko, Roger Ribera, Yeongho Seol
Subjects: Graphics (cs.GR); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[725] arXiv:2508.16390 (cross-list from cs.CL) [pdf, html, other]
Title: MedQARo: A Large-Scale Benchmark for Medical Question Answering in Romanian
Ana-Cristina Rogoz, Radu Tudor Ionescu, Alexandra-Valentina Anghel, Ionut-Lucian Antone-Iordache, Simona Coniac, Andreea Iuliana Ionescu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[726] arXiv:2508.16345 (cross-list from cs.LO) [pdf, other]
Title: Uppaal Coshy: Automatic Synthesis of Compact Shields for Hybrid Systems
Asger Horn Brorholt, Andreas Holck Høeg-Petersen, Peter Gjøl Jensen, Kim Guldstrand Larsen, Marius Mikučionis, Christian Schilling, Andrzej Wąsowski
Comments: 12 pages and 6 figures. Additional abstract of 4 pages and 4 figures. Extended version with supplementary material for an article to appear in the 2025 International Conference on Reachability Problems (RP)
Subjects: Logic in Computer Science (cs.LO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[727] arXiv:2508.16306 (cross-list from stat.ML) [pdf, html, other]
Title: A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions
Nishant Jain, Tong Zhang
Comments: 30 pages, 1 figure
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Analysis of PDEs (math.AP); Statistics Theory (math.ST)
[728] arXiv:2508.16271 (cross-list from cs.CV) [pdf, html, other]
Title: Structuring GUI Elements through Vision Language Models: Towards Action Space Generation
Yi Xu, Yesheng Zhang, jiajia Liu, Jingdong Chen
Comments: 10pageV0
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[729] arXiv:2508.16245 (cross-list from cs.GT) [pdf, html, other]
Title: Limit-Computable Grains of Truth for Arbitrary Computable Extensive-Form (Un)Known Games
Cole Wyeth, Marcus Hutter, Jan Leike, Jessica Taylor
Comments: 42 pages; 2 figures; 7 algorithms
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Theoretical Economics (econ.TH)
[730] arXiv:2508.16225 (cross-list from cs.CV) [pdf, html, other]
Title: An Investigation of Visual Foundation Models Robustness
Sandeep Gupta, Roberto Passerone
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[731] arXiv:2508.16223 (cross-list from cs.SI) [pdf, other]
Title: Dac-Fake: A Divide and Conquer Framework for Detecting Fake News on Social Media
Mayank Kumar Jain, Dinesh Gopalani, Yogesh Kumar Meena, Nishant Jain
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG)
[732] arXiv:2508.16216 (cross-list from cs.NE) [pdf, html, other]
Title: Spike Agreement Dependent Plasticity: A scalable Bio-Inspired learning paradigm for Spiking Neural Networks
Saptarshi Bej, Muhammed Sahad E, Gouri Lakshmi, Harshit Kumar, Pritam Kar, Bikas C Das
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[733] arXiv:2508.16212 (cross-list from cs.CV) [pdf, html, other]
Title: OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models
Huanpeng Chu, Wei Wu, Guanyu Fen, Yutao Zhang
Comments: Accepted by ICCV 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[734] arXiv:2508.16210 (cross-list from cs.IR) [pdf, html, other]
Title: Modeling User Preferences as Distributions for Optimal Transport-based Cross-domain Recommendation under Non-overlapping Settings
Ziyin Xiao, Toyotaro Suzumura
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[735] arXiv:2508.16209 (cross-list from physics.med-ph) [pdf, other]
Title: Deep learning-enabled virtual multiplexed immunostaining of label-free tissue for vascular invasion assessment
Yijie Zhang, Cagatay Isil, Xilin Yang, Yuzhu Li, Anna Elia, Karin Atlan, William Dean Wallace, Nir Pillar, Aydogan Ozcan
Comments: 29 Pages, 7 Figures
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[736] arXiv:2508.16200 (cross-list from cs.ET) [pdf, html, other]
Title: Set Transformer Architectures and Synthetic Data Generation for Flow-Guided Nanoscale Localization
Mika Leo Hube, Filip Lemic, Ethungshan Shitiri, Gerard Calvo Bartra, Sergi Abadal, Xavier Costa Pérez
Comments: 6 pages, 4 figures, 4 tables, 26 references, accepted at ACM NanoCom'25
Subjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[737] arXiv:2508.16124 (cross-list from cs.CV) [pdf, html, other]
Title: Domain Adaptation via Feature Refinement
Savvas Karatsiolis, Andreas Kamilaris
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[738] arXiv:2508.16114 (cross-list from astro-ph.GA) [pdf, html, other]
Title: Neural-Network Chemical Emulator for First-Star Formation: Robust Iterative Predictions over a Wide Density Range
Sojun Ono, Kazuyuki Sugimura
Comments: 18 pages, 7 figures, Submitted to ApJ
Subjects: Astrophysics of Galaxies (astro-ph.GA); Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Machine Learning (cs.LG)
[739] arXiv:2508.16109 (cross-list from cs.CL) [pdf, html, other]
Title: From Indirect Object Identification to Syllogisms: Exploring Binary Mechanisms in Transformer Circuits
Karim Saraipour, Shichang Zhang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[740] arXiv:2508.16100 (cross-list from cs.CL) [pdf, html, other]
Title: CYCLE-INSTRUCT: Fully Seed-Free Instruction Tuning via Dual Self-Training and Cycle Consistency
Zhanming Shen, Hao Chen, Yulei Tang, Shaolin Zhu, Wentao Ye, Xiaomeng Hu, Haobo Wang, Gang Chen, Junbo Zhao
Comments: EMNLP 2025 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[741] arXiv:2508.16081 (cross-list from cs.CL) [pdf, html, other]
Title: CEQuest: Benchmarking Large Language Models for Construction Estimation
Yanzhao Wu, Lufan Wang, Rui Liu
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[742] arXiv:2508.16077 (cross-list from cs.HC) [pdf, html, other]
Title: Cooperative Design Optimization through Natural Language Interaction
Ryogo Niwa, Shigeo Yoshida, Yuki Koyama, Yoshitaka Ushiku
Comments: 25 pages, 20 figures, to appear in Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25), September 28-October 1, 2025, Busan, Republic of Korea
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[743] arXiv:2508.16067 (cross-list from physics.comp-ph) [pdf, html, other]
Title: Training a Foundation Model for Materials on a Budget
Teddy Koker, Tess Smidt
Subjects: Computational Physics (physics.comp-ph); Machine Learning (cs.LG)
[744] arXiv:2508.16030 (cross-list from cs.CV) [pdf, html, other]
Title: CoVeRaP: Cooperative Vehicular Perception through mmWave FMCW Radars
Jinyue Song, Hansol Ku, Jayneel Vora, Nelson Lee, Ahmad Kamari, Prasant Mohapatra, Parth Pathak
Comments: Accepted at ICCCN 2025 (IEEE International Conference on Computer Communications and Networks), Tokyo, Japan, August 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[745] arXiv:2508.16027 (cross-list from stat.ML) [pdf, html, other]
Title: Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning
Baiyuan Chen, Shinji Ito, Masaaki Imaizumi
Comments: 28 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[746] arXiv:2508.16012 (cross-list from cond-mat.mtrl-sci) [pdf, html, other]
Title: FIRE-GNN: Force-informed, Relaxed Equivariance Graph Neural Network for Rapid and Accurate Prediction of Surface Properties
Circe Hsu, Claire Schlesinger, Karan Mudaliar, Jordan Leung, Robin Walters, Peter Schindler
Subjects: Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[747] arXiv:2508.16011 (cross-list from cs.ET) [pdf, other]
Title: HePGA: A Heterogeneous Processing-in-Memory based GNN Training Accelerator
Chukwufumnanya Ogbogu, Gaurav Narang, Biresh Kumar Joardar, Janardhan Rao Doppa, Krishnendu Chakrabarty, Partha Pratim Pande
Subjects: Emerging Technologies (cs.ET); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[748] arXiv:2508.16000 (cross-list from eess.IV) [pdf, other]
Title: Cross-Attention Multimodal Fusion for Breast Cancer Diagnosis: Integrating Mammography and Clinical Data with Explainability
Muhaisin Tiyumba Nantogmah, Abdul-Barik Alhassan, Salamudeen Alhassan
Comments: 11 pages, 9 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[749] arXiv:2508.15987 (cross-list from cs.CR) [pdf, html, other]
Title: PickleBall: Secure Deserialization of Pickle-based Machine Learning Models
Andreas D. Kellas, Neophytos Christou, Wenxin Jiang, Penghui Li, Laurent Simon, Yaniv David, Vasileios P. Kemerlis, James C. Davis, Junfeng Yang
Comments: To be published in the proceedings of 2025 ACM CCS
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[750] arXiv:2508.15983 (cross-list from cond-mat.mtrl-sci) [pdf, html, other]
Title: A simulation-based training framework for machine-learning applications in ARPES
MengXing Na, Chris Zhou, Sydney K. Y. Dufresne, Matteo Michiardi, Andrea Damascelli
Comments: 9 pages, 6 figures
Subjects: Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[751] arXiv:2508.15951 (cross-list from math.OC) [pdf, html, other]
Title: A User Manual for cuHALLaR: A GPU Accelerated Low-Rank Semidefinite Programming Solver
Jacob Aguirre, Diego Cifuentes, Vincent Guigues, Renato D.C. Monteiro, Victor Hugo Nascimento, Arnesh Sujanani
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Mathematical Software (cs.MS); Numerical Analysis (math.NA)
[752] arXiv:2508.15947 (cross-list from eess.SP) [pdf, html, other]
Title: Continuous Determination of Respiratory Rate in Hospitalized Patients using Machine Learning Applied to Electrocardiogram Telemetry
Thomas Kite, Brian Ayers, Nicholas Houstis, Asishana A. Osho, Thoralf M. Sundt, Aaron D Aguirre
Comments: 15 pages, 8 figures, 2 tables
Subjects: Signal Processing (eess.SP); Computers and Society (cs.CY); Machine Learning (cs.LG)
[753] arXiv:2508.15934 (cross-list from cs.CR) [pdf, html, other]
Title: Strategic Sample Selection for Improved Clean-Label Backdoor Attacks in Text Classification
Onur Alp Kirci, M. Emre Gursoy
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[754] arXiv:2508.15932 (cross-list from stat.ML) [pdf, html, other]
Title: Interpretable Kernels
Patrick J.F. Groenen, Michael Greenacre
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[755] arXiv:2508.15922 (cross-list from q-fin.ST) [pdf, html, other]
Title: Probabilistic Forecasting Cryptocurrencies Volatility: From Point to Quantile Forecasts
Grzegorz Dudek, Witold Orzeszko, Piotr Fiszeder
Comments: DSAA'25 conference paper
Subjects: Statistical Finance (q-fin.ST); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[756] arXiv:2508.15899 (cross-list from astro-ph.CO) [pdf, html, other]
Title: CIGaRS I: Combined simulation-based inference from SNae Ia and host photometry
Konstantin Karchev, Roberto Trotta, Raul Jimenez
Comments: submitted to Nature Astronomy; 8 pages, 6 figures + supplementary material
Subjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); Astrophysics of Galaxies (astro-ph.GA); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG)
[757] arXiv:2508.15884 (cross-list from cs.CL) [pdf, html, other]
Title: Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
Yuxian Gu, Qinghao Hu, Shang Yang, Haocheng Xi, Junyu Chen, Song Han, Han Cai
Comments: Tech Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[758] arXiv:2508.15883 (cross-list from eess.IV) [pdf, html, other]
Title: Beyond Imaging: Vision Transformer Digital Twin Surrogates for 3D+T Biological Tissue Dynamics
Kaan Berke Ugurlar, Joaquín de Navascués, Michael Taynnan Barros
Comments: Submitted for journal publication
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
[759] arXiv:2508.15882 (cross-list from cs.SD) [pdf, html, other]
Title: Beyond Transcription: Mechanistic Interpretability in ASR
Neta Glazer, Yael Segal-Feldman, Hilit Segev, Aviv Shamsian, Asaf Buchnick, Gill Hetz, Ethan Fetaya, Joseph Keshet, Aviv Navon
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[760] arXiv:2508.15878 (cross-list from cs.LO) [pdf, html, other]
Title: Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs
Terry Jingchen Zhang, Wenyuan Jiang, Rongchuan Liu, Yisong Wang, Junran Yang, Ning Wang, Nicole Ni, Yinya Huang, Mrinmaya Sachan
Comments: Accepted to AI4MATH@ICML2025
Subjects: Logic in Computer Science (cs.LO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[761] arXiv:2508.15877 (cross-list from cs.CL) [pdf, html, other]
Title: Annif at the GermEval-2025 LLMs4Subjects Task: Traditional XMTC Augmented by Efficient LLMs
Osma Suominen, Juho Inkinen, Mona Lehtinen
Comments: 5 pages, 4 figures, accepted at KONVENS 2025. arXiv admin note: substantial text overlap with arXiv:2504.19675
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[762] arXiv:2508.15866 (cross-list from cs.PL) [pdf, html, other]
Title: Correctness-Guaranteed Code Generation via Constrained Decoding
Lingxiao Li, Salar Rahili, Yiwei Zhao
Comments: Published at COLM 2025
Subjects: Programming Languages (cs.PL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[763] arXiv:2508.15850 (cross-list from cs.CR) [pdf, html, other]
Title: Linkage Attacks Expose Identity Risks in Public ECG Data Sharing
Ziyu Wang, Elahe Khatibi, Farshad Firouzi, Sanaz Rahimi Mousavi, Krishnendu Chakrabarty, Amir M. Rahmani
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[764] arXiv:2508.15847 (cross-list from cs.CL) [pdf, html, other]
Title: Mechanistic Exploration of Backdoored Large Language Model Attention Patterns
Mohammed Abu Baker, Lakshmi Babu-Saheer
Comments: 13 pages. Mechanistic analysis of backdoored LLMs (Qwen2.5-3B). Code: this https URL. Base model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit. Finetuned models: this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[765] arXiv:2508.15842 (cross-list from cs.CL) [pdf, html, other]
Title: Lexical Hints of Accuracy in LLM Reasoning Chains
Arne Vanhoyweghen, Brecht Verbeken, Andres Algaba, Vincent Ginis
Comments: 21 pages, 7 figures, 6 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[766] arXiv:2508.15841 (cross-list from cs.CL) [pdf, other]
Title: A Review of Developmental Interpretability in Large Language Models
Ihor Kendiukhov
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[767] arXiv:2508.15837 (cross-list from cs.CL) [pdf, other]
Title: Statistical Comparative Analysis of Semantic Similarities and Model Transferability Across Datasets for Short Answer Grading
Sridevi Bonthu, S.Rama Sree, M.H.M. Krishna Prasad
Journal-ref: Int. J. Intell. Syst. Appl. Eng., 12(15s), 530-538, 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[768] arXiv:2508.15836 (cross-list from cs.CL) [pdf, html, other]
Title: MorphNAS: Differentiable Architecture Search for Morphologically-Aware Multilingual NER
Prathamesh Devadiga, Omkaar Jayadev Shetty, Hiya Nachnani, Prema R
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[769] arXiv:2508.15829 (cross-list from cs.CL) [pdf, html, other]
Title: Mining Mental Health Signals: A Comparative Study of Four Machine Learning Methods for Depression Detection from Social Media Posts in Sorani Kurdish
Idrees Mohammed, Hossein Hassani
Comments: 13 pages, 4 figures, 5 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[770] arXiv:2508.15827 (cross-list from cs.CL) [pdf, html, other]
Title: Mini-Omni-Reasoner: Token-Level Thinking-in-Speaking in Large Speech Models
Zhifei Xie, Ziyang Ma, Zihang Liu, Kaiyu Pang, Hongyu Li, Jialin Zhang, Yue Liao, Deheng Ye, Chunyan Miao, Shuicheng Yan
Comments: Technical report; Work in progress. Project page: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[771] arXiv:2508.15823 (cross-list from cs.CL) [pdf, html, other]
Title: SDEC: Semantic Deep Embedded Clustering
Mohammad Wali Ur Rahman, Ric Nevarez, Lamia Tasnim Mim, Salim Hariri
Comments: Accepted for publication in IEEE Transactions on Big Data
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[772] arXiv:2508.15816 (cross-list from cs.NI) [pdf, html, other]
Title: Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations
Mauro Belgiovine, Chris Dick, Kaushik Chowdhury
Comments: Submitted to IEEE Transactions on Mobile Computing (second round of review)
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[773] arXiv:2508.15810 (cross-list from cs.CL) [pdf, html, other]
Title: Detecting Hope, Hate, and Emotion in Arabic Textual Speech and Multi-modal Memes Using Large Language Models
Nouar AlDahoul, Yasir Zaki
Comments: 26 pages, 12 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[774] arXiv:2508.15805 (cross-list from cs.CL) [pdf, html, other]
Title: ALAS: Autonomous Learning Agent for Self-Updating Language Models
Dhruv Atreja
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[775] arXiv:2508.15801 (cross-list from cs.CL) [pdf, other]
Title: LingVarBench: Benchmarking LLM for Automated Named Entity Recognition in Structured Synthetic Spoken Transcriptions
Seyedali Mohammadi, Manas Paldhe, Amit Chhabra
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[776] arXiv:2508.15800 (cross-list from cs.CL) [pdf, html, other]
Title: A BERT-based Hierarchical Classification Model with Applications in Chinese Commodity Classification
Kun Liu, Tuozhen Liu, Feifei Wang, Rui Pan
Comments: 29 pages, 3 figures, and 8 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[777] arXiv:2508.15797 (cross-list from cs.CL) [pdf, html, other]
Title: Benchmarking the Medical Understanding and Reasoning of Large Language Models in Arabic Healthcare Tasks
Nouar AlDahoul, Yasir Zaki
Comments: 5 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[778] arXiv:2508.15796 (cross-list from cs.CL) [pdf, html, other]
Title: Benchmarking the Legal Reasoning of LLMs in Arabic Islamic Inheritance Cases
Nouar AlDahoul, Yasir Zaki
Comments: 5 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[779] arXiv:2508.15784 (cross-list from q-bio.NC) [pdf, html, other]
Title: Emergent time-keeping mechanisms in a deep reinforcement learning agent performing an interval timing task
Amrapali Pednekar, Alvaro Garrido, Pieter Simoens, Yara Khaluf
Comments: Accepted at 2025 Artificial Life Conference
Subjects: Neurons and Cognition (q-bio.NC); Machine Learning (cs.LG)
[780] arXiv:2508.11696 (cross-list from cs.CV) [pdf, html, other]
Title: A Deep Learning-Based CCTV System for Automatic Smoking Detection in Fire Exit Zones
Sami Sadat, Mohammad Irtiza Hossain, Junaid Ahmed Sifat, Suhail Haque Rafi, Md. Waseq Alauddin Alvi, Md. Khalilur Rhaman
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)