Machine Learning

Authors and titles for recent submissions

  • Fri, 16 May 2025
  • Thu, 15 May 2025
  • Wed, 14 May 2025
  • Tue, 13 May 2025
  • Mon, 12 May 2025

Total of 799 entries

Fri, 16 May 2025 (showing first 50 of 145 entries)

[1] arXiv:2505.10559 [pdf, html, other]
Title: Neural Thermodynamic Laws for Large Language Model Training
Ziming Liu, Yizhou Liu, Jeff Gore, Max Tegmark
Comments: 18 pages, 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[2] arXiv:2505.10556 [pdf, other]
Title: An AI-driven framework for the prediction of personalised health response to air pollution
Nazanin Zounemat Kermani, Sadjad Naderi, Claire H. Dilliway, Claire E. Heaney, Shrreya Behll, Boyang Chen, Hisham Abubakar-Waziri, Alexandra E. Porter, Marc Chadeau-Hyam, Fangxin Fang, Ian M. Adcock, Kian Fan Chung, Christopher C. Pain
Comments: Kermani and Naderi share first authorship. 20 pages, 6 figures and 1 table
Subjects: Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[3] arXiv:2505.10545 [pdf, html, other]
Title: Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design
Amira Alakhdar, Barnabas Poczos, Newell Washburn
Subjects: Machine Learning (cs.LG)
[4] arXiv:2505.10526 [pdf, html, other]
Title: MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models
Mugilan Ganesan, Shane Segal, Ankur Aggarwal, Nish Sinnadurai, Sean Lie, Vithursan Thangarasa
Comments: Main paper: 11 pp., 4 figs., 3 tabs.; Supplementary: 2 pp
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[5] arXiv:2505.10515 [pdf, html, other]
Title: PnPXAI: A Universal XAI Framework Providing Automatic Explanations Across Diverse Modalities and Models
Seongun Kim, Sol A Kim, Geonhyeong Kim, Enver Menadjiev, Chanwoo Lee, Seongwook Chung, Nari Kim, Jaesik Choi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[6] arXiv:2505.10495 [pdf, html, other]
Title: RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs
Vibha Belavadi, Tushar Vatsa, Dewang Sultania, Suhas Suresha, Ishita Verma, Cheng Chen, Tracy Holloway King, Michael Friedrich
Comments: Proceedings of the 4th International Workshop on Knowledge-Augmented Methods for Natural Language Processing
Journal-ref: https://aclanthology.org/2025.knowledgenlp-1.10/ KnowledgeNLP 2025
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[7] arXiv:2505.10484 [pdf, html, other]
Title: Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning
Andrea Baisero, Rupali Bhati, Shuo Liu, Aathira Pillai, Christopher Amato
Subjects: Machine Learning (cs.LG)
[8] arXiv:2505.10482 [pdf, html, other]
Title: Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang, Jiaxuan Gao, Feng Gao, Yi Wu, Chao Yu
Comments: 9 pages for main text, 23 pages in total, submitted to NeurIPS, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[9] arXiv:2505.10475 [pdf, other]
Title: Parallel Scaling Law for Language Models
Mouxiang Chen, Binyuan Hui, Zeyu Cui, Jiaxi Yang, Dayiheng Liu, Jianling Sun, Junyang Lin, Zhongxin Liu
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[10] arXiv:2505.10472 [pdf, html, other]
Title: Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI
Agnik Saha, Victoria Churchill, Anny D. Rodriguez, Ugur Kursuncu, Muhammed Y. Idris
Subjects: Machine Learning (cs.LG)
[11] arXiv:2505.10465 [pdf, html, other]
Title: Superposition Yields Robust Neural Scaling
Yizhou Liu, Ziming Liu, Jeff Gore
Comments: 30 pages, 23 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[12] arXiv:2505.10457 [pdf, html, other]
Title: SEAL: Searching Expandable Architectures for Incremental Learning
Matteo Gambella, Vicente Javier Castro Solar, Manuel Roveri
Comments: 8 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[13] arXiv:2505.10441 [pdf, html, other]
Title: PIF: Anomaly detection via preference embedding
Filippo Leveni, Luca Magri, Giacomo Boracchi, Cesare Alippi
Comments: Accepted at International Conference on Pattern Recognition (ICPR 2020)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[14] arXiv:2505.10438 [pdf, html, other]
Title: Identification and Optimal Nonlinear Control of Turbojet Engine Using Koopman Eigenfunction Model
David Grasev
Comments: 51 pages, 28 figures
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[15] arXiv:2505.10432 [pdf, html, other]
Title: Score-based diffusion nowcasting of GOES imagery
Randy J. Chase, Katherine Haynes, Lander Ver Hoef, Imme Ebert-Uphoff
Subjects: Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[16] arXiv:2505.10425 [pdf, html, other]
Title: Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs
Jingyao Wang, Wenwen Qiang, Zeen Song, Changwen Zheng, Hui Xiong
Subjects: Machine Learning (cs.LG)
[17] arXiv:2505.10423 [pdf, html, other]
Title: The Power of Random Features and the Limits of Distribution-Free Gradient Descent
Ari Karchmer, Eran Malach
Subjects: Machine Learning (cs.LG)
[18] arXiv:2505.10422 [pdf, html, other]
Title: Decomposed Inductive Procedure Learning: Learning Academic Tasks with Human-Like Data Efficiency
Daniel Weitekamp, Christopher MacLellan, Erik Harpstead, Kenneth Koedinger
Comments: To appear in CogSci 2025
Subjects: Machine Learning (cs.LG)
[19] arXiv:2505.10407 [pdf, html, other]
Title: Two-Stage Generative Model for Intracranial Aneurysm Meshes with Morphological Marker Conditioning
Wenhao Ding, Choon Hwai Yap, Kangjun Ji, Simão Castro
Comments: 10 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[20] arXiv:2505.10392 [pdf, html, other]
Title: Schreier-Coset Graph Propagation
Aryan Mishra, Lizhen Lin
Comments: 9 pages, 1 figure, preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[21] arXiv:2505.10360 [pdf, html, other]
Title: FactsR: A Safer Method for Producing High Quality Healthcare Documentation
Victor Petrén Bach Hansen, Lasse Krogsbøll, Jonas Lyngsø, Mathias Baltzersen, Andreas Motzfeldt, Kevin Pelgrims, Lars Maaløe
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP)
[22] arXiv:2505.10347 [pdf, html, other]
Title: Uniform Loss vs. Specialized Optimization: A Comparative Analysis in Multi-Task Learning
Gabriel S. Gama, Valdir Grassi Jr
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[23] arXiv:2505.10344 [pdf, other]
Title: An Introduction to Discrete Variational Autoencoders
Alan Jeffares, Liyuan Liu
Comments: Tutorial paper
Subjects: Machine Learning (cs.LG)
[24] arXiv:2505.10331 [pdf, html, other]
Title: Emergence of Structure in Ensembles of Random Neural Networks
Luca Muscarnera, Luigi Loreti, Giovanni Todeschini, Alessio Fumagalli, Francesco Regazzoni
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[25] arXiv:2505.10330 [pdf, other]
Title: Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change
Jonathan Clifford Balloch
Comments: PhD Dissertation, 131 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[26] arXiv:2505.10325 [pdf, html, other]
Title: A Representation Learning Approach to Feature Drift Detection in Wireless Networks
Athanasios Tziouvaras, Blaz Bertalanic, George Floros, Kostas Kolomvatsos, Panagiotis Sarigiannidis, Carolina Fortuna
Subjects: Machine Learning (cs.LG)
[27] arXiv:2505.10322 [pdf, html, other]
Title: Asynchronous Decentralized SGD under Non-Convexity: A Block-Coordinate Descent Framework
Yijie Zhou, Shi Pu
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[28] arXiv:2505.10307 [pdf, html, other]
Title: Negative Metric Learning for Graphs
Yiyang Zhao, Chengpei Wu, Lilin Zhang, Ning Yang
Subjects: Machine Learning (cs.LG)
[29] arXiv:2505.10297 [pdf, html, other]
Title: Defending the Edge: Representative-Attention for Mitigating Backdoor Attacks in Federated Learning
Chibueze Peace Obioma, Youcheng Sun, Mustafa A. Mustafa
Comments: Submitted to ESORICS 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[30] arXiv:2505.10296 [pdf, html, other]
Title: Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning
Jiaju Qi, Lei Lei, Thorsteinn Jonsson, Dusit Niyato
Subjects: Machine Learning (cs.LG)
[31] arXiv:2505.10272 [pdf, html, other]
Title: Spike-timing-dependent Hebbian learning as noisy gradient descent
Niklas Dexheimer, Sascha Gaudlitz, Johannes Schmidt-Hieber
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[32] arXiv:2505.10271 [pdf, html, other]
Title: RainPro-8: An Efficient Deep Learning Model to Estimate Rainfall Probabilities Over 8 Hours
Rafael Pablos Sarabia, Joachim Nyborg, Morten Birk, Jeppe Liborius Sjørup, Anders Lillevang Vesterholt, Ira Assent
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[33] arXiv:2505.10264 [pdf, html, other]
Title: Cutting Through Privacy: A Hyperplane-Based Data Reconstruction Attack in Federated Learning
Francesco Diana, André Nusser, Chuan Xu, Giovanni Neglia
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[34] arXiv:2505.10262 [pdf, html, other]
Title: Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Jiaju Qi, Lei Lei, Thorsteinn Jonsson, Lajos Hanzo
Subjects: Machine Learning (cs.LG)
[35] arXiv:2505.10259 [pdf, html, other]
Title: SpecOffload: Unlocking Latent GPU Capacity for LLM Inference on Resource-Constrained Devices
Xiangwen Zhuge, Xu Shen, Zeyu Wang, Fan Dang, Xuan Ding, Danyang Li, Yahui Han, Tianxiang Hao, Zheng Yang
Subjects: Machine Learning (cs.LG)
[36] arXiv:2505.10222 [pdf, html, other]
Title: ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
Jintian Shao, Hongyi Huang, Jiayi Wu, Beiwen Zhang, ZhiYu Wu, You Shan, MingKai Zheng
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[37] arXiv:2505.10213 [pdf, html, other]
Title: Informed Forecasting: Leveraging Auxiliary Knowledge to Boost LLM Performance on Time Series Forecasting
Mohammadmahdi Ghasemloo, Alireza Moradi
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[38] arXiv:2505.10198 [pdf, html, other]
Title: A multi-head deep fusion model for recognition of cattle foraging events using sound and movement signals
Mariano Ferrero, José Omar Chelotti, Luciano Sebastián Martinez-Rau, Leandro Vignolo, Martín Pires, Julio Ricardo Galli, Leonardo Luis Giovanini, Hugo Leonardo Rufiner
Comments: Preprint submitted to Engineering Applications of Artificial Intelligence
Subjects: Machine Learning (cs.LG)
[39] arXiv:2505.10192 [pdf, html, other]
Title: Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data
Prashant P. Shinde, Priyadarshini P. Pai, Shashishekar P. Adiga, K. Subramanya Mayya, Yongbeom Seo, Myungsoo Hwang, Heeyoung Go, Changmin Park
Subjects: Machine Learning (cs.LG)
[40] arXiv:2505.10172 [pdf, html, other]
Title: Does Scaling Law Apply in Time Series Forecasting?
Zeyan Li, Libing Chen, Yin Tang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[41] arXiv:2505.10167 [pdf, html, other]
Title: QuXAI: Explainers for Hybrid Quantum Machine Learning Models
Saikat Barua, Mostafizur Rahman, Shehenaz Khaled, Md Jafor Sadek, Rafiul Islam, Shahnewaz Siddique
Comments: 16 pages, 6 figures, 7 equations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantum Physics (quant-ph)
[42] arXiv:2505.10147 [pdf, html, other]
Title: Near Optimal Best Arm Identification for Clustered Bandits
Yash, Nikhil Karamchandani, Avishek Ghosh
Comments: To be published in ICML 2025
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[43] arXiv:2505.10128 [pdf, html, other]
Title: Robust Federated Learning on Edge Devices with Domain Heterogeneity
Huy Q. Le, Latif U. Khan, Choong Seon Hong
Comments: IWCMC 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[44] arXiv:2505.10125 [pdf, html, other]
Title: Enhancing the Performance of Global Model by Improving the Adaptability of Local Models in Federated Learning
Wujun Zhou, Shu Ding, ZeLin Li, Wei Wang
Subjects: Machine Learning (cs.LG)
[45] arXiv:2505.10120 [pdf, other]
Title: All You Need Is Synthetic Task Augmentation
Guillaume Godin
Comments: 14 pages, 3 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[46] arXiv:2505.10117 [pdf, html, other]
Title: Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
JieHao Wu, Ziwei Wang, Junjie Sheng, Wenhao Li, Xiangfei Wang, Jun Luo
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[47] arXiv:2505.10083 [pdf, html, other]
Title: ChronoSteer: Bridging Large Language Model and Time Series Foundation Model via Synthetic Data
Chengsen Wang, Qi Qi, Zhongwen Rao, Lujia Pan, Jingyu Wang, Jianxin Liao
Subjects: Machine Learning (cs.LG)
[48] arXiv:2505.10057 [pdf, html, other]
Title: JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation
Tiancong Cheng, Ying Zhang, Yuxuan Liang, Roger Zimmermann, Zhiwen Yu, Bin Guo
Subjects: Machine Learning (cs.LG)
[49] arXiv:2505.10050 [pdf, html, other]
Title: Financial Fraud Detection Using Explainable AI and Stacking Ensemble Methods
Fahad Almalki, Mehedi Masud
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[50] arXiv:2505.10040 [pdf, html, other]
Title: Instance-Prototype Affinity Learning for Non-Exemplar Continual Graph Learning
Lei Song, Jiaxing Li, Shihan Guan, Youyong Kong
Subjects: Machine Learning (cs.LG)