close this message
arXiv smileybones

arXiv Is Hiring a DevOps Engineer

Work on one of the world's most important websites and make an impact on open science.

View Jobs
Skip to main content
Cornell University

arXiv Is Hiring a DevOps Engineer

View Jobs
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.LG

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for March 2025

Total of 3681 entries : 1-50 51-100 101-150 151-200 ... 3651-3681
Showing up to 50 entries per page: fewer | more | all
[1] arXiv:2503.00028 [pdf, html, other]
Title: Genetics-Driven Personalized Disease Progression Model
Haoyu Yang, Sanjoy Dey, Pablo Meyer
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[2] arXiv:2503.00029 [pdf, html, other]
Title: Streaming Looking Ahead with Token-level Self-reward
Hongming Zhang, Ruixin Hong, Dong Yu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3] arXiv:2503.00030 [pdf, html, other]
Title: Game-Theoretic Regularized Self-Play Alignment of Large Language Models
Xiaohang Tang, Sangwoong Yoon, Seongho Son, Huizhuo Yuan, Quanquan Gu, Ilija Bogunovic
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[4] arXiv:2503.00031 [pdf, html, other]
Title: Efficient Test-Time Scaling via Self-Calibration
Chengsong Huang, Langlin Huang, Jixuan Leng, Jiacheng Liu, Jiaxin Huang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[5] arXiv:2503.00033 [pdf, html, other]
Title: optimizn: a Python Library for Developing Customized Optimization Algorithms
Akshay Sathiya, Rohit Pandey
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[6] arXiv:2503.00034 [pdf, html, other]
Title: MergeIT: From Selection to Merging for Efficient Instruction Tuning
Hongyi Cai, Yuqian Fu, Hongming Fu, Bo Zhao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[7] arXiv:2503.00094 [pdf, html, other]
Title: Gaussian process surrogate model to approximate power grid simulators -- An application to the certification of a congestion management controller
Pierre Houdouin, Lucas Saludjian
Comments: 6 pages, 7 figures, conference
Subjects: Machine Learning (cs.LG)
[8] arXiv:2503.00127 [pdf, other]
Title: DISCO: Internal Evaluation of Density-Based Clustering
Anna Beer, Lena Krieger, Pascal Weber, Martin Ritzert, Ira Assent, Claudia Plant
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[9] arXiv:2503.00152 [pdf, html, other]
Title: Invariant Tokenization of Crystalline Materials for Language Model Enabled Generation
Keqiang Yan, Xiner Li, Hongyi Ling, Kenna Ashen, Carl Edwards, Raymundo Arróyave, Marinka Zitnik, Heng Ji, Xiaofeng Qian, Xiaoning Qian, Shuiwang Ji
Comments: This paper has been accepted as a NeurIPS 2024 Poster
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[10] arXiv:2503.00174 [pdf, html, other]
Title: Optimal Transfer Learning for Missing Not-at-Random Matrix Completion
Akhil Jalan, Yassir Jedra, Arya Mazumdar, Soumendu Sundar Mukherjee, Purnamrita Sarkar
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[11] arXiv:2503.00177 [pdf, html, other]
Title: Steering Large Language Model Activations in Sparse Spaces
Reza Bayat, Ali Rahimi-Kalahroudi, Mohammad Pezeshki, Sarath Chandar, Pascal Vincent
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[12] arXiv:2503.00205 [pdf, html, other]
Title: AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies
Jian Gao, Weidong Cao, Junyi Yang, Xuan Zhang
Comments: ICLR 2025 camera ready
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[13] arXiv:2503.00206 [pdf, html, other]
Title: Quantifying First-Order Markov Violations in Noisy Reinforcement Learning: A Causal Discovery Approach
Naveen Mysore
Comments: Under review for RLC 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[14] arXiv:2503.00210 [pdf, html, other]
Title: Foundation-Model-Boosted Multimodal Learning for fMRI-based Neuropathic Pain Drug Response Prediction
Wenrui Fan, L. M. Riza Rizky, Jiayang Zhang, Chen Chen, Haiping Lu, Kevin Teh, Dinesh Selvarajah, Shuo Zhou
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[15] arXiv:2503.00229 [pdf, html, other]
Title: Armijo Line-search Makes (Stochastic) Gradient Descent Go Fast
Sharan Vaswani, Reza Babanezhad
Comments: 33 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[16] arXiv:2503.00234 [pdf, html, other]
Title: Investigating the Relationship Between Debiasing and Artifact Removal using Saliency Maps
Lukasz Sztukiewicz, Ignacy Stępka, Michał Wiliński, Jerzy Stefanowski
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[17] arXiv:2503.00240 [pdf, html, other]
Title: 1-Lipschitz Network Initialization for Certifiably Robust Classification Applications: A Decay Problem
Marius F. R. Juston, William R. Norris, Dustin Nottage, Ahmet Soylemezoglu
Comments: 12 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[18] arXiv:2503.00245 [pdf, other]
Title: CoSMoEs: Compact Sparse Mixture of Experts
Patrick Huber, Akshat Shrivastava, Ernie Chang, Chinnadhurai Sankar, Ahmed Aly, Adithya Sagar
Comments: 11 pages, 8 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[19] arXiv:2503.00268 [pdf, html, other]
Title: Input Specific Neural Networks
Asghar A. Jadoon, D. Thomas Seidl, Reese E. Jones, Jan N. Fuhg
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Neural and Evolutionary Computing (cs.NE)
[20] arXiv:2503.00269 [pdf, html, other]
Title: Reducing Large Language Model Safety Risks in Women's Health using Semantic Entropy
Jahan C. Penny-Dimri, Magdalena Bachmann, William R. Cooke, Sam Mathewlynn, Samuel Dockree, John Tolladay, Jannik Kossen, Lin Li, Yarin Gal, Gabriel Davis Jones
Comments: 15 pages, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[21] arXiv:2503.00286 [pdf, html, other]
Title: A Unified Framework for Heterogeneous Semi-supervised Learning
Marzi Heidari, Abdullah Alchihabi, Hao Yan, Yuhong Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[22] arXiv:2503.00299 [pdf, html, other]
Title: Hidden Convexity of Fair PCA and Fast Solver via Eigenvalue Optimization
Junhui Shen, Aaron J. Davis, Ding Lu, Zhaojun Bai
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[23] arXiv:2503.00300 [pdf, html, other]
Title: Cauchy Random Features for Operator Learning in Sobolev Space
Chunyang Liao, Deanna Needell, Hayden Schaeffer
Comments: 31 pages
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[24] arXiv:2503.00307 [pdf, html, other]
Title: Remasking Discrete Diffusion Models with Inference-Time Scaling
Guanghan Wang, Yair Schiff, Subham Sekhar Sahoo, Volodymyr Kuleshov
Comments: Project page: this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[25] arXiv:2503.00317 [pdf, html, other]
Title: DeepONet Augmented by Randomized Neural Networks for Efficient Operator Learning in PDEs
Zhaoxi Jiang, Fei Wang
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Computational Physics (physics.comp-ph)
[26] arXiv:2503.00323 [pdf, html, other]
Title: FLStore: Efficient Federated Learning Storage for non-training workloads
Ahmad Faraz Khan, Samuel Fountain, Ahmed M. Abdelmoniem, Ali R. Butt, Ali Anwar
Comments: 11 pages, 19 figures, 2 tables This paper has been accepted at the The Eighth Annual Conference on Machine Learning and Systems (MLSys 2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[27] arXiv:2503.00331 [pdf, other]
Title: PINN-DT: Optimizing Energy Consumption in Smart Building Using Hybrid Physics-Informed Neural Networks and Digital Twin Framework with Blockchain Security
Hajar Kazemi Naeini, Roya Shomali, Abolhassan Pishahang, Hamidreza Hasanzadeh, Mahdieh Mohammadi, Saeid Asadi, Ahmad Gholizadeh Lonbar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[28] arXiv:2503.00334 [pdf, html, other]
Title: MCNet: Monotonic Calibration Networks for Expressive Uncertainty Calibration in Online Advertising
Quanyu Dai, Jiaren Xiao, Zhaocheng Du, Jieming Zhu, Chengxiao Luo, Xiao-Ming Wu, Zhenhua Dong
Comments: Accepted by WWW2025
Journal-ref: THE ACM WEB CONFERENCE 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[29] arXiv:2503.00345 [pdf, html, other]
Title: Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu, Yang Yue, Andrew Zhao, Simon Du, Gao Huang
Comments: arXiv admin note: substantial text overlap with arXiv:2205.15701
Subjects: Machine Learning (cs.LG)
[30] arXiv:2503.00378 [pdf, html, other]
Title: Conditioning on Local Statistics for Scalable Heterogeneous Federated Learning
Rickard Brännvall
Comments: 7 pages, 2 figures, 7 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[31] arXiv:2503.00379 [pdf, html, other]
Title: Improving clustering quality evaluation in noisy Gaussian mixtures
Renato Cordeiro de Amorim, Vladimir Makarenkov
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[32] arXiv:2503.00383 [pdf, html, other]
Title: Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems
Song Xia, Yi Yu, Wenhan Yang, Meiwen Ding, Zhuo Chen, Ling-Yu Duan, Alex C. Kot, Xudong Jiang
Comments: accepted by CVPR2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[33] arXiv:2503.00392 [pdf, html, other]
Title: Progressive Sparse Attention: Algorithm and System Co-design for Efficient Attention in LLM Serving
Qihui Zhou, Peiqi Yin, Pengfei Zuo, James Cheng
Comments: 12 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[34] arXiv:2503.00393 [pdf, html, other]
Title: Reservoir Network with Structural Plasticity for Human Activity Recognition
Abdullah M. Zyarah, Alaa M. Abdul-Hadi, Dhireesha Kudithipudi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[35] arXiv:2503.00407 [pdf, other]
Title: Asynchronous Personalized Federated Learning through Global Memorization
Fan Wan, Yuchen Li, Xueqi Qiu, Rui Sun, Leyuan Zhang, Xingyu Miao, Tianyu Zhang, Haoran Duan, Yang Long
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[36] arXiv:2503.00419 [pdf, html, other]
Title: Heavy-Tailed Linear Bandits: Huber Regression with One-Pass Update
Jing Wang, Yu-Jie Zhang, Peng Zhao, Zhi-Hua Zhou
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[37] arXiv:2503.00426 [pdf, html, other]
Title: Auto-encoding Molecules: Graph-Matching Capabilities Matter
Magnus Cunow, Gerrit Großmann
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[38] arXiv:2503.00458 [pdf, html, other]
Title: Using Machine Learning for move sequence visualization and generation in climbing
Thomas Rimbot, Martin Jaggi, Luis Barba
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[39] arXiv:2503.00470 [pdf, html, other]
Title: Rapid morphology characterization of two-dimensional TMDs and lateral heterostructures based on deep learning
Junqi He, Yujie Zhang, Jialu Wang, Tao Wang, Pan Zhang, Chengjie Cai, Jinxing Yang, Xiao Lin, Xiaohui Yang
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[40] arXiv:2503.00476 [pdf, html, other]
Title: G-OSR: A Comprehensive Benchmark for Graph Open-Set Recognition
Yicong Dong, Rundong He, Guangyao Chen, Wentao Zhang, Zhongyi Han, Jieming Shi, Yilong Yin
Comments: 10 pages,2 figures
Subjects: Machine Learning (cs.LG)
[41] arXiv:2503.00479 [pdf, html, other]
Title: Bayesian Active Learning for Multi-Criteria Comparative Judgement in Educational Assessment
Andy Gray, Alma Rahat, Tom Crick, Stephen Lindsay
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Machine Learning (stat.ML)
[42] arXiv:2503.00485 [pdf, html, other]
Title: Homomorphism Expressivity of Spectral Invariant Graph Neural Networks
Jingchu Gai, Yiheng Du, Bohang Zhang, Haggai Maron, Liwei Wang
Comments: 42 pages
Journal-ref: ICLR 2025 Oral
Subjects: Machine Learning (cs.LG)
[43] arXiv:2503.00499 [pdf, html, other]
Title: Shaping Laser Pulses with Reinforcement Learning
Francesco Capuano, Davorin Peceli, Gabriele Tiboni
Comments: 14 pages
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph); Optics (physics.optics)
[44] arXiv:2503.00507 [pdf, html, other]
Title: Projection Head is Secretly an Information Bottleneck
Zhuo Ouyang, Kaiwen Hu, Qi Zhang, Yifei Wang, Yisen Wang
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT)
[45] arXiv:2503.00509 [pdf, html, other]
Title: Functional multi-armed bandit and the best function identification problems
Yuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Anastasiia Soboleva
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[46] arXiv:2503.00522 [pdf, html, other]
Title: Periodic Materials Generation using Text-Guided Joint Diffusion Model
Kishalay Das, Subhojyoti Khastagir, Pawan Goyal, Seung-Cheol Lee, Satadeep Bhattacharjee, Niloy Ganguly
Comments: ICLR 2025
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[47] arXiv:2503.00524 [pdf, html, other]
Title: End-To-End Learning of Gaussian Mixture Priors for Diffusion Sampler
Denis Blessing, Xiaogang Jia, Gerhard Neumann
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[48] arXiv:2503.00528 [pdf, html, other]
Title: Efficient Prompting for Continual Adaptation to Missing Modalities
Zirun Guo, Shulei Wang, Wang Lin, Weicai Yan, Yangyang Wu, Tao Jin
Comments: Accepted to NAACL 2025 Main
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2503.00535 [pdf, html, other]
Title: What Makes a Good Diffusion Planner for Decision Making?
Haofei Lu, Dongqi Han, Yifei Shen, Dongsheng Li
Comments: ICLR 2025 (Spotlight), Code: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[50] arXiv:2503.00537 [pdf, html, other]
Title: Scalable Reinforcement Learning for Virtual Machine Scheduling
Junjie Sheng, Jiehao Wu, Haochuan Cui, Yiqiu Hu, Wenli Zhou, Lei Zhu, Qian Peng, Wenhao Li, Xiangfeng Wang
Comments: 23 pages, 12 figures
Subjects: Machine Learning (cs.LG)
Total of 3681 entries : 1-50 51-100 101-150 151-200 ... 3651-3681
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack