Machine Learning

Authors and titles for March 2025

Total of 3681 entries
[151] arXiv:2503.01507 [pdf, html, other]
Title: Compare different SG-Schemes based on large least square problems
Ramkrishna Acharya
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[152] arXiv:2503.01521 [pdf, html, other]
Title: R2VF: A Two-Step Regularization Algorithm to Cluster Categories in GLMs
Yuval Ben Dror
Subjects: Machine Learning (cs.LG)
[153] arXiv:2503.01530 [pdf, html, other]
Title: Stability-based Generalization Analysis of Randomized Coordinate Descent for Pairwise Learning
Liang Wu, Ruixi Hu, Yunwen Lei
Comments: To appear in AAAI 2025
Subjects: Machine Learning (cs.LG)
[154] arXiv:2503.01544 [pdf, html, other]
Title: Compositional Reasoning with Transformers, RNNs, and Chain of Thought
Gilad Yehudai, Noah Amsel, Joan Bruna
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[155] arXiv:2503.01556 [pdf, html, other]
Title: Effective High-order Graph Representation Learning for Credit Card Fraud Detection
Yao Zou, Dawei Cheng
Comments: 9 pages, 5 figures, accepted at IJCAI 2024
Journal-ref: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI 2024), pages 7581-7589
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[156] arXiv:2503.01557 [pdf, html, other]
Title: MoCFL: Mobile Cluster Federated Learning Framework for Highly Dynamic Network
Kai Fang, Jiangtao Deng, Chengzu Dong, Usman Naseem, Tongcun Liu, Hailin Feng, Wei Wang
Comments: 10 pages, 7 figures, conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[157] arXiv:2503.01580 [pdf, html, other]
Title: A Selective Learning Method for Temporal Graph Continual Learning
Hanmo Liu, Shimin Di, Haoyang Li, Xun Jian, Yue Wang, Lei Chen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[158] arXiv:2503.01586 [pdf, html, other]
Title: EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection
Yuhao Zhou, Sirui Song, Boyang Liu, Zhiheng Xi, Senjie Jin, Xiaoran Fan, Zhihao Zhang, Wei Li, Xuanjing Huang
Comments: 13 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[159] arXiv:2503.01595 [pdf, html, other]
Title: STAR: Stability-Inducing Weight Perturbation for Continual Learning
Masih Eskandar, Tooba Imtiaz, Davin Hill, Zifeng Wang, Jennifer Dy
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2503.01598 [pdf, html, other]
Title: Heterogeneity Matters even More in Distributed Learning: Study from Generalization Perspective
Masoud Kavian, Milad Sefidgaran, Abdellatif Zaidi, Romain Chor
Comments: 42 pages, 11 figures
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[161] arXiv:2503.01630 [pdf, html, other]
Title: Machine Learners Should Acknowledge the Legal Implications of Large Language Models as Personal Data
Henrik Nolte, Michèle Finck, Kristof Meding
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[162] arXiv:2503.01650 [pdf, html, other]
Title: CAPS: Context-Aware Priority Sampling for Enhanced Imitation Learning in Autonomous Driving
Hamidreza Mirkhani, Behzad Khamidehi, Ehsan Ahmadi, Fazel Arasteh, Mohammed Elmahgiubi, Weize Zhang, Umar Rajguru, Kasra Rezaee
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[163] arXiv:2503.01653 [pdf, html, other]
Title: Distilled Prompt Learning for Incomplete Multimodal Survival Prediction
Yingxue Xu, Fengtao Zhou, Chenyu Zhao, Yihui Wang, Can Yang, Hao Chen
Comments: Accepted by CVPR2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2503.01658 [pdf, html, other]
Title: CoPL: Collaborative Preference Learning for Personalizing LLMs
Youngbin Choi, Seunghyuk Cho, Minjong Lee, MoonJeong Park, Yesong Ko, Jungseul Ok, Dongwoo Kim
Comments: 13 pages, 4 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[165] arXiv:2503.01660 [pdf, other]
Title: Non-convergence to the optimal risk for Adam and stochastic gradient descent optimization in the training of deep neural networks
Thang Do, Arnulf Jentzen, Adrian Riekert
Comments: 42 pages
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[166] arXiv:2503.01664 [pdf, html, other]
Title: Merging Hazy Sets with m-Schemes: A Geometric Approach to Data Visualization
Lukas Silvester Barth, Hannaneh Fahimi, Parvaneh Joharinad, Jürgen Jost, Janis Keck
Subjects: Machine Learning (cs.LG); Discrete Mathematics (cs.DM); Metric Geometry (math.MG)
[167] arXiv:2503.01669 [pdf, html, other]
Title: An Efficient Continual Learning Framework for Multivariate Time Series Prediction Tasks with Application to Vehicle State Estimation
Arvin Hosseinzadeh, Ladan Khoshnevisan, Mohammad Pirani, Shojaeddin Chenouri, Amir Khajepour
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[168] arXiv:2503.01675 [pdf, html, other]
Title: Using (Not so) Large Language Models for Generating Simulation Models in a Formal DSL -- A Study on Reaction Networks
Justin N. Kreikemeyer, Miłosz Jankowski, Pia Wilsdorf, Adelinde M. Uhrmacher
Comments: 18 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[169] arXiv:2503.01682 [pdf, html, other]
Title: GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models
Mufan Qiu, Xinyu Hu, Fengwei Zhan, Sukwon Yun, Jie Peng, Ruichen Zhang, Bhavya Kailkhura, Jiekun Yang, Tianlong Chen
Subjects: Machine Learning (cs.LG)
[170] arXiv:2503.01702 [pdf, html, other]
Title: Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks
Nandi Schoots, Mattia Jacopo Villani, Niels uit de Bos
Comments: accepted to AISTATS 2025; 12 pages including bibliography and appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[171] arXiv:2503.01703 [pdf, html, other]
Title: On the Development of Binary Classification Algorithm Based on Principles of Geometry and Statistical Inference
Vatsal Srivastava
Comments: 20 pages and some figures might give overfull warnings but compiled successfully and looks good so can be ignored
Subjects: Machine Learning (cs.LG); Algebraic Geometry (math.AG)
[172] arXiv:2503.01704 [pdf, html, other]
Title: DILEMMA: Joint LLM Quantization and Distributed LLM Inference Over Edge Computing Systems
Minoo Hosseinzadeh, Hana Khamfroush
Subjects: Machine Learning (cs.LG)
[173] arXiv:2503.01713 [pdf, html, other]
Title: SAGE: A Framework of Precise Retrieval for RAG
Jintao Zhang, Guoliang Li, Jinyang Su
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[174] arXiv:2503.01718 [pdf, html, other]
Title: Learning Surrogate Equations for the Analysis of an Agent-Based Cancer Model
Kevin Burrage, Pamela Burrage, Justin N. Kreikemeyer, Adelinde M. Uhrmacher, Hasitha N. Weerasinghe
Comments: 15 pages, 6 figures
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[175] arXiv:2503.01720 [pdf, html, other]
Title: Quality Measures for Dynamic Graph Generative Models
Ryien Hosseini, Filippo Simini, Venkatram Vishwanath, Rebecca Willett, Henry Hoffmann
Comments: To appear as a spotlight presentation at ICLR 2025
Subjects: Machine Learning (cs.LG)
[176] arXiv:2503.01723 [pdf, html, other]
Title: How Low Can You Go? Searching for the Intrinsic Dimensionality of Complex Networks using Metric Node Embeddings
Nikolaos Nakis, Niels Raunkjær Holm, Andreas Lyhne Fiehn, Morten Mørup
Comments: Published as a conference paper at ICLR 2025
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[177] arXiv:2503.01727 [pdf, html, other]
Title: Mamba base PKD for efficient knowledge compression
José Medina, Amnir Hadachi, Paul Honeine, Abdelaziz Bensrhair
Comments: A preliminary version of this work was presented as a short poster titled "Mamba-PKD: A Framework for Efficient and Scalable Model Compression in Image Classification" at The 40th ACM/SIGAPP Symposium on Applied Computing this https URL
Subjects: Machine Learning (cs.LG)
[178] arXiv:2503.01728 [pdf, html, other]
Title: DeepSuM: Deep Sufficient Modality Learning Framework
Zhe Gao, Jian Huang, Ting Li, Xueqin Wang
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[179] arXiv:2503.01737 [pdf, html, other]
Title: Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios
Mohammad Rafid Ul Islam, Prasad Tadepalli, Alan Fern
Comments: 7 pages, 2 figures, 3 tables, Accepted in AAAI 2025 Main Track
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[180] arXiv:2503.01750 [pdf, html, other]
Title: ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition
Nastaran Mansourian, Arash Mohammadi, M. Omair Ahmad, M.N.S. Swamy
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[181] arXiv:2503.01768 [pdf, html, other]
Title: SHADE-AD: An LLM-Based Framework for Synthesizing Activity Data of Alzheimer's Patients
Heming Fu, Hongkai Chen, Shan Lin, Guoliang Xing
Comments: 7 pages, 6 figures, ACM SenSys'25
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[182] arXiv:2503.01776 [pdf, html, other]
Title: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation
Tiansheng Wen, Yifei Wang, Zequn Zeng, Zhong Peng, Yudi Su, Xinyang Liu, Bo Chen, Hongwei Liu, Stefanie Jegelka, Chenyu You
Comments: A novel sparse coding framework designed for learning adaptive representation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[183] arXiv:2503.01803 [pdf, html, other]
Title: Deep Reinforcement Learning-Based User Association in Hybrid LiFi/WiFi Indoor Networks
Peijun Hou, Nan Cen
Comments: 12 pages, 15 figures
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[184] arXiv:2503.01805 [pdf, html, other]
Title: Depth-Width tradeoffs in Algorithmic Reasoning of Graph Tasks with Transformers
Gilad Yehudai, Clayton Sanford, Maya Bechler-Speicher, Orr Fischer, Ran Gilad-Bachrach, Amir Globerson
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[185] arXiv:2503.01817 [pdf, html, other]
Title: Noise to the Rescue: Escaping Local Minima in Neurosymbolic Local Search
Alessandro Daniele, Emile van Krieken
Subjects: Machine Learning (cs.LG)
[186] arXiv:2503.01820 [pdf, html, other]
Title: RSQ: Learning from Important Tokens Leads to Better Quantized LLMs
Yi-Lin Sung, Prateek Yadav, Jialu Li, Jaehong Yoon, Mohit Bansal
Comments: Our code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[187] arXiv:2503.01821 [pdf, other]
Title: On the Power of Context-Enhanced Learning in LLMs
Xingyu Zhu, Abhishek Panigrahi, Sanjeev Arora
Comments: 76 pages, 17 figures; Pre-print
Subjects: Machine Learning (cs.LG)
[188] arXiv:2503.01822 [pdf, html, other]
Title: Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry
Sai Sumedh R. Hindupur, Ekdeep Singh Lubana, Thomas Fel, Demba Ba
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[189] arXiv:2503.01824 [pdf, html, other]
Title: From superposition to sparse codes: interpretable representations in neural networks
David Klindt, Charles O'Neill, Patrik Reizinger, Harald Maurer, Nina Miolane
Subjects: Machine Learning (cs.LG)
[190] arXiv:2503.01827 [pdf, other]
Title: Open-source framework for detecting bias and overfitting for large pathology images
Anders Sildnes, Nikita Shvetsov, Masoud Tafavvoghi, Vi Ngoc-Nha Tran, Kajsa Møllersen, Lill-Tove Rasmussen Busund, Thomas K. Kilvær, Lars Ailo Bongo
Subjects: Machine Learning (cs.LG); Software Engineering (cs.SE); Image and Video Processing (eess.IV)
[191] arXiv:2503.01837 [pdf, html, other]
Title: Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning
Adrià López Escoriza, Nicklas Hansen, Stone Tao, Tongzhou Mu, Hao Su
Comments: Project page can be found at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[192] arXiv:2503.01838 [pdf, other]
Title: GRAIN: Exact Graph Reconstruction from Gradients
Maria Drencheva, Ivo Petrov, Maximilian Baader, Dimitar I. Dimitrov, Martin Vechev
Comments: Published at The Thirteenth International Conference on Learning Representations (ICLR) 2025
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[193] arXiv:2503.01843 [pdf, other]
Title: When Can You Get Away with Low Memory Adam?
Dayal Singh Kalra, John Kirchenbauer, Maissam Barkeshli, Tom Goldstein
Comments: Acknowledgement updates and minor writing edits
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (stat.ML)
[194] arXiv:2503.01864 [pdf, html, other]
Title: Larger or Smaller Reward Margins to Select Preferences for Alignment?
Kexin Huang, Junkang Wu, Ziqian Chen, Xue Wang, Jinyang Gao, Bolin Ding, Jiancan Wu, Xiangnan He, Xiang Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[195] arXiv:2503.01865 [pdf, other]
Title: Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
Junxiao Yang, Zhexin Zhang, Shiyao Cui, Hongning Wang, Minlie Huang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[196] arXiv:2503.01868 [pdf, other]
Title: Systems and Algorithms for Convolutional Multi-Hybrid Language Models at Scale
Jerome Ku, Eric Nguyen, David W. Romero, Garyk Brixi, Brandon Yang, Anton Vorontsov, Ali Taghibakhshi, Amy X. Lu, Dave P. Burke, Greg Brockman, Stefano Massaroli, Christopher Ré, Patrick D. Hsu, Brian L. Hie, Stefano Ermon, Michael Poli
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[197] arXiv:2503.01871 [pdf, html, other]
Title: Data Augmentation for Instruction Following Policies via Trajectory Segmentation
Niklas Höpner, Ilaria Tiddi, Herke van Hoof
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[198] arXiv:2503.01872 [pdf, html, other]
Title: FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance
Mintong Kang, Vinayshekhar Bannihatti Kumar, Shamik Roy, Abhishek Kumar, Sopan Khosla, Balakrishnan Murali Narayanaswamy, Rashmi Gangadharaiah
Comments: Under submission
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[199] arXiv:2503.01873 [pdf, html, other]
Title: Online Pseudo-average Shifting Attention(PASA) for Robust Low-precision LLM Inference: Algorithms and Numerical Analysis
Long Cheng, Qichen Liao, Fan Wu, Junlin Mu, Tengfei Han, Zhe Qiu, Lianqiang Li, Tianyi Liu, Fangzheng Miao, Keming Gao, Liang Wang, Zhen Zhang, Qiande Yin
Comments: 21 Pages, 14 figures, conference paper
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF); Numerical Analysis (math.NA)
[200] arXiv:2503.01874 [pdf, html, other]
Title: CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
Zongzhen Yang, Binhang Qi, Hailong Sun, Wenrui Long, Ruobing Zhao, Xiang Gao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)