Machine Learning

Authors and titles for March 2025

Total of 3681 entries : 1-50 51-100 101-150 151-200 201-250 251-300 301-350 ... 3651-3681


[151] arXiv:2503.01507 [pdf, html, other]: Title: Compare different SG-Schemes based on large least square problems

Ramkrishna Acharya

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[152] arXiv:2503.01521 [pdf, html, other]: Title: R2VF: A Two-Step Regularization Algorithm to Cluster Categories in GLMs

Yuval Ben Dror

Subjects: Machine Learning (cs.LG)
[153] arXiv:2503.01530 [pdf, html, other]: Title: Stability-based Generalization Analysis of Randomized Coordinate Descent for Pairwise Learning

Liang Wu, Ruixi Hu, Yunwen Lei

Comments: To appear in AAAI 2025

Subjects: Machine Learning (cs.LG)
[154] arXiv:2503.01544 [pdf, html, other]: Title: Compositional Reasoning with Transformers, RNNs, and Chain of Thought

Gilad Yehudai, Noah Amsel, Joan Bruna

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[155] arXiv:2503.01556 [pdf, html, other]: Title: Effective High-order Graph Representation Learning for Credit Card Fraud Detection

Yao Zou, Dawei Cheng

Comments: 9 pages, 5 figures, accepted at IJCAI 2024

Journal-ref: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI 2024), pages 7581-7589

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[156] arXiv:2503.01557 [pdf, html, other]: Title: MoCFL: Mobile Cluster Federated Learning Framework for Highly Dynamic Network

Kai Fang, Jiangtao Deng, Chengzu Dong, Usman Naseem, Tongcun Liu, Hailin Feng, Wei Wang

Comments: 10 pages, 7 figures, conference

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[157] arXiv:2503.01580 [pdf, html, other]: Title: A Selective Learning Method for Temporal Graph Continual Learning

Hanmo Liu, Shimin Di, Haoyang Li, Xun Jian, Yue Wang, Lei Chen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[158] arXiv:2503.01586 [pdf, html, other]: Title: EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection

Yuhao Zhou, Sirui Song, Boyang Liu, Zhiheng Xi, Senjie Jin, Xiaoran Fan, Zhihao Zhang, Wei Li, Xuanjing Huang

Comments: 13 pages, 8 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[159] arXiv:2503.01595 [pdf, html, other]: Title: STAR: Stability-Inducing Weight Perturbation for Continual Learning

Masih Eskandar, Tooba Imtiaz, Davin Hill, Zifeng Wang, Jennifer Dy

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2503.01598 [pdf, html, other]: Title: Heterogeneity Matters even More in Distributed Learning: Study from Generalization Perspective

Masoud Kavian, Milad Sefidgaran, Abdellatif Zaidi, Romain Chor

Comments: 42 pages, 11 figures

Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[161] arXiv:2503.01630 [pdf, html, other]: Title: Machine Learners Should Acknowledge the Legal Implications of Large Language Models as Personal Data

Henrik Nolte, Michèle Finck, Kristof Meding

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[162] arXiv:2503.01650 [pdf, html, other]: Title: CAPS: Context-Aware Priority Sampling for Enhanced Imitation Learning in Autonomous Driving

Hamidreza Mirkhani, Behzad Khamidehi, Ehsan Ahmadi, Fazel Arasteh, Mohammed Elmahgiubi, Weize Zhang, Umar Rajguru, Kasra Rezaee

Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[163] arXiv:2503.01653 [pdf, html, other]: Title: Distilled Prompt Learning for Incomplete Multimodal Survival Prediction

Yingxue Xu, Fengtao Zhou, Chenyu Zhao, Yihui Wang, Can Yang, Hao Chen

Comments: Accepted by CVPR 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2503.01658 [pdf, html, other]: Title: CoPL: Collaborative Preference Learning for Personalizing LLMs

Youngbin Choi, Seunghyuk Cho, Minjong Lee, MoonJeong Park, Yesong Ko, Jungseul Ok, Dongwoo Kim

Comments: 13 pages, 4 figures, 6 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[165] arXiv:2503.01660 [pdf, other]: Title: Non-convergence to the optimal risk for Adam and stochastic gradient descent optimization in the training of deep neural networks

Thang Do, Arnulf Jentzen, Adrian Riekert

Comments: 42 pages

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[166] arXiv:2503.01664 [pdf, html, other]: Title: Merging Hazy Sets with m-Schemes: A Geometric Approach to Data Visualization

Lukas Silvester Barth, Hannaneh Fahimi, Parvaneh Joharinad, Jürgen Jost, Janis Keck

Subjects: Machine Learning (cs.LG); Discrete Mathematics (cs.DM); Metric Geometry (math.MG)
[167] arXiv:2503.01669 [pdf, html, other]: Title: An Efficient Continual Learning Framework for Multivariate Time Series Prediction Tasks with Application to Vehicle State Estimation

Arvin Hosseinzadeh, Ladan Khoshnevisan, Mohammad Pirani, Shojaeddin Chenouri, Amir Khajepour

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[168] arXiv:2503.01675 [pdf, html, other]: Title: Using (Not so) Large Language Models for Generating Simulation Models in a Formal DSL -- A Study on Reaction Networks

Justin N. Kreikemeyer, Miłosz Jankowski, Pia Wilsdorf, Adelinde M. Uhrmacher

Comments: 18 pages, 5 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[169] arXiv:2503.01682 [pdf, html, other]: Title: GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models

Mufan Qiu, Xinyu Hu, Fengwei Zhan, Sukwon Yun, Jie Peng, Ruichen Zhang, Bhavya Kailkhura, Jiekun Yang, Tianlong Chen

Subjects: Machine Learning (cs.LG)
[170] arXiv:2503.01702 [pdf, html, other]: Title: Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks

Nandi Schoots, Mattia Jacopo Villani, Niels uit de Bos

Comments: accepted to AISTATS 2025; 12 pages including bibliography and appendix

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[171] arXiv:2503.01703 [pdf, html, other]: Title: On the Development of Binary Classification Algorithm Based on Principles of Geometry and Statistical Inference

Vatsal Srivastava

Comments: 20 pages and some figures might give overfull warnings but compiled successfully and looks good so can be ignored

Subjects: Machine Learning (cs.LG); Algebraic Geometry (math.AG)
[172] arXiv:2503.01704 [pdf, html, other]: Title: DILEMMA: Joint LLM Quantization and Distributed LLM Inference Over Edge Computing Systems

Minoo Hosseinzadeh, Hana Khamfroush

Subjects: Machine Learning (cs.LG)
[173] arXiv:2503.01713 [pdf, html, other]: Title: SAGE: A Framework of Precise Retrieval for RAG

Jintao Zhang, Guoliang Li, Jinyang Su

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[174] arXiv:2503.01718 [pdf, html, other]: Title: Learning Surrogate Equations for the Analysis of an Agent-Based Cancer Model

Kevin Burrage, Pamela Burrage, Justin N. Kreikemeyer, Adelinde M. Uhrmacher, Hasitha N. Weerasinghe

Comments: 15 pages, 6 figures

Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[175] arXiv:2503.01720 [pdf, html, other]: Title: Quality Measures for Dynamic Graph Generative Models

Ryien Hosseini, Filippo Simini, Venkatram Vishwanath, Rebecca Willett, Henry Hoffmann

Comments: To appear as a spotlight presentation at ICLR 2025

Subjects: Machine Learning (cs.LG)
[176] arXiv:2503.01723 [pdf, html, other]: Title: How Low Can You Go? Searching for the Intrinsic Dimensionality of Complex Networks using Metric Node Embeddings

Nikolaos Nakis, Niels Raunkjær Holm, Andreas Lyhne Fiehn, Morten Mørup

Comments: Published as a conference paper at ICLR 2025

Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[177] arXiv:2503.01727 [pdf, html, other]: Title: Mamba base PKD for efficient knowledge compression

José Medina, Amnir Hadachi, Paul Honeine, Abdelaziz Bensrhair

Comments: A preliminary version of this work was presented as a short poster titled "Mamba-PKD: A Framework for Efficient and Scalable Model Compression in Image Classification" at The 40th ACM/SIGAPP Symposium on Applied Computing this https URL

Subjects: Machine Learning (cs.LG)
[178] arXiv:2503.01728 [pdf, html, other]: Title: DeepSuM: Deep Sufficient Modality Learning Framework

Zhe Gao, Jian Huang, Ting Li, Xueqin Wang

Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[179] arXiv:2503.01737 [pdf, html, other]: Title: Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios

Mohammad Rafid Ul Islam, Prasad Tadepalli, Alan Fern

Comments: 7 pages, 2 figures, 3 tables, Accepted in AAAI 2025 Main Track

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[180] arXiv:2503.01750 [pdf, html, other]: Title: ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition

Nastaran Mansourian, Arash Mohammadi, M. Omair Ahmad, M.N.S. Swamy

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[181] arXiv:2503.01768 [pdf, html, other]: Title: SHADE-AD: An LLM-Based Framework for Synthesizing Activity Data of Alzheimer's Patients

Heming Fu, Hongkai Chen, Shan Lin, Guoliang Xing

Comments: 7 pages, 6 figures, ACM SenSys'25

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[182] arXiv:2503.01776 [pdf, html, other]: Title: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation

Tiansheng Wen, Yifei Wang, Zequn Zeng, Zhong Peng, Yudi Su, Xinyang Liu, Bo Chen, Hongwei Liu, Stefanie Jegelka, Chenyu You

Comments: A novel sparse coding framework designed for learning adaptive representation

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[183] arXiv:2503.01803 [pdf, html, other]: Title: Deep Reinforcement Learning-Based User Association in Hybrid LiFi/WiFi Indoor Networks

Peijun Hou, Nan Cen

Comments: 12 pages, 15 figures

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[184] arXiv:2503.01805 [pdf, html, other]: Title: Depth-Width tradeoffs in Algorithmic Reasoning of Graph Tasks with Transformers

Gilad Yehudai, Clayton Sanford, Maya Bechler-Speicher, Orr Fischer, Ran Gilad-Bachrach, Amir Globerson

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[185] arXiv:2503.01817 [pdf, html, other]: Title: Noise to the Rescue: Escaping Local Minima in Neurosymbolic Local Search

Alessandro Daniele, Emile van Krieken

Subjects: Machine Learning (cs.LG)
[186] arXiv:2503.01820 [pdf, html, other]: Title: RSQ: Learning from Important Tokens Leads to Better Quantized LLMs

Yi-Lin Sung, Prateek Yadav, Jialu Li, Jaehong Yoon, Mohit Bansal

Comments: Our code is available at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[187] arXiv:2503.01821 [pdf, other]: Title: On the Power of Context-Enhanced Learning in LLMs

Xingyu Zhu, Abhishek Panigrahi, Sanjeev Arora

Comments: 76 pages, 17 figures; Pre-print

Subjects: Machine Learning (cs.LG)
[188] arXiv:2503.01822 [pdf, html, other]: Title: Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry

Sai Sumedh R. Hindupur, Ekdeep Singh Lubana, Thomas Fel, Demba Ba

Comments: Preprint

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[189] arXiv:2503.01824 [pdf, html, other]: Title: From superposition to sparse codes: interpretable representations in neural networks

David Klindt, Charles O'Neill, Patrik Reizinger, Harald Maurer, Nina Miolane

Subjects: Machine Learning (cs.LG)
[190] arXiv:2503.01827 [pdf, other]: Title: Open-source framework for detecting bias and overfitting for large pathology images

Anders Sildnes, Nikita Shvetsov, Masoud Tafavvoghi, Vi Ngoc-Nha Tran, Kajsa Møllersen, Lill-Tove Rasmussen Busund, Thomas K. Kilvær, Lars Ailo Bongo

Subjects: Machine Learning (cs.LG); Software Engineering (cs.SE); Image and Video Processing (eess.IV)
[191] arXiv:2503.01837 [pdf, html, other]: Title: Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning

Adrià López Escoriza, Nicklas Hansen, Stone Tao, Tongzhou Mu, Hao Su

Comments: Project page can be found at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[192] arXiv:2503.01838 [pdf, other]: Title: GRAIN: Exact Graph Reconstruction from Gradients

Maria Drencheva, Ivo Petrov, Maximilian Baader, Dimitar I. Dimitrov, Martin Vechev

Comments: Published at The Thirteenth International Conference on Learning Representations (ICLR) 2025

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[193] arXiv:2503.01843 [pdf, other]: Title: When Can You Get Away with Low Memory Adam?

Dayal Singh Kalra, John Kirchenbauer, Maissam Barkeshli, Tom Goldstein

Comments: Acknowledgement updates and minor writing edits

Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (stat.ML)
[194] arXiv:2503.01864 [pdf, html, other]: Title: Larger or Smaller Reward Margins to Select Preferences for Alignment?

Kexin Huang, Junkang Wu, Ziqian Chen, Xue Wang, Jinyang Gao, Bolin Ding, Jiancan Wu, Xiangnan He, Xiang Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[195] arXiv:2503.01865 [pdf, other]: Title: Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints

Junxiao Yang, Zhexin Zhang, Shiyao Cui, Hongning Wang, Minlie Huang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[196] arXiv:2503.01868 [pdf, other]: Title: Systems and Algorithms for Convolutional Multi-Hybrid Language Models at Scale

Jerome Ku, Eric Nguyen, David W. Romero, Garyk Brixi, Brandon Yang, Anton Vorontsov, Ali Taghibakhshi, Amy X. Lu, Dave P. Burke, Greg Brockman, Stefano Massaroli, Christopher Ré, Patrick D. Hsu, Brian L. Hie, Stefano Ermon, Michael Poli

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[197] arXiv:2503.01871 [pdf, html, other]: Title: Data Augmentation for Instruction Following Policies via Trajectory Segmentation

Niklas Höpner, Ilaria Tiddi, Herke van Hoof

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[198] arXiv:2503.01872 [pdf, html, other]: Title: FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance

Mintong Kang, Vinayshekhar Bannihatti Kumar, Shamik Roy, Abhishek Kumar, Sopan Khosla, Balakrishnan Murali Narayanaswamy, Rashmi Gangadharaiah

Comments: Under submission

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[199] arXiv:2503.01873 [pdf, html, other]: Title: Online Pseudo-average Shifting Attention(PASA) for Robust Low-precision LLM Inference: Algorithms and Numerical Analysis

Long Cheng, Qichen Liao, Fan Wu, Junlin Mu, Tengfei Han, Zhe Qiu, Lianqiang Li, Tianyi Liu, Fangzheng Miao, Keming Gao, Liang Wang, Zhen Zhang, Qiande Yin

Comments: 21 Pages, 14 figures, conference paper

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF); Numerical Analysis (math.NA)
[200] arXiv:2503.01874 [pdf, html, other]: Title: CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging

Zongzhen Yang, Binhang Qi, Hailong Sun, Wenrui Long, Ruobing Zhao, Xiang Gao

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
