Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.LG

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for February 2024

Total of 3960 entries : 1-250 251-500 401-650 501-750 751-1000 1001-1250 ... 3751-3960
Showing up to 250 entries per page: fewer | more | all
[401] arXiv:2402.03264 [pdf, html, other]
Title: MobilityGPT: Enhanced Human Mobility Modeling with a GPT model
Ammar Haydari, Dongjie Chen, Zhengfeng Lai, Michael Zhang, Chen-Nee Chuah
Subjects: Machine Learning (cs.LG)
[402] arXiv:2402.03268 [pdf, html, other]
Title: Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
Xinyi Wang, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[403] arXiv:2402.03270 [pdf, other]
Title: Multiclass Classification Procedure for Detecting Attacks on MQTT-IoT Protocol
Hector Alaiz-Moreton (1), Jose Aveleira-Mata (2), Jorge Ondicol-Garcia (2), Angel Luis Muñoz-Castañeda (2), Isaías García (1), Carmen Benavides (1) ((1) Escuela de Ingenierías, Universidad de León, (2) Research Institute of Applied Sciences in Cybersecurity, Universidad de León)
Journal-ref: Complexity (New York, N.Y.), 2019, Vol.2019, p.1-11
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[404] arXiv:2402.03282 [pdf, html, other]
Title: A Theoretical Framework for Partially Observed Reward-States in RLHF
Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari
Comments: 64 pages. 14 pages for main paper, 50 pages for references + appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[405] arXiv:2402.03287 [pdf, html, other]
Title: A Lennard-Jones Layer for Distribution Normalization
Mulun Na, Jonathan Klein, Biao Zhang, Wojtek Pałubicki, Sören Pirk, Dominik L. Michels
Comments: Upon request, we are happy to share the source code to generate the results presented in this paper. Please contact the first or the last author of this manuscript
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[406] arXiv:2402.03289 [pdf, other]
Title: Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS
Matthew DeLorenzo, Animesh Basak Chowdhury, Vasudev Gohil, Shailja Thakur, Ramesh Karri, Siddharth Garg, Jeyavijayan Rajendran
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[407] arXiv:2402.03292 [pdf, html, other]
Title: Detecting Out-of-Distribution Objects through Class-Conditioned Inpainting
Quang-Huy Nguyen, Jin Peng Zhou, Zhenzhen Liu, Khanh-Huyen Bui, Kilian Q. Weinberger, Wei-Lun Chao, Dung D. Le
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2402.03293 [pdf, html, other]
Title: Flora: Low-Rank Adapters Are Secretly Gradient Compressors
Yongchang Hao, Yanshuai Cao, Lili Mou
Comments: Accepted @ ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[409] arXiv:2402.03295 [pdf, other]
Title: Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks
Yongchang Hao, Yanshuai Cao, Lili Mou
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[410] arXiv:2402.03299 [pdf, other]
Title: GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models
Haibo Jin, Ruoxi Chen, Peiyan Zhang, Andy Zhou, Yang Zhang, Haohan Wang
Comments: 28 papges
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2402.03305 [pdf, html, other]
Title: Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?
Qiyao Liang, Ziming Liu, Ila Fiete
Comments: 13 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2402.03386 [pdf, other]
Title: A generalized decision tree ensemble based on the NeuralNetworks architecture: Distributed Gradient Boosting Forest (DGBF)
Ángel Delgado-Panadero, José Alberto Benítez-Andrades, María Teresa García-Ordás
Journal-ref: Applied Intelligence, Volume 53, July 2023, pages 22991-23003
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[413] arXiv:2402.03448 [pdf, other]
Title: Decentralized Sporadic Federated Learning: A Unified Algorithmic Framework with Convergence Guarantees
Shahryar Zehtabi, Dong-Jun Han, Rohit Parasnis, Seyyedali Hosseinalipour, Christopher G. Brinton
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[414] arXiv:2402.03457 [pdf, html, other]
Title: Efficient and Interpretable Traffic Destination Prediction using Explainable Boosting Machines
Yasin Yousif, Jörg Müller
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[415] arXiv:2402.03467 [pdf, html, other]
Title: Stochastic Modified Flows for Riemannian Stochastic Gradient Descent
Benjamin Gess, Sebastian Kassing, Nimit Rana
Journal-ref: SIAM J. Control Optim. 62(6): 3288-3314 (2024)
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
[416] arXiv:2402.03468 [pdf, html, other]
Title: Exact Tensor Completion Powered by Slim Transforms
Li Ge, Lin Chen, Yudong Chen, Xue Jiang
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[417] arXiv:2402.03469 [pdf, html, other]
Title: Rethinking the Role of Proxy Rewards in Language Model Alignment
Sungdong Kim, Minjoon Seo
Comments: Accepted to EMNLP 2024 main conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[418] arXiv:2402.03471 [pdf, html, other]
Title: The Information of Large Language Model Geometry
Zhiquan Tan, Chenghai Li, Weiran Huang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
[419] arXiv:2402.03478 [pdf, html, other]
Title: Estimating Epistemic and Aleatoric Uncertainty with a Single Model
Matthew A. Chan, Maria J. Molina, Christopher A. Metzler
Comments: 19 pages, 11 figures. To be published in Conference on Neural Information Processing Systems (NeurIPS) 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[420] arXiv:2402.03479 [pdf, html, other]
Title: DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
Comments: To appear in ICML 2024. A preliminary version of this work (arXiv:2310.03494) was presented at the ALOE workshop, NeurIPS 2023. arXiv admin note: text overlap with arXiv:2310.03494
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[421] arXiv:2402.03480 [pdf, html, other]
Title: Trillion Parameter AI Serving Infrastructure for Scientific Discovery: A Survey and Vision
Nathaniel Hudson, J. Gregory Pauloski, Matt Baughman, Alok Kamatar, Mansi Sakarvadia, Logan Ward, Ryan Chard, André Bauer, Maksim Levental, Wenyi Wang, Will Engler, Owen Price Skelly, Ben Blaiszik, Rick Stevens, Kyle Chard, Ian Foster
Comments: 10 pages, 3 figures, accepted for publication in the proceedings of the 10th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[422] arXiv:2402.03486 [pdf, html, other]
Title: Early prediction of onset of sepsis in Clinical Setting
Fahim Mohammad, Lakshmi Arunachalam, Samanway Sadhu, Boudewijn Aasman, Shweta Garg, Adil Ahmed, Silvie Colman, Meena Arunachalam, Sudhir Kulkarni, Parsa Mirhaji
Comments: 16 pages, 6 figures and 7 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[423] arXiv:2402.03495 [pdf, html, other]
Title: Partially Stochastic Infinitely Deep Bayesian Neural Networks
Sergio Calvo-Ordonez, Matthieu Meunier, Francesco Piatti, Yuantao Shi
Comments: 17 pages including supplementary material. Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Probability (math.PR)
[424] arXiv:2402.03496 [pdf, html, other]
Title: Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
Wu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani
Comments: A long version of the ICML 2024 paper. Updated the caption of Fig 4 to emphasize the importance of the scale invariance of root-free methods
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[425] arXiv:2402.03502 [pdf, html, other]
Title: How Does Unlabeled Data Provably Help Out-of-Distribution Detection?
Xuefeng Du, Zhen Fang, Ilias Diakonikolas, Yixuan Li
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[426] arXiv:2402.03525 [pdf, html, other]
Title: Deep Reinforcement Learning for Picker Routing Problem in Warehousing
George Dunn, Hadi Charkhgard, Ali Eshragh, Sasan Mahmoudinazlou, Elizabeth Stojanovski
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[427] arXiv:2402.03531 [pdf, other]
Title: Fairness and Privacy Guarantees in Federated Contextual Bandits
Sambhav Solanki, Shweta Jain, Sujit Gujar
Comments: 16 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[428] arXiv:2402.03540 [pdf, html, other]
Title: Regulation Games for Trustworthy Machine Learning
Mohammad Yaghini, Patty Liu, Franziska Boenisch, Nicolas Papernot
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Machine Learning (stat.ML)
[429] arXiv:2402.03541 [pdf, html, other]
Title: HAMLET: Graph Transformer Neural Operator for Partial Differential Equations
Andrey Bryutkin, Jiahao Huang, Zhongying Deng, Guang Yang, Carola-Bibiane Schönlieb, Angelica Aviles-Rivero
Comments: 18 pages, 7 figures, 6 tables
Journal-ref: Proceedings of Machine Learning Research, Vol. 235, pp. 4624-4641, 2024
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[430] arXiv:2402.03545 [pdf, html, other]
Title: Online Feature Updates Improve Online (Generalized) Label Shift Adaptation
Ruihan Wu, Siddhartha Datta, Yi Su, Dheeraj Baby, Yu-Xiang Wang, Kilian Q. Weinberger
Subjects: Machine Learning (cs.LG)
[431] arXiv:2402.03548 [pdf, html, other]
Title: Single-GPU GNN Systems: Traps and Pitfalls
Yidong Gong, Arnab Tarafder, Saima Afrin, Pradeep Kumar
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[432] arXiv:2402.03558 [pdf, html, other]
Title: Path Signatures and Graph Neural Networks for Slow Earthquake Analysis: Better Together?
Hans Riess, Manolis Veveakis, Michael M. Zavlanos
Subjects: Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[433] arXiv:2402.03559 [pdf, html, other]
Title: Constrained Synthesis with Projected Diffusion Models
Jacob K Christopher, Stephen Baek, Ferdinando Fioretto
Comments: Published at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[434] arXiv:2402.03563 [pdf, html, other]
Title: Distinguishing the Knowable from the Unknowable with Language Models
Gustaf Ahdritz, Tian Qin, Nikhil Vyas, Boaz Barak, Benjamin L. Edelman
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[435] arXiv:2402.03564 [pdf, html, other]
Title: SkipPredict: When to Invest in Predictions for Scheduling
Rana Shahout, Michael Mitzenmacher
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[436] arXiv:2402.03570 [pdf, html, other]
Title: Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding, Amy Zhang, Yuandong Tian, Qinqing Zheng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[437] arXiv:2402.03576 [pdf, html, other]
Title: Generalization Properties of Adversarial Training for $\ell_0$-Bounded Adversarial Attacks
Payam Delgosha, Hamed Hassani, Ramtin Pedarsani
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[438] arXiv:2402.03577 [pdf, html, other]
Title: Revisiting the Dataset Bias Problem from a Statistical Perspective
Kien Do, Dung Nguyen, Hung Le, Thao Le, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh
Subjects: Machine Learning (cs.LG)
[439] arXiv:2402.03579 [pdf, html, other]
Title: Deconstructing the Goldilocks Zone of Neural Network Initialization
Artem Vysogorets, Anna Dawid, Julia Kempe
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, PMLR (2024) 235:49717-49732
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[440] arXiv:2402.03587 [pdf, html, other]
Title: Information-Theoretic Active Correlation Clustering
Linus Aronsson, Morteza Haghir Chehreghani
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[441] arXiv:2402.03588 [pdf, html, other]
Title: Continual Domain Adversarial Adaptation via Double-Head Discriminators
Yan Shen, Zhanghexuan Ji, Chunwei Ma, Mingchen Gao
Comments: AISTATS 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[442] arXiv:2402.03589 [pdf, html, other]
Title: A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System
Jiaqi Liang, Sanjay Dominik Jena, Defeng Liu, Andrea Lodi
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[443] arXiv:2402.03590 [pdf, html, other]
Title: Assessing the Impact of Distribution Shift on Reinforcement Learning Performance
Ted Fujimoto, Joshua Suetterlein, Samrat Chatterjee, Auroop Ganguly
Comments: Poster at the Workshop on Regulatable Machine Learning at the 37th Conference on Neural Information Processing Systems (RegML @ NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[444] arXiv:2402.03610 [pdf, html, other]
Title: RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
Tomoyuki Kagaya, Thong Jing Yuan, Yuxuan Lou, Jayashree Karlekar, Sugiri Pranata, Akira Kinose, Koki Oguri, Felix Wick, Yang You
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[445] arXiv:2402.03614 [pdf, html, other]
Title: Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs
He Zhao, Vassili Kitsios, Terence J. O'Kane, Edwin V. Bonilla
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[446] arXiv:2402.03621 [pdf, html, other]
Title: Neural Network Approximators for Marginal MAP in Probabilistic Circuits
Shivvrat Arya, Tahrima Rahman, Vibhav Gogate
Comments: Will appear in AAAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[447] arXiv:2402.03625 [pdf, html, other]
Title: Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
Sungyoon Kim, Mert Pilanci
Comments: Version 2: Fixed proof of Thm 4.4, slight clarification on assumption 2 Version 3: Modified to ICML style and slight clarification on assumption 1
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[448] arXiv:2402.03629 [pdf, html, other]
Title: Disparate Impact on Group Accuracy of Linearization for Private Inference
Saswat Das, Marco Romanelli, Ferdinando Fioretto
Comments: Extended version of the paper accepted to appear at the Forty-first International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[449] arXiv:2402.03646 [pdf, other]
Title: Lens: A Foundation Model for Network Traffic
Qineng Wang, Chen Qian, Xiaochang Li, Ziyu Yao, Gang Zhou, Huajie Shao
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[450] arXiv:2402.03647 [pdf, html, other]
Title: CAMBranch: Contrastive Learning with Augmented MILPs for Branching
Jiacheng Lin, Meng Xu, Zhihua Xiong, Huangang Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[451] arXiv:2402.03655 [pdf, html, other]
Title: Operator SVD with Neural Networks via Nested Low-Rank Approximation
J. Jon Ryu, Xiangxiang Xu, H. S. Melihcan Erol, Yuheng Bu, Lizhong Zheng, Gregory W. Wornell
Comments: 36 pages, 7 figures. ICML 2024. Almost identical to the conference version, except a few updates for fixing typos and mistakes
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[452] arXiv:2402.03659 [pdf, html, other]
Title: Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models
Kelvin J.L. Koa, Yunshan Ma, Ritchie Ng, Tat-Seng Chua
Comments: WWW 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Statistical Finance (q-fin.ST)
[453] arXiv:2402.03660 [pdf, html, other]
Title: On the Emergence of Cross-Task Linearity in the Pretraining-Finetuning Paradigm
Zhanpeng Zhou, Zijun Chen, Yilan Chen, Bo Zhang, Junchi Yan
Comments: 31 pages, 24 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[454] arXiv:2402.03661 [pdf, html, other]
Title: Transductive Reward Inference on Graph
Bohao Qu, Xiaofeng Cao, Qing Guo, Yi Chang, Ivor W. Tsang, Chengqi Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[455] arXiv:2402.03663 [pdf, html, other]
Title: Symbol Correctness in Deep Neural Networks Containing Symbolic Layers
Aaron Bembenek, Toby Murray
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[456] arXiv:2402.03664 [pdf, html, other]
Title: Partial Gromov-Wasserstein Metric
Yikun Bai, Rocio Diaz Martin, Abihith Kothapalli, Hengrong Du, Xinran Liu, Soheil Kolouri
Comments: Published at ICLR 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[457] arXiv:2402.03687 [pdf, html, other]
Title: Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation
Lingxiao Zhao, Xueying Ding, Leman Akoglu
Comments: Diffusion Model on Graphs
Journal-ref: NeurIPS 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[458] arXiv:2402.03698 [pdf, other]
Title: Estimating the Local Learning Coefficient at Scale
Zach Furman, Edmund Lau
Comments: This paper has been expanded and merged with arXiv:2308.12108 to form a more comprehensive study. Please refer to the latest version of that preprint for the most up-to-date manuscript
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[459] arXiv:2402.03701 [pdf, html, other]
Title: Unified Discrete Diffusion for Categorical Data
Lingxiao Zhao, Xueying Ding, Lijun Yu, Leman Akoglu
Comments: Unify Discrete Denoising Diffusion
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[460] arXiv:2402.03715 [pdf, html, other]
Title: Clarify: Improving Model Robustness With Natural Language Corrections
Yoonho Lee, Michelle S. Lam, Helena Vasconcelos, Michael S. Bernstein, Chelsea Finn
Comments: UIST 2024. Interface code available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[461] arXiv:2402.03720 [pdf, other]
Title: Similarity-based Neighbor Selection for Graph LLMs
Rui Li, Jiwei Li, Jiawei Han, Guoyin Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[462] arXiv:2402.03726 [pdf, html, other]
Title: Learning Granger Causality from Instance-wise Self-attentive Hawkes Processes
Dongxia Wu, Tsuyoshi Idé, Aurélie Lozano, Georgios Kollias, Jiří Navrátil, Naoki Abe, Yi-An Ma, Rose Yu
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[463] arXiv:2402.03737 [pdf, html, other]
Title: Differentially Private High Dimensional Bandits
Apurv Shukla
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Systems and Control (eess.SY); Optimization and Control (math.OC); Machine Learning (stat.ML)
[464] arXiv:2402.03741 [pdf, html, other]
Title: SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems
Oubo Ma, Yuwen Pu, Linkang Du, Yang Dai, Ruo Wang, Xiaolei Liu, Yingcai Wu, Shouling Ji
Comments: To appear in the ACM Conference on Computer and Communications Security (CCS'24), October 14-18, 2024, Salt Lake City, UT, USA
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[465] arXiv:2402.03747 [pdf, other]
Title: An invariance constrained deep learning network for PDE discovery
Chao Chen, Hui Li, Xiaowei Jin
Subjects: Machine Learning (cs.LG)
[466] arXiv:2402.03750 [pdf, other]
Title: Digital Twin Mobility Profiling: A Spatio-Temporal Graph Learning Approach
Xin Chen, Mingliang Hou, Tao Tang, Achhardeep Kaur, Feng Xia
Comments: 10 pages, 7 figures
Journal-ref: The 7th IEEE International Conference on Data Science and Systems (DSS), Dec 20 - 22, 2021, Haikou, China
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[467] arXiv:2402.03753 [pdf, other]
Title: Enhanced sampling of robust molecular datasets with uncertainty-based collective variables
Aik Rui Tan, Johannes C. B. Dietschreit, Rafael Gomez-Bombarelli
Comments: 13 pages, 4 figures, 10 pages of Supplementary Information
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[468] arXiv:2402.03770 [pdf, other]
Title: Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes
Xiaoxin Su, Yipeng Zhou, Laizhong Cui, John C.S. Lui, Jiangchuan Liu
Comments: To appear in 2024 IEEE International Conference on Computer Communications(INFOCOM 2024)
Subjects: Machine Learning (cs.LG)
[469] arXiv:2402.03771 [pdf, html, other]
Title: Reinforcement Learning from Bagged Reward
Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama
Subjects: Machine Learning (cs.LG)
[470] arXiv:2402.03774 [pdf, html, other]
Title: Learning a Decision Tree Algorithm with Transformers
Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[471] arXiv:2402.03784 [pdf, html, other]
Title: AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction
Kethmi Hirushini Hettige, Jiahao Ji, Shili Xiang, Cheng Long, Gao Cong, Jingyuan Wang
Comments: Accepted by the 12th International Conference on Learning Representations (ICLR 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applied Physics (physics.app-ph)
[472] arXiv:2402.03785 [pdf, other]
Title: Weakly Supervised Anomaly Detection via Knowledge-Data Alignment
Haihong Zhao, Chenyi Zi, Yang Liu, Chen Zhang, Yan Zhou, Jia Li
Comments: Accepted by WWW 2024
Subjects: Machine Learning (cs.LG)
[473] arXiv:2402.03792 [pdf, other]
Title: No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran, Alberto Maria Metelli, Matteo Papini, Marcello Restell
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[474] arXiv:2402.03804 [pdf, other]
Title: ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
Zhengyan Zhang, Yixin Song, Guanghui Yu, Xu Han, Yankai Lin, Chaojun Xiao, Chenyang Song, Zhiyuan Liu, Zeyu Mi, Maosong Sun
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[475] arXiv:2402.03807 [pdf, html, other]
Title: SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu
Comments: To appear in ICLR2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[476] arXiv:2402.03814 [pdf, other]
Title: Masked Graph Autoencoder with Non-discrete Bandwidths
Ziwen Zhao, Yuhua Li, Yixiong Zou, Jiliang Tang, Ruixuan Li
Comments: Full version (17 pages, 8 figures, 12 tables), accepted by TheWebConf 2024 (WWW 2024)
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[477] arXiv:2402.03815 [pdf, other]
Title: Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression
Xiaoxin Su, Yipeng Zhou, Laizhong Cui, Song Guo
Comments: To appear in 2024 IEEE International Conference on Computer Communications(INFOCOM 2024)
Subjects: Machine Learning (cs.LG)
[478] arXiv:2402.03818 [pdf, html, other]
Title: Asymptotic generalization error of a single-layer graph convolutional network
O. Duranthon, L. Zdeborová
Journal-ref: Proceedings of the Third Learning on Graphs Conference (LoG 2024), PMLR 269
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn)
[479] arXiv:2402.03828 [pdf, other]
Title: Estimating Barycenters of Distributions with Neural Optimal Transport
Alexander Kolesov, Petr Mokrov, Igor Udovichenko, Milena Gazdieva, Gudmund Pammer, Evgeny Burnaev, Alexander Korotin
Subjects: Machine Learning (cs.LG)
[480] arXiv:2402.03845 [pdf, other]
Title: On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models
Christian Horvat, Jean-Pascal Pfister
Subjects: Machine Learning (cs.LG)
[481] arXiv:2402.03846 [pdf, html, other]
Title: Efficient Generation of Hidden Outliers for Improved Outlier Detection
Jose Cribeiro-Ramallo, Vadim Arzamasov, Klemens Böhm
Comments: Preprint. Full paper is scheduled to appear in TKDD; Updated results in table 4
Subjects: Machine Learning (cs.LG)
[482] arXiv:2402.03855 [pdf, html, other]
Title: Challenges in Mechanistically Interpreting Model Representations
Satvik Golechha, James Dao
Comments: 9 pages, ICML 2024 Workshop on Mechanistic Interpretability
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[483] arXiv:2402.03864 [pdf, html, other]
Title: The Challenges of the Nonlinear Regime for Physics-Informed Neural Networks
Andrea Bonfanti, Giuseppe Bruno, Cristina Cipriani
Comments: 10 pages, 4 figures, appendix of 12 additional pages
Subjects: Machine Learning (cs.LG)
[484] arXiv:2402.03885 [pdf, html, other]
Title: MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami, Konrad Szafer, Arjun Choudhry, Yifu Cai, Shuo Li, Artur Dubrawski
Comments: Accepted at ICML'24. This is a revision. See changelog in the Appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[485] arXiv:2402.03902 [pdf, html, other]
Title: A phase transition between positional and semantic learning in a solvable model of dot-product attention
Hugo Cui, Freya Behrens, Florent Krzakala, Lenka Zdeborová
Journal-ref: Advances in Neural Information Processing Systems 37 (NeurIPS 2024)
Subjects: Machine Learning (cs.LG)
[486] arXiv:2402.03903 [pdf, html, other]
Title: Averaging $n$-step Returns Reduces Variance in Reinforcement Learning
Brett Daley, Martha White, Marlos C. Machado
Comments: ICML 2024. 27 pages, 7 figures, 3 tables
Subjects: Machine Learning (cs.LG)
[487] arXiv:2402.03905 [pdf, other]
Title: Employee Turnover Analysis Using Machine Learning Algorithms
Mahyar Karimi, Kamyar Seyedkazem Viliyani
Comments: 6 pages, 11 feagures, 2 tables
Subjects: Machine Learning (cs.LG)
[488] arXiv:2402.03915 [pdf, html, other]
Title: Learning Metrics that Maximise Power for Accelerated A/B-Tests
Olivier Jeunen, Aleksei Ustimenko
Comments: To appear in the Applied Data Science track at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24)
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Applications (stat.AP); Machine Learning (stat.ML)
[489] arXiv:2402.03921 [pdf, other]
Title: Large Language Models to Enhance Bayesian Optimization
Tennison Liu, Nicolás Astorga, Nabeel Seedat, Mihaela van der Schaar
Comments: Accepted as Poster at ICLR2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[490] arXiv:2402.03923 [pdf, html, other]
Title: Return-Aligned Decision Transformer
Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra
Subjects: Machine Learning (cs.LG)
[491] arXiv:2402.03941 [pdf, html, other]
Title: Discovery of the Hidden World with Large Language Models
Chenxi Liu, Yongqiang Chen, Tongliang Liu, Mingming Gong, James Cheng, Bo Han, Kun Zhang
Comments: NeurIPS 2024; Chenxi and Yongqiang contributed equally; 59 pages, 72 figures; Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[492] arXiv:2402.03966 [pdf, other]
Title: On dimensionality of feature vectors in MPNNs
César Bravo, Alexander Kozachinskiy, Cristóbal Rojas
Comments: 15 pages, 2 figures. Changes to the previous version: added reference to Amir et al.~(NeurIPS'23)
Subjects: Machine Learning (cs.LG)
[493] arXiv:2402.03969 [pdf, other]
Title: In-context learning agents are asymmetric belief updaters
Johannes A. Schubert, Akshay K. Jagadish, Marcel Binz, Eric Schulz
Subjects: Machine Learning (cs.LG)
[494] arXiv:2402.03970 [pdf, html, other]
Title: Is Deep Learning finally better than Decision Trees on Tabular Data?
Guri Zabërgja, Arlind Kadra, Christian M. M. Frey, Josif Grabocka
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[495] arXiv:2402.03979 [pdf, html, other]
Title: Cross Entropy versus Label Smoothing: A Neural Collapse Perspective
Li Guo, George Andriopoulos, Zifan Zhao, Shuyang Ling, Zixuan Dong, Keith Ross
Journal-ref: Published in Transactions on Machine Learning Research(05/2025)
Subjects: Machine Learning (cs.LG)
[496] arXiv:2402.03985 [pdf, html, other]
Title: A Bias-Variance Decomposition for Ensembles over Multiple Synthetic Datasets
Ossi Räisä, Antti Honkela
Comments: AISTATS 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[497] arXiv:2402.03991 [pdf, html, other]
Title: Provable Emergence of Deep Neural Collapse and Low-Rank Bias in $L^2$-Regularized Nonlinear Networks
Emanuele Zangrando, Piero Deidda, Simone Brugiapaglia, Nicola Guglielmi, Francesco Tudisco
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[498] arXiv:2402.03992 [pdf, html, other]
Title: Space Group Constrained Crystal Generation
Rui Jiao, Wenbing Huang, Yu Liu, Deli Zhao, Yang Liu
Comments: ICLR 2024 poster
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[499] arXiv:2402.03994 [pdf, html, other]
Title: Efficient Sketches for Training Data Attribution and Studying the Loss Landscape
Andrea Schioppa
Journal-ref: Neurips 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[500] arXiv:2402.04004 [pdf, html, other]
Title: Understanding the Effect of Noise in LLM Training Data with Algorithmic Chains of Thought
Alex Havrilla, Maia Iyer
Subjects: Machine Learning (cs.LG)
[501] arXiv:2402.04005 [pdf, html, other]
Title: Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning
Idan Achituve, Idit Diamant, Arnon Netzer, Gal Chechik, Ethan Fetaya
Subjects: Machine Learning (cs.LG)
[502] arXiv:2402.04010 [pdf, other]
Title: Efficient Availability Attacks against Supervised and Contrastive Learning Simultaneously
Yihan Wang, Yifan Zhu, Xiao-Shan Gao
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[503] arXiv:2402.04019 [pdf, other]
Title: Exploring the Effects of Population and Employment Characteristics on Truck Flows: An Analysis of NextGen NHTS Origin-Destination Data
Majbah Uddin, Yuandong Liu, Hyeonsup Lim
Journal-ref: In International Conference on Transportation and Development 2023 (pp. 503-513)
Subjects: Machine Learning (cs.LG)
[504] arXiv:2402.04029 [pdf, html, other]
Title: Positive concave deep equilibrium models
Mateusz Gabor, Tomasz Piotrowski, Renato L. G. Cavalcante
Subjects: Machine Learning (cs.LG)
[505] arXiv:2402.04030 [pdf, other]
Title: Reducing the Cost of Quantum Chemical Data By Backpropagating Through Density Functional Theory
Alexander Mathiasen, Hatem Helal, Paul Balanca, Adam Krzywaniak, Ali Parviz, Frederik Hvilshøj, Blazej Banaszewski, Carlo Luschi, Andrew William Fitzgibbon
Subjects: Machine Learning (cs.LG)
[506] arXiv:2402.04033 [pdf, html, other]
Title: On provable privacy vulnerabilities of graph representations
Ruofan Wu, Guanhua Fang, Qiying Pan, Mingyang Zhang, Tengfei Liu, Weiqiang Wang
Subjects: Machine Learning (cs.LG)
[507] arXiv:2402.04050 [pdf, html, other]
Title: Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models
Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[508] arXiv:2402.04051 [pdf, html, other]
Title: Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito, Masanori Yamada, Atsutoshi Kumagai
Comments: In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR 2025)
Subjects: Machine Learning (cs.LG)
[509] arXiv:2402.04054 [pdf, html, other]
Title: More Flexible PAC-Bayesian Meta-Learning by Learning Learning Algorithms
Hossein Zakerinia, Amin Behjati, Christoph H. Lampert
Comments: International Conference on Machine Learning (ICML), 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[510] arXiv:2402.04059 [pdf, html, other]
Title: Deep Learning for Multivariate Time Series Imputation: A Survey
Jun Wang, Wenjie Du, Yiyuan Yang, Linglong Qian, Wei Cao, Keli Zhang, Wenjia Wang, Yuxuan Liang, Qingsong Wen
Comments: Accepted by IJCAI 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[511] arXiv:2402.04062 [pdf, other]
Title: Link Prediction with Relational Hypergraphs
Xingyue Huang, Miguel Romero Orth, Pablo Barceló, Michael M. Bronstein, İsmail İlkan Ceylan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[512] arXiv:2402.04068 [pdf, html, other]
Title: Retrieve to Explain: Evidence-driven Predictions for Explainable Drug Target Identification
Ravi Patel, Angus Brayne, Rogier Hintzen, Daniel Jaroslawicz, Georgiana Neculae, Dane Corneil
Comments: Accepted at ACL 2025 (The 63rd Annual Meeting of the Association for Computational Linguistics)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[513] arXiv:2402.04080 [pdf, html, other]
Title: Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Ruoqi Zhang, Ziwei Luo, Jens Sjölund, Thomas B. Schön, Per Mattsson
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[514] arXiv:2402.04081 [pdf, html, other]
Title: Improved Generalization of Weight Space Networks via Augmentations
Aviv Shamsian, Aviv Navon, David W. Zhang, Yan Zhang, Ethan Fetaya, Gal Chechik, Haggai Maron
Comments: Under Review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[515] arXiv:2402.04082 [pdf, other]
Title: An Optimal House Price Prediction Algorithm: XGBoost
Hemlata Sharma, Hitesh Harsora, Bayode Ogunleye
Comments: 16 pages, Journal of Analytics
Journal-ref: Analytics, 3(1), 30-45 (2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Methodology (stat.ME)
[516] arXiv:2402.04084 [pdf, other]
Title: Provably learning a multi-head attention layer
Sitan Chen, Yuanzhi Li
Comments: 105 pages, comments welcome
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[517] arXiv:2402.04103 [pdf, other]
Title: An Exploration of Clustering Algorithms for Customer Segmentation in the UK Retail Market
Jeen Mary John, Olamilekan Shobayo, Bayode Ogunleye
Comments: 15 pages, Journal of Analytics
Journal-ref: Analytics, 2(4), 809-823 (2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Computation (stat.CO)
[518] arXiv:2402.04108 [pdf, other]
Title: Hierarchical Delay Attribution Classification using Unstructured Text in Train Management Systems
Anton Borg, Per Lingvall, Martin Svensson
Comments: 22 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[519] arXiv:2402.04119 [pdf, html, other]
Title: A quantitative analysis of knowledge-learning preferences in large language models in molecular science
Pengfei Liu, Jun Tao, Zhixiang Ren
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[520] arXiv:2402.04129 [pdf, html, other]
Title: OVOR: OnePrompt with Virtual Outlier Regularization for Rehearsal-Free Class-Incremental Learning
Wei-Cheng Huang, Chun-Fu Chen, Hsiang Hsu
Comments: Accepted by ICLR 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[521] arXiv:2402.04161 [pdf, html, other]
Title: Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains
Ashok Vardhan Makkuva, Marco Bondaschi, Adway Girish, Alliot Nagle, Martin Jaggi, Hyeji Kim, Michael Gastpar
Comments: Published at ICLR 2025 under the title "Attention with Markov: A Curious Case of Single-Layer Transformers"
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Theory (cs.IT); Machine Learning (stat.ML)
[522] arXiv:2402.04163 [pdf, other]
Title: Tempered Calculus for ML: Application to Hyperbolic Model Embedding
Richard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen, Manfred K. Warmuth
Comments: Subsumed by paper "Hyperbolic Embeddings of Supervised Models" by Richard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen and Manfred K. Warmuth, appearing at NeurIPS'24
Subjects: Machine Learning (cs.LG)
[523] arXiv:2402.04168 [pdf, html, other]
Title: Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptions
Daniel Bogdoll, Jing Qin, Moritz Nekolla, Ahmed Abouelazm, Tim Joseph, J. Marius Zöllner
Comments: Daniel Bogdoll and Jing Qin contributed equally. Accepted for publication at ICRA 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[524] arXiv:2402.04182 [pdf, other]
Title: Reinforcement Learning with Ensemble Model Predictive Safety Certification
Sven Gronauer, Tom Haider, Felippe Schmoeller da Roza, Klaus Diepold
Comments: Published in: Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[525] arXiv:2402.04193 [pdf, html, other]
Title: Gradient Coding in Decentralized Learning for Evading Stragglers
Chengxi Li, Mikael Skoglund
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[526] arXiv:2402.04209 [pdf, other]
Title: Acute kidney injury prediction for non-critical care patients: a retrospective external and internal validation study
Esra Adiyeke, Yuanfang Ren, Benjamin Shickel, Matthew M. Ruppert, Ziyuan Guan, Sandra L. Kane-Gill, Raghavan Murugan, Nabihah Amatullah, Britney A. Stottlemyer, Tiffany L. Tran, Dan Ricketts, Christopher M Horvat, Parisa Rashidi, Azra Bihorac, Tezcan Ozrazgat-Baslanti
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[527] arXiv:2402.04211 [pdf, other]
Title: Probabilistic Shapley Value Modeling and Inference
Mert Ketenci, Iñigo Urteaga, Victor Alfonso Rodriguez, Noémie Elhadad, Adler Perotte
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[528] arXiv:2402.04229 [pdf, other]
Title: MusicRL: Aligning Music Generation to Human Preferences
Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[529] arXiv:2402.04239 [pdf, html, other]
Title: CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers
Adjorn van Engelenhoven, Nicola Strisciuglio, Estefanía Talavera
Subjects: Machine Learning (cs.LG)
[530] arXiv:2402.04248 [pdf, html, other]
Title: Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
Jongho Park, Jaeseung Park, Zheyang Xiong, Nayoung Lee, Jaewoong Cho, Samet Oymak, Kangwook Lee, Dimitris Papailiopoulos
Comments: Changes in v2: experiments on formal language ICL and explorations of width vs. depth on ICL; code repo available (24 pages, 10 figures)
Subjects: Machine Learning (cs.LG)
[531] arXiv:2402.04249 [pdf, html, other]
Title: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Mantas Mazeika, Long Phan, Xuwang Yin, Andy Zou, Zifan Wang, Norman Mu, Elham Sakhaee, Nathaniel Li, Steven Basart, Bo Li, David Forsyth, Dan Hendrycks
Comments: Website: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[532] arXiv:2402.04284 [pdf, html, other]
Title: PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks
Junwei Su, Difan Zou, Chuan Wu
Subjects: Machine Learning (cs.LG)
[533] arXiv:2402.04290 [pdf, html, other]
Title: CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling
Junchao Gong, Lei Bai, Peng Ye, Wanghan Xu, Na Liu, Jianhua Dai, Xiaokang Yang, Wanli Ouyang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[534] arXiv:2402.04291 [pdf, html, other]
Title: BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Wei Huang, Yangdong Liu, Haotong Qin, Ying Li, Shiming Zhang, Xianglong Liu, Michele Magno, Xiaojuan Qi
Comments: 19 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[535] arXiv:2402.04292 [pdf, other]
Title: AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
Xixi Hu, Bo Liu, Xingchao Liu, Qiang Liu
Comments: NeuRIPS 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[536] arXiv:2402.04296 [pdf, other]
Title: LightHGNN: Distilling Hypergraph Neural Networks into MLPs for $100\times$ Faster Inference
Yifan Feng, Yihe Luo, Shihui Ying, Yue Gao
Comments: Some details are missing. The method of this paper is not complete
Subjects: Machine Learning (cs.LG)
[537] arXiv:2402.04298 [pdf, html, other]
Title: Multi-View Symbolic Regression
Etienne Russeil, Fabrício Olivetti de França, Konstantin Malanchev, Bogdan Burlacu, Emille E. O. Ishida, Marion Leroux, Clément Michelin, Guillaume Moinard, Emmanuel Gangler
Comments: Published in GECCO-2024. 11 pages, 5 figures
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); Applications (stat.AP)
[538] arXiv:2402.04325 [pdf, html, other]
Title: Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons
Zhenyu Liu, Garrett Gagnon, Swagath Venkataramani, Liu Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[539] arXiv:2402.04344 [pdf, html, other]
Title: Does confidence calibration improve conformal prediction?
Huajun Xi, Jianguo Huang, Kangdao Liu, Lei Feng, Hongxin Wei
Subjects: Machine Learning (cs.LG)
[540] arXiv:2402.04347 [pdf, html, other]
Title: The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
Michael Zhang, Kush Bhatia, Hermann Kumbong, Christopher Ré
Comments: 30 pages, 20 figures, 15 tables, ICLR 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[541] arXiv:2402.04359 [pdf, html, other]
Title: Adaptive Inference: Theoretical Limits and Unexplored Opportunities
Soheil Hor, Ying Qian, Mert Pilanci, Amin Arbabian
Subjects: Machine Learning (cs.LG)
[542] arXiv:2402.04362 [pdf, html, other]
Title: Neural Networks Learn Statistics of Increasing Complexity
Nora Belrose, Quintin Pope, Lucia Quirke, Alex Mallen, Xiaoli Fern
Subjects: Machine Learning (cs.LG)
[543] arXiv:2402.04375 [pdf, html, other]
Title: Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic Data
Yvonne Zhou, Mingyu Liang, Ivan Brugere, Dana Dachman-Soled, Danial Dervovic, Antigoni Polychroniadou, Min Wu
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[544] arXiv:2402.04376 [pdf, html, other]
Title: Scaling laws for learning with real and surrogate data
Ayush Jain, Andrea Montanari, Eren Sasoglu
Comments: Added new experiment and minor changes
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[545] arXiv:2402.04377 [pdf, html, other]
Title: NeRCC: Nested-Regression Coded Computing for Resilient Distributed Prediction Serving Systems
Parsa Moradi, Mohammad Ali Maddah-Ali
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[546] arXiv:2402.04379 [pdf, html, other]
Title: Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Nate Gruver, Anuroop Sriram, Andrea Madotto, Andrew Gordon Wilson, C. Lawrence Zitnick, Zachary Ulissi
Comments: ICLR 2024. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[547] arXiv:2402.04383 [pdf, html, other]
Title: FairWire: Fair Graph Generation
O. Deniz Kose, Yanning Shen
Comments: 16 pages, 1 figure, 7 tables
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[548] arXiv:2402.04384 [pdf, other]
Title: Denoising Diffusion Probabilistic Models in Six Simple Steps
Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[549] arXiv:2402.04390 [pdf, other]
Title: Densely Multiplied Physics Informed Neural Networks
Feilong Jiang, Xiaonan Hou, Min Xia
Comments: 15 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[550] arXiv:2402.04396 [pdf, html, other]
Title: QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks
Albert Tseng, Jerry Chee, Qingyao Sun, Volodymyr Kuleshov, Christopher De Sa
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[551] arXiv:2402.04398 [pdf, html, other]
Title: Learning under Temporal Label Noise
Sujay Nagaraj, Walter Gerych, Sana Tonekaboni, Anna Goldenberg, Berk Ustun, Thomas Hartvigsen
Comments: The Thirteenth International Conference on Learning Representations (ICLR 2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[552] arXiv:2402.04400 [pdf, html, other]
Title: CEHR-GPT: Generating Electronic Health Records with Chronological Patient Timelines
Chao Pang, Xinzhuo Jiang, Nishanth Parameshwar Pavinkurve, Krishna S. Kalluri, Elise L. Minto, Jason Patterson, Linying Zhang, George Hripcsak, Gamze Gürsoy, Noémie Elhadad, Karthik Natarajan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[553] arXiv:2402.04409 [pdf, html, other]
Title: Towards Fair, Robust and Efficient Client Contribution Evaluation in Federated Learning
Meiying Zhang, Huan Zhao, Sheldon Ebron, Kan Yang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[554] arXiv:2402.04412 [pdf, other]
Title: The VampPrior Mixture Model
Andrew A. Stirn, David A. Knowles
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[555] arXiv:2402.04417 [pdf, html, other]
Title: Decentralized Blockchain-based Robust Multi-agent Multi-armed Bandit
Mengfan Xu, Diego Klabjan
Comments: 45 pages
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[556] arXiv:2402.04435 [pdf, html, other]
Title: PreGIP: Watermarking the Pretraining of Graph Neural Networks for Deep Intellectual Property Protection
Enyan Dai, Minhua Lin, Suhang Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[557] arXiv:2402.04440 [pdf, html, other]
Title: Exploring higher-order neural network node interactions with total correlation
Thomas Kerby, Teresa White, Kevin Moon
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[558] arXiv:2402.04467 [pdf, html, other]
Title: DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems
Yair Schiff, Zhong Yi Wan, Jeffrey B. Parker, Stephan Hoyer, Volodymyr Kuleshov, Fei Sha, Leonardo Zepeda-Núñez
Comments: ICML 2024; Code to reproduce our experiments is available at this https URL
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS)
[559] arXiv:2402.04469 [pdf, html, other]
Title: IoT Network Traffic Analysis with Deep Learning
Mei Liu, Leon Yang
Comments: PerCom 2024 Workshop
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Networking and Internet Architecture (cs.NI)
[560] arXiv:2402.04485 [pdf, html, other]
Title: Incentivized Truthful Communication for Federated Bandits
Zhepei Wei, Chuanhao Li, Tianze Ren, Haifeng Xu, Hongning Wang
Comments: 20 pages, 2 figures. Accepted at ICLR 2024
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
[561] arXiv:2402.04489 [pdf, html, other]
Title: De-amplifying Bias from Differential Privacy in Language Model Fine-tuning
Sanjari Srivastava, Piotr Mardziel, Zhikhun Zhang, Archana Ahlawat, Anupam Datta, John C Mitchell
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY); Methodology (stat.ME)
[562] arXiv:2402.04494 [pdf, html, other]
Title: Amortized Planning with Large-Scale Transformers: A Case Study on Chess
Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Li Kevin Wenliang, Elliot Catt, John Reid, Cannada A. Lewis, Joel Veness, Tim Genewein
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[563] arXiv:2402.04497 [pdf, other]
Title: The Fine-Grained Complexity of Gradient Computation for Training Large Language Models
Josh Alman, Zhao Song
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Computation and Language (cs.CL); Data Structures and Algorithms (cs.DS)
[564] arXiv:2402.04513 [pdf, html, other]
Title: Online Cascade Learning for Efficient Inference over Streams
Lunyiu Nie, Zhimin Ding, Erdong Hu, Christopher Jermaine, Swarat Chaudhuri
Comments: ICML 2024 Main Conference Paper
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[565] arXiv:2402.04520 [pdf, html, other]
Title: On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis
Jerry Yao-Chieh Hu, Thomas Lin, Zhao Song, Han Liu
Comments: Accepted at ICML 2024; v2 corrected typos; v3 added clarifications and references; v4,5 updated to camera-ready version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[566] arXiv:2402.04523 [pdf, html, other]
Title: SumRec: A Framework for Recommendation using Open-Domain Dialogue
Ryutaro Asahara, Masaki Takahashi, Chiho Iwahashi, Michimasa Inaba
Comments: Accepted to PACLIC 2023
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[567] arXiv:2402.04538 [pdf, html, other]
Title: Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers
Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian
Comments: ICML'24 Accepted Version, 25 pages, 10 figures, 18 tables
Journal-ref: PMLR 235:20768- 20792, 2024
Subjects: Machine Learning (cs.LG)
[568] arXiv:2402.04539 [pdf, other]
Title: Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang, Faguo Wu, Xiao Zhang, Jianxiang Liu
Comments: 23 pages, 19 figures
Journal-ref: International Journal of Intelligent Systems, Volume 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[569] arXiv:2402.04553 [pdf, other]
Title: Curvature-Informed SGD via General Purpose Lie-Group Preconditioners
Omead Pooladzandi, Xi-Lin Li
Subjects: Machine Learning (cs.LG)
[570] arXiv:2402.04567 [pdf, other]
Title: OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
Chen Wang, Sarah Erfani, Tansu Alpcan, Christopher Leckie
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[571] arXiv:2402.04579 [pdf, other]
Title: Collective Counterfactual Explanations via Optimal Transport
Ahmad-Reza Ehyaei, Ali Shirali, Samira Samadi
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[572] arXiv:2402.04596 [pdf, other]
Title: Towards Improved Imbalance Robustness in Continual Multi-Label Learning with Dual Output Spiking Architecture (DOSA)
Sourav Mishra, Shirin Dora, Suresh Sundaram
Comments: 8 pages, 4 figures, 4 tables, 45 references. Submitted to IJCNN 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[573] arXiv:2402.04621 [pdf, html, other]
Title: Feature Distribution on Graph Topology Mediates the Effect of Graph Convolution: Homophily Perspective
Soo Yong Lee, Sunwoo Kim, Fanchen Bu, Jaemin Yoo, Jiliang Tang, Kijung Shin
Comments: published in ICML 2024
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[574] arXiv:2402.04640 [pdf, other]
Title: Domain Bridge: Generative model-based domain forensic for black-box models
Jiyi Zhang, Han Fang, Ee-Chien Chang
Subjects: Machine Learning (cs.LG)
[575] arXiv:2402.04644 [pdf, html, other]
Title: LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views
Yuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yujin Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao
Comments: In Proceedings of the 41st International Conference on Machine Learning (ICML), 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[576] arXiv:2402.04646 [pdf, html, other]
Title: Block Sparse Bayesian Learning: A Diversified Scheme
Yanhao Zhang, Zhihan Zhu, Yong Xia
Comments: Accepted to NeurIPS 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[577] arXiv:2402.04647 [pdf, html, other]
Title: Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference
Deqian Kong, Dehong Xu, Minglu Zhao, Bo Pang, Jianwen Xie, Andrew Lizarraga, Yuhao Huang, Sirui Xie, Ying Nian Wu
Subjects: Machine Learning (cs.LG)
[578] arXiv:2402.04653 [pdf, other]
Title: An Over Complete Deep Learning Method for Inverse Problems
Moshe Eliasof, Eldad Haber, Eran Treister
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[579] arXiv:2402.04655 [pdf, html, other]
Title: Open-Vocabulary Calibration for Fine-tuned CLIP
Shuoyuan Wang, Jindong Wang, Guoqing Wang, Bob Zhang, Kaiyang Zhou, Hongxin Wei
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG)
[580] arXiv:2402.04668 [pdf, other]
Title: A Perspective on Individualized Treatment Effects Estimation from Time-series Health Data
Ghadeer O. Ghosheh, Moritz Gögl, Tingting Zhu
Subjects: Machine Learning (cs.LG)
[581] arXiv:2402.04676 [pdf, html, other]
Title: Group Distributionally Robust Dataset Distillation with Risk Minimization
Saeed Vahidian, Mingyu Wang, Jianyang Gu, Vyacheslav Kungurtsev, Wei Jiang, Yiran Chen
Comments: ICLR 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[582] arXiv:2402.04710 [pdf, html, other]
Title: Incorporating Retrieval-based Causal Learning with Information Bottlenecks for Interpretable Graph Neural Networks
Jiahua Rao, Jiancong Xie, Hanjing Lin, Shuangjia Zheng, Zhen Wang, Yuedong Yang
Subjects: Machine Learning (cs.LG)
[583] arXiv:2402.04732 [pdf, html, other]
Title: Graph Cuts with Arbitrary Size Constraints Through Optimal Transport
Chakib Fettal, Lazhar Labiod, Mohamed Nadif
Comments: Published in Transactions on Machine Learning Research
Subjects: Machine Learning (cs.LG)
[584] arXiv:2402.04744 [pdf, html, other]
Title: Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Abhimanyu Rajeshkumar Bambhaniya, Amir Yazdanbakhsh, Suvinay Subramanian, Sheng-Chun Kao, Shivani Agrawal, Utku Evci, Tushar Krishna
Comments: 18 pages, 8 figures, 17 tables. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[585] arXiv:2402.04764 [pdf, other]
Title: Code as Reward: Empowering Reinforcement Learning with VLMs
David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand
Subjects: Machine Learning (cs.LG)
[586] arXiv:2402.04783 [pdf, other]
Title: Analyzing the Neural Tangent Kernel of Periodically Activated Coordinate Networks
Hemanth Saratchandran, Shin-Fang Chng, Simon Lucey
Comments: arXiv admin note: substantial text overlap with arXiv:2402.02711
Subjects: Machine Learning (cs.LG)
[587] arXiv:2402.04794 [pdf, html, other]
Title: Scalable Multi-view Clustering via Explicit Kernel Features Maps
Chakib Fettal, Lazhar Labiod, Mohamed Nadif
Subjects: Machine Learning (cs.LG)
[588] arXiv:2402.04814 [pdf, html, other]
Title: BOWL: A Deceptively Simple Open World Learner
Roshni .R. Kamath, Rupert Mitchell, Subarnaduti Paul, Kristian Kersting, Martin Mundt
Subjects: Machine Learning (cs.LG)
[589] arXiv:2402.04821 [pdf, other]
Title: E(3)-Equivariant Mesh Neural Networks
Thuan Trang, Nhat Khang Ngo, Daniel Levy, Thieu N. Vo, Siamak Ravanbakhsh, Truong Son Hy
Subjects: Machine Learning (cs.LG)
[590] arXiv:2402.04823 [pdf, other]
Title: How Realistic Is Your Synthetic Data? Constraining Deep Generative Models for Tabular Data
Mihaela Cătălina Stoian, Salijona Dyrmishi, Maxime Cordy, Thomas Lukasiewicz, Eleonora Giunchiglia
Comments: Accepted at ICLR 2024
Subjects: Machine Learning (cs.LG)
[591] arXiv:2402.04830 [pdf, html, other]
Title: Closing the Gap Between SGP4 and High-Precision Propagation via Differentiable Programming
Giacomo Acciarini, Atılım Güneş Baydin, Dario Izzo
Journal-ref: Acta Astronautica 226(1) (2025) 8
Subjects: Machine Learning (cs.LG); Earth and Planetary Astrophysics (astro-ph.EP)
[592] arXiv:2402.04836 [pdf, html, other]
Title: On the Completeness of Invariant Geometric Deep Learning Models
Zian Li, Xiyuan Wang, Shijia Kang, Muhan Zhang
Comments: The Thirteenth International Conference on Learning Representations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[593] arXiv:2402.04852 [pdf, html, other]
Title: Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning
Yuxuan Bian, Xuan Ju, Jiangtong Li, Zhijian Xu, Dawei Cheng, Qiang Xu
Subjects: Machine Learning (cs.LG)
[594] arXiv:2402.04869 [pdf, html, other]
Title: Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy
Ruichu Cai, Siyang Huang, Jie Qiao, Wei Chen, Yan Zeng, Keli Zhang, Fuchun Sun, Yang Yu, Zhifeng Hao
Comments: Accepted by Science China Information Sciences
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[595] arXiv:2402.04875 [pdf, html, other]
Title: On Provable Length and Compositional Generalization
Kartik Ahuja, Amin Mansouri
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[596] arXiv:2402.04902 [pdf, html, other]
Title: L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models
Hyesung Jeon, Yulhwa Kim, Jae-joon Kim
Comments: 9 pages, 4 figures, 3 tables
Journal-ref: ACL (main) 2025
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[597] arXiv:2402.04906 [pdf, html, other]
Title: Conformal Convolution and Monte Carlo Meta-learners for Predictive Inference of Individual Treatment Effects
Jef Jonkers, Jarne Verhaeghe, Glenn Van Wallendael, Luc Duchateau, Sofie Van Hoecke
Comments: Major update (rescope to distributional regression in counterfactual inference)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[598] arXiv:2402.04915 [pdf, html, other]
Title: Moco: A Learnable Meta Optimizer for Combinatorial Optimization
Tim Dernedde, Daniela Thyssens, Sören Dittrich, Maximilian Stubbemann, Lars Schmidt-Thieme
Comments: 20 pages, 2 figures. A prior version was published in Advances in Knowledge Discovery and Data Mining. PAKDD 2025. Lecture Notes in Computer Science, vol 15872. Springer, Singapore
Subjects: Machine Learning (cs.LG)
[599] arXiv:2402.04924 [pdf, other]
Title: Two Trades is not Baffled: Condensing Graph via Crafting Rational Gradient Matching
Tianle Zhang, Yuchen Zhang, Kun Wang, Kai Wang, Beining Yang, Kaipeng Zhang, Wenqi Shao, Ping Liu, Joey Tianyi Zhou, Yang You
Comments: An effective method for graph condensation
Subjects: Machine Learning (cs.LG)
[600] arXiv:2402.04933 [pdf, html, other]
Title: Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits
Biyonka Liang, Lily Xu, Aparna Taneja, Milind Tambe, Lucas Janson
Comments: 29 pages, 18 figures
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[601] arXiv:2402.04982 [pdf, other]
Title: Beyond explaining: XAI-based Adaptive Learning with SHAP Clustering for Energy Consumption Prediction
Tobias Clement, Hung Truong Thanh Nguyen, Nils Kemmerzell, Mohamed Abdelaal, Davor Stjelja
Comments: A short version of this paper was published at the Australasian Joint Conference on Artificial Intelligence in 2023
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[602] arXiv:2402.04987 [pdf, other]
Title: PriorBoost: An Adaptive Algorithm for Learning from Aggregate Responses
Adel Javanmard, Matthew Fahrbach, Vahab Mirrokni
Comments: 29 pages, 4 figures
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[603] arXiv:2402.05002 [pdf, html, other]
Title: Randomized Confidence Bounds for Stochastic Partial Monitoring
Maxime Heuillet, Ola Ahmad, Audrey Durand
Subjects: Machine Learning (cs.LG)
[604] arXiv:2402.05007 [pdf, other]
Title: Example-based Explanations for Random Forests using Machine Unlearning
Tanmay Surve, Romila Pradhan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[605] arXiv:2402.05011 [pdf, html, other]
Title: Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching
Yuchen Zhang, Tianle Zhang, Kai Wang, Ziyao Guo, Yuxuan Liang, Xavier Bresson, Wei Jin, Yang You
Comments: Lossless graph condensation method
Subjects: Machine Learning (cs.LG)
[606] arXiv:2402.05013 [pdf, other]
Title: Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth
Kevin Kögler, Alexander Shevchenko, Hamed Hassani, Marco Mondelli
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[607] arXiv:2402.05015 [pdf, other]
Title: A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?
Agustinus Kristiadi, Felix Strieth-Kalthoff, Marta Skreta, Pascal Poupart, Alán Aspuru-Guzik, Geoff Pleiss
Comments: ICML 2024. Code: this https URL
Subjects: Machine Learning (cs.LG)
[608] arXiv:2402.05025 [pdf, other]
Title: Strong convexity-guided hyper-parameter optimization for flatter losses
Rahul Yedida, Snehanshu Saha
Comments: v1
Subjects: Machine Learning (cs.LG)
[609] arXiv:2402.05033 [pdf, html, other]
Title: Majority Kernels: An Approach to Leverage Big Model Dynamics for Efficient Small Model Training
Hanna Mazzawi, Pranjal Awasthi, Xavi Gonzalvo, Srikumar Ramalingam
Subjects: Machine Learning (cs.LG)
[610] arXiv:2402.05039 [pdf, other]
Title: PAC Learnability under Explanation-Preserving Graph Perturbations
Xu Zheng, Farhad Shirani, Tianchun Wang, Shouwei Gao, Wenqian Dong, Wei Cheng, Dongsheng Luo
Comments: 21 pages, 6 figures, 4 tables
Subjects: Machine Learning (cs.LG)
[611] arXiv:2402.05050 [pdf, other]
Title: Federated Learning Can Find Friends That Are Advantageous
Nazarii Tupitsa, Samuel Horváth, Martin Takáč, Eduard Gorbunov
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[612] arXiv:2402.05052 [pdf, html, other]
Title: Causal Representation Learning from Multiple Distributions: A General Setting
Kun Zhang, Shaoan Xie, Ignavier Ng, Yujia Zheng
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[613] arXiv:2402.05073 [pdf, other]
Title: NITO: Neural Implicit Fields for Resolution-free Topology Optimization
Amin Heyrani Nobari, Giorgio Giannone, Lyle Regenwetter, Faez Ahmed
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[614] arXiv:2402.05098 [pdf, html, other]
Title: Improved off-policy training of diffusion samplers
Marcin Sendera, Minsu Kim, Sarthak Mittal, Pablo Lemos, Luca Scimeca, Jarrid Rector-Brooks, Alexandre Adam, Yoshua Bengio, Nikolay Malkin
Comments: NeurIPS 2024; code: this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[615] arXiv:2402.05099 [pdf, html, other]
Title: Hydragen: High-Throughput LLM Inference with Shared Prefixes
Jordan Juravsky, Bradley Brown, Ryan Ehrlich, Daniel Y. Fu, Christopher Ré, Azalia Mirhoseini
Subjects: Machine Learning (cs.LG)
[616] arXiv:2402.05109 [pdf, html, other]
Title: Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding
Zachary Ankner, Rishab Parthasarathy, Aniruddha Nrusimha, Christopher Rinard, Jonathan Ragan-Kelley, William Brandon
Subjects: Machine Learning (cs.LG)
[617] arXiv:2402.05110 [pdf, other]
Title: Opening the AI black box: program synthesis via mechanistic interpretability
Eric J. Michaud, Isaac Liao, Vedang Lad, Ziming Liu, Anish Mudide, Chloe Loughridge, Zifan Carl Guo, Tara Rezaei Kheirkhah, Mateja Vukelić, Max Tegmark
Comments: 24 pages
Subjects: Machine Learning (cs.LG)
[618] arXiv:2402.05140 [pdf, html, other]
Title: Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Junhong Shen, Neil Tenenholtz, James Brian Hall, David Alvarez-Melis, Nicolo Fusi
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[619] arXiv:2402.05145 [pdf, html, other]
Title: Online Learning Approach for Survival Analysis
Camila Fernandez (LPSM), Pierre Gaillard (Thoth), Joseph de Vilmarest, Olivier Wintenberger (LPSM (UMR\_8001))
Subjects: Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[620] arXiv:2402.05146 [pdf, html, other]
Title: Compressing Deep Reinforcement Learning Networks with a Dynamic Structured Pruning Method for Autonomous Driving
Wensheng Su, Zhenni Li, Minrui Xu, Jiawen Kang, Dusit Niyato, Shengli Xie
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[621] arXiv:2402.05147 [pdf, html, other]
Title: ApiQ: Finetuning of 2-Bit Quantized Large Language Model
Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz
Comments: more benchmarks and new method, block-wise ApiQ. code: this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[622] arXiv:2402.05149 [pdf, html, other]
Title: FlowPG: Action-constrained Policy Gradient with Normalizing Flows
Janaka Chathuranga Brahmanage, Jiajing Ling, Akshat Kumar
Journal-ref: Thirty-seventh Conference on Neural Information Processing Systems. 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[623] arXiv:2402.05150 [pdf, html, other]
Title: Designing deep neural networks for driver intention recognition
Koen Vellenga, H. Joe Steinhauer, Alexander Karlsson, Göran Falkman, Asli Rhodin, Ashok Koppisetty
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[624] arXiv:2402.05151 [pdf, html, other]
Title: CrashFormer: A Multimodal Architecture to Predict the Risk of Crash
Amin Karimi Monsefi, Pouya Shiri, Ahmad Mohammadshirazi, Nastaran Karimi Monsefi, Ron Davies, Sobhan Moosavi, Rajiv Ramnath
Comments: The paper is accepted In 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI (UrbanAI 23), November 13, 2023, Hamburg, Germany
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[625] arXiv:2402.05153 [pdf, html, other]
Title: Estimating On-road Transportation Carbon Emissions from Open Data of Road Network and Origin-destination Flow Data
Jinwei Zeng, Yu Liu, Jingtao Ding, Jian Yuan, Yong Li
Subjects: Machine Learning (cs.LG)
[626] arXiv:2402.05162 [pdf, html, other]
Title: Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson
Comments: 22 pages, 9 figures. Project page is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[627] arXiv:2402.05164 [pdf, html, other]
Title: A Resource Model For Neural Scaling Law
Jinyeop Song, Ziming Liu, Max Tegmark, Jeff Gore
Comments: 10 pages, 8 figures, Published as a workshop paper at ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[628] arXiv:2402.05173 [pdf, other]
Title: Towards Understanding Inductive Bias in Transformers: A View From Infinity
Itay Lavie, Guy Gur-Ari, Zohar Ringel
Comments: ICML 2024
Journal-ref: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (stat.ML)
[629] arXiv:2402.05203 [pdf, other]
Title: Bellman Conformal Inference: Calibrating Prediction Intervals For Time Series
Zitong Yang, Emmanuel Candès, Lihua Lei
Comments: 17 pages, 4 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[630] arXiv:2402.05232 [pdf, html, other]
Title: Universal Neural Functionals
Allan Zhou, Chelsea Finn, James Harrison
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[631] arXiv:2402.05234 [pdf, html, other]
Title: QGFN: Controllable Greediness with Action Values
Elaine Lau, Stephen Zhewen Lu, Ling Pan, Doina Precup, Emmanuel Bengio
Comments: Accepted by 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG)
[632] arXiv:2402.05252 [pdf, html, other]
Title: Learning Fair Ranking Policies via Differentiable Optimization of Ordered Weighted Averages
My H. Dinh, James Kotary, Ferdinando Fioretto
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[633] arXiv:2402.05264 [pdf, html, other]
Title: AdaBatchGrad: Combining Adaptive Batch Size and Adaptive Step Size
Petr Ostroukhov, Aigerim Zhumabayeva, Chulu Xiang, Alexander Gasnikov, Martin Takáč, Dmitry Kamzolov
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[634] arXiv:2402.05274 [pdf, html, other]
Title: Convergence of Natural Policy Gradient for a Family of Infinite-State Queueing MDPs
Isaac Grosof, Siva Theja Maguluri, R. Srikant
Comments: 32 pages
Subjects: Machine Learning (cs.LG)
[635] arXiv:2402.05275 [pdf, other]
Title: Exploring Hierarchical Classification Performance for Time Series Data: Dissimilarity Measures and Classifier Comparisons
Celal Alagoz
Comments: 9 pages, 2 figures, 5th International Mediterranean Congress 1, 1367-1376
Subjects: Machine Learning (cs.LG)
[636] arXiv:2402.05279 [pdf, html, other]
Title: Safety Filters for Black-Box Dynamical Systems by Learning Discriminating Hyperplanes
Will Lavanakul, Jason J. Choi, Koushil Sreenath, Claire J. Tomlin
Comments: * Indicate co-first authors. This is an extended version of the paper presented at L4DC 2024
Subjects: Machine Learning (cs.LG)
[637] arXiv:2402.05280 [pdf, html, other]
Title: No Dimensional Sampling Coresets for Classification
Meysam Alishahi, Jeff M. Phillips
Comments: 42 Pages
Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG)
[638] arXiv:2402.05284 [pdf, html, other]
Title: Analyzing Adversarial Inputs in Deep Reinforcement Learning
Davide Corsi, Guy Amir, Guy Katz, Alessandro Farinelli
Subjects: Machine Learning (cs.LG)
[639] arXiv:2402.05290 [pdf, other]
Title: Do Transformer World Models Give Better Policy Gradients?
Michel Ma, Tianwei Ni, Clement Gehring, Pierluca D'Oro, Pierre-Luc Bacon
Comments: Michel Ma and Pierluca D'Oro contributed equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[640] arXiv:2402.05291 [pdf, html, other]
Title: Graph convolutional network as a fast statistical emulator for numerical ice sheet modeling
Maryam Rahnemoonfar, Younghyun Koo
Comments: Accepted to Journal of Glaciology on November 20, 2024
Journal-ref: J. Glaciol. 71 (2025) e15
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[641] arXiv:2402.05293 [pdf, other]
Title: A comparative study on feature selection for a risk prediction model for colorectal cancer
N. Cueto-López, M. T. García-Ordás, V. Dávila-Batista, V. Moreno, N. Aragonés, R. Alaiz-Rodríguez
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[642] arXiv:2402.05294 [pdf, html, other]
Title: Examining Modality Incongruity in Multimodal Federated Learning for Medical Vision and Language-based Disease Detection
Pramit Saha, Divyanshu Mishra, Felix Wagner, Konstantinos Kamnitsas, J. Alison Noble
Comments: 42 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[643] arXiv:2402.05295 [pdf, other]
Title: An information theoretic approach to quantify the stability of feature selection and ranking algorithms
Alaiz-Rodriguez, R., Parnell, A. C
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[644] arXiv:2402.05296 [pdf, other]
Title: Classifying spam emails using agglomerative hierarchical clustering and a topic-based approach
F. Janez-Martino, R. Alaiz-Rodriguez, V. Gonzalez-Castro, E. Fidalgo, E. Alegre
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[645] arXiv:2402.05306 [pdf, html, other]
Title: Interactive Symbolic Regression through Offline Reinforcement Learning: A Co-Design Framework
Yuan Tian, Wenqi Zhou, Michele Viscione, Hao Dong, David Kammer, Olga Fink
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[646] arXiv:2402.05309 [pdf, html, other]
Title: Investigating Generalization Behaviours of Generative Flow Networks
Lazar Atanackovic, Emmanuel Bengio
Comments: Accepted to TMLR
Subjects: Machine Learning (cs.LG)
[647] arXiv:2402.05322 [pdf, html, other]
Title: Learning on Multimodal Graphs: A Survey
Ciyuan Peng, Jiayuan He, Feng Xia
Comments: 9 pages, 1 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Graphics (cs.GR); Social and Information Networks (cs.SI)
[648] arXiv:2402.05353 [pdf, html, other]
Title: Revisiting Early-Learning Regularization When Federated Learning Meets Noisy Labels
Taehyeon Kim, Donggyu Kim, Se-Young Yun
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[649] arXiv:2402.05356 [pdf, html, other]
Title: Exploring Learning Complexity for Efficient Downstream Dataset Pruning
Wenyu Jiang, Zhenlong Liu, Zejian Xie, Songxin Zhang, Bingyi Jing, Hongxin Wei
Comments: Accepted by ICLR 2025
Subjects: Machine Learning (cs.LG)
[650] arXiv:2402.05367 [pdf, html, other]
Title: Principled Preferential Bayesian Optimization
Wenjie Xu, Wenbin Wang, Yuning Jiang, Bratislav Svetozarevic, Colin N. Jones
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG)
Total of 3960 entries : 1-250 251-500 401-650 501-750 751-1000 1001-1250 ... 3751-3960
Showing up to 250 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack