Machine Learning

Authors and titles for February 2024

Total of 3960 entries : 1-100 101-200 201-300 301-400 401-500 501-600 601-700 701-800 ... 3901-3960

Showing up to 100 entries per page: fewer | more | all

[401] arXiv:2402.03264 [pdf, html, other]: Title: MobilityGPT: Enhanced Human Mobility Modeling with a GPT model

Ammar Haydari, Dongjie Chen, Zhengfeng Lai, Michael Zhang, Chen-Nee Chuah

Subjects: Machine Learning (cs.LG)
[402] arXiv:2402.03268 [pdf, html, other]: Title: Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation

Xinyi Wang, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang

Comments: Accepted to ICML 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[403] arXiv:2402.03270 [pdf, other]: Title: Multiclass Classification Procedure for Detecting Attacks on MQTT-IoT Protocol

Hector Alaiz-Moreton (1), Jose Aveleira-Mata (2), Jorge Ondicol-Garcia (2), Angel Luis Muñoz-Castañeda (2), Isaías García (1), Carmen Benavides (1) ((1) Escuela de Ingenierías, Universidad de León, (2) Research Institute of Applied Sciences in Cybersecurity, Universidad de León)

Journal-ref: Complexity (New York, N.Y.), 2019, Vol.2019, p.1-11

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[404] arXiv:2402.03282 [pdf, html, other]: Title: A Theoretical Framework for Partially Observed Reward-States in RLHF

Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

Comments: 64 pages. 14 pages for main paper, 50 pages for references + appendix

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[405] arXiv:2402.03287 [pdf, html, other]: Title: A Lennard-Jones Layer for Distribution Normalization

Mulun Na, Jonathan Klein, Biao Zhang, Wojtek Pałubicki, Sören Pirk, Dominik L. Michels

Comments: Upon request, we are happy to share the source code to generate the results presented in this paper. Please contact the first or the last author of this manuscript

Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[406] arXiv:2402.03289 [pdf, other]: Title: Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS

Matthew DeLorenzo, Animesh Basak Chowdhury, Vasudev Gohil, Shailja Thakur, Ramesh Karri, Siddharth Garg, Jeyavijayan Rajendran

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[407] arXiv:2402.03292 [pdf, html, other]: Title: Detecting Out-of-Distribution Objects through Class-Conditioned Inpainting

Quang-Huy Nguyen, Jin Peng Zhou, Zhenzhen Liu, Khanh-Huyen Bui, Kilian Q. Weinberger, Wei-Lun Chao, Dung D. Le

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2402.03293 [pdf, html, other]: Title: Flora: Low-Rank Adapters Are Secretly Gradient Compressors

Yongchang Hao, Yanshuai Cao, Lili Mou

Comments: Accepted @ ICML 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[409] arXiv:2402.03295 [pdf, other]: Title: Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks

Yongchang Hao, Yanshuai Cao, Lili Mou

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[410] arXiv:2402.03299 [pdf, other]: Title: GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models

Haibo Jin, Ruoxi Chen, Peiyan Zhang, Andy Zhou, Yang Zhang, Haohan Wang

Comments: 28 papges

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2402.03305 [pdf, html, other]: Title: Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?

Qiyao Liang, Ziming Liu, Ila Fiete

Comments: 13 pages, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2402.03386 [pdf, other]: Title: A generalized decision tree ensemble based on the NeuralNetworks architecture: Distributed Gradient Boosting Forest (DGBF)

Ángel Delgado-Panadero, José Alberto Benítez-Andrades, María Teresa García-Ordás

Journal-ref: Applied Intelligence, Volume 53, July 2023, pages 22991-23003

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[413] arXiv:2402.03448 [pdf, other]: Title: Decentralized Sporadic Federated Learning: A Unified Algorithmic Framework with Convergence Guarantees

Shahryar Zehtabi, Dong-Jun Han, Rohit Parasnis, Seyyedali Hosseinalipour, Christopher G. Brinton

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[414] arXiv:2402.03457 [pdf, html, other]: Title: Efficient and Interpretable Traffic Destination Prediction using Explainable Boosting Machines

Yasin Yousif, Jörg Müller

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[415] arXiv:2402.03467 [pdf, html, other]: Title: Stochastic Modified Flows for Riemannian Stochastic Gradient Descent

Benjamin Gess, Sebastian Kassing, Nimit Rana

Journal-ref: SIAM J. Control Optim. 62(6): 3288-3314 (2024)

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
[416] arXiv:2402.03468 [pdf, html, other]: Title: Exact Tensor Completion Powered by Slim Transforms

Li Ge, Lin Chen, Yudong Chen, Xue Jiang

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[417] arXiv:2402.03469 [pdf, html, other]: Title: Rethinking the Role of Proxy Rewards in Language Model Alignment

Sungdong Kim, Minjoon Seo

Comments: Accepted to EMNLP 2024 main conference

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[418] arXiv:2402.03471 [pdf, html, other]: Title: The Information of Large Language Model Geometry

Zhiquan Tan, Chenghai Li, Weiran Huang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
[419] arXiv:2402.03478 [pdf, html, other]: Title: Estimating Epistemic and Aleatoric Uncertainty with a Single Model

Matthew A. Chan, Maria J. Molina, Christopher A. Metzler

Comments: 19 pages, 11 figures. To be published in Conference on Neural Information Processing Systems (NeurIPS) 2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[420] arXiv:2402.03479 [pdf, html, other]: Title: DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design

Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht

Comments: To appear in ICML 2024. A preliminary version of this work (arXiv:2310.03494) was presented at the ALOE workshop, NeurIPS 2023. arXiv admin note: text overlap with arXiv:2310.03494

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[421] arXiv:2402.03480 [pdf, html, other]: Title: Trillion Parameter AI Serving Infrastructure for Scientific Discovery: A Survey and Vision

Nathaniel Hudson, J. Gregory Pauloski, Matt Baughman, Alok Kamatar, Mansi Sakarvadia, Logan Ward, Ryan Chard, André Bauer, Maksim Levental, Wenyi Wang, Will Engler, Owen Price Skelly, Ben Blaiszik, Rick Stevens, Kyle Chard, Ian Foster

Comments: 10 pages, 3 figures, accepted for publication in the proceedings of the 10th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT2023)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[422] arXiv:2402.03486 [pdf, html, other]: Title: Early prediction of onset of sepsis in Clinical Setting

Fahim Mohammad, Lakshmi Arunachalam, Samanway Sadhu, Boudewijn Aasman, Shweta Garg, Adil Ahmed, Silvie Colman, Meena Arunachalam, Sudhir Kulkarni, Parsa Mirhaji

Comments: 16 pages, 6 figures and 7 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[423] arXiv:2402.03495 [pdf, html, other]: Title: Partially Stochastic Infinitely Deep Bayesian Neural Networks

Sergio Calvo-Ordonez, Matthieu Meunier, Francesco Piatti, Yuantao Shi

Comments: 17 pages including supplementary material. Accepted at ICML 2024

Subjects: Machine Learning (cs.LG); Probability (math.PR)
[424] arXiv:2402.03496 [pdf, html, other]: Title: Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Wu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani

Comments: A long version of the ICML 2024 paper. Updated the caption of Fig 4 to emphasize the importance of the scale invariance of root-free methods

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[425] arXiv:2402.03502 [pdf, html, other]: Title: How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

Xuefeng Du, Zhen Fang, Ilias Diakonikolas, Yixuan Li

Comments: ICLR 2024

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[426] arXiv:2402.03525 [pdf, html, other]: Title: Deep Reinforcement Learning for Picker Routing Problem in Warehousing

George Dunn, Hadi Charkhgard, Ali Eshragh, Sasan Mahmoudinazlou, Elizabeth Stojanovski

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[427] arXiv:2402.03531 [pdf, other]: Title: Fairness and Privacy Guarantees in Federated Contextual Bandits

Sambhav Solanki, Shweta Jain, Sujit Gujar

Comments: 16 pages, 2 figures

Subjects: Machine Learning (cs.LG)
[428] arXiv:2402.03540 [pdf, html, other]: Title: Regulation Games for Trustworthy Machine Learning

Mohammad Yaghini, Patty Liu, Franziska Boenisch, Nicolas Papernot

Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Machine Learning (stat.ML)
[429] arXiv:2402.03541 [pdf, html, other]: Title: HAMLET: Graph Transformer Neural Operator for Partial Differential Equations

Andrey Bryutkin, Jiahao Huang, Zhongying Deng, Guang Yang, Carola-Bibiane Schönlieb, Angelica Aviles-Rivero

Comments: 18 pages, 7 figures, 6 tables

Journal-ref: Proceedings of Machine Learning Research, Vol. 235, pp. 4624-4641, 2024

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[430] arXiv:2402.03545 [pdf, html, other]: Title: Online Feature Updates Improve Online (Generalized) Label Shift Adaptation

Ruihan Wu, Siddhartha Datta, Yi Su, Dheeraj Baby, Yu-Xiang Wang, Kilian Q. Weinberger

Subjects: Machine Learning (cs.LG)
[431] arXiv:2402.03548 [pdf, html, other]: Title: Single-GPU GNN Systems: Traps and Pitfalls

Yidong Gong, Arnab Tarafder, Saima Afrin, Pradeep Kumar

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[432] arXiv:2402.03558 [pdf, html, other]: Title: Path Signatures and Graph Neural Networks for Slow Earthquake Analysis: Better Together?

Hans Riess, Manolis Veveakis, Michael M. Zavlanos

Subjects: Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[433] arXiv:2402.03559 [pdf, html, other]: Title: Constrained Synthesis with Projected Diffusion Models

Jacob K Christopher, Stephen Baek, Ferdinando Fioretto

Comments: Published at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[434] arXiv:2402.03563 [pdf, html, other]: Title: Distinguishing the Knowable from the Unknowable with Language Models

Gustaf Ahdritz, Tian Qin, Nikhil Vyas, Boaz Barak, Benjamin L. Edelman

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[435] arXiv:2402.03564 [pdf, html, other]: Title: SkipPredict: When to Invest in Predictions for Scheduling

Rana Shahout, Michael Mitzenmacher

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[436] arXiv:2402.03570 [pdf, html, other]: Title: Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning

Zihan Ding, Amy Zhang, Yuandong Tian, Qinqing Zheng

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[437] arXiv:2402.03576 [pdf, html, other]: Title: Generalization Properties of Adversarial Training for $\ell_0$-Bounded Adversarial Attacks

Payam Delgosha, Hamed Hassani, Ramtin Pedarsani

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[438] arXiv:2402.03577 [pdf, html, other]: Title: Revisiting the Dataset Bias Problem from a Statistical Perspective

Kien Do, Dung Nguyen, Hung Le, Thao Le, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh

Subjects: Machine Learning (cs.LG)
[439] arXiv:2402.03579 [pdf, html, other]: Title: Deconstructing the Goldilocks Zone of Neural Network Initialization

Artem Vysogorets, Anna Dawid, Julia Kempe

Journal-ref: Proceedings of the 41st International Conference on Machine Learning, PMLR (2024) 235:49717-49732

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[440] arXiv:2402.03587 [pdf, html, other]: Title: Information-Theoretic Active Correlation Clustering

Linus Aronsson, Morteza Haghir Chehreghani

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[441] arXiv:2402.03588 [pdf, html, other]: Title: Continual Domain Adversarial Adaptation via Double-Head Discriminators

Yan Shen, Zhanghexuan Ji, Chunwei Ma, Mingchen Gao

Comments: AISTATS 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[442] arXiv:2402.03589 [pdf, html, other]: Title: A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System

Jiaqi Liang, Sanjay Dominik Jena, Defeng Liu, Andrea Lodi

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[443] arXiv:2402.03590 [pdf, html, other]: Title: Assessing the Impact of Distribution Shift on Reinforcement Learning Performance

Ted Fujimoto, Joshua Suetterlein, Samrat Chatterjee, Auroop Ganguly

Comments: Poster at the Workshop on Regulatable Machine Learning at the 37th Conference on Neural Information Processing Systems (RegML @ NeurIPS 2023)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[444] arXiv:2402.03610 [pdf, html, other]: Title: RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

Tomoyuki Kagaya, Thong Jing Yuan, Yuxuan Lou, Jayashree Karlekar, Sugiri Pranata, Akira Kinose, Koki Oguri, Felix Wick, Yang You

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[445] arXiv:2402.03614 [pdf, html, other]: Title: Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs

He Zhao, Vassili Kitsios, Terence J. O'Kane, Edwin V. Bonilla

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[446] arXiv:2402.03621 [pdf, html, other]: Title: Neural Network Approximators for Marginal MAP in Probabilistic Circuits

Shivvrat Arya, Tahrima Rahman, Vibhav Gogate

Comments: Will appear in AAAI 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[447] arXiv:2402.03625 [pdf, html, other]: Title: Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time

Sungyoon Kim, Mert Pilanci

Comments: Version 2: Fixed proof of Thm 4.4, slight clarification on assumption 2 Version 3: Modified to ICML style and slight clarification on assumption 1

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[448] arXiv:2402.03629 [pdf, html, other]: Title: Disparate Impact on Group Accuracy of Linearization for Private Inference

Saswat Das, Marco Romanelli, Ferdinando Fioretto

Comments: Extended version of the paper accepted to appear at the Forty-first International Conference on Machine Learning (ICML) 2024

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[449] arXiv:2402.03646 [pdf, other]: Title: Lens: A Foundation Model for Network Traffic

Qineng Wang, Chen Qian, Xiaochang Li, Ziyu Yao, Gang Zhou, Huajie Shao

Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[450] arXiv:2402.03647 [pdf, html, other]: Title: CAMBranch: Contrastive Learning with Augmented MILPs for Branching

Jiacheng Lin, Meng Xu, Zhihua Xiong, Huangang Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[451] arXiv:2402.03655 [pdf, html, other]: Title: Operator SVD with Neural Networks via Nested Low-Rank Approximation

J. Jon Ryu, Xiangxiang Xu, H. S. Melihcan Erol, Yuheng Bu, Lizhong Zheng, Gregory W. Wornell

Comments: 36 pages, 7 figures. ICML 2024. Almost identical to the conference version, except a few updates for fixing typos and mistakes

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[452] arXiv:2402.03659 [pdf, html, other]: Title: Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models

Kelvin J.L. Koa, Yunshan Ma, Ritchie Ng, Tat-Seng Chua

Comments: WWW 2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Statistical Finance (q-fin.ST)
[453] arXiv:2402.03660 [pdf, html, other]: Title: On the Emergence of Cross-Task Linearity in the Pretraining-Finetuning Paradigm

Zhanpeng Zhou, Zijun Chen, Yilan Chen, Bo Zhang, Junchi Yan

Comments: 31 pages, 24 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[454] arXiv:2402.03661 [pdf, html, other]: Title: Transductive Reward Inference on Graph

Bohao Qu, Xiaofeng Cao, Qing Guo, Yi Chang, Ivor W. Tsang, Chengqi Zhang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[455] arXiv:2402.03663 [pdf, html, other]: Title: Symbol Correctness in Deep Neural Networks Containing Symbolic Layers

Aaron Bembenek, Toby Murray

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[456] arXiv:2402.03664 [pdf, html, other]: Title: Partial Gromov-Wasserstein Metric

Yikun Bai, Rocio Diaz Martin, Abihith Kothapalli, Hengrong Du, Xinran Liu, Soheil Kolouri

Comments: Published at ICLR 2025

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[457] arXiv:2402.03687 [pdf, html, other]: Title: Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation

Lingxiao Zhao, Xueying Ding, Leman Akoglu

Comments: Diffusion Model on Graphs

Journal-ref: NeurIPS 2024

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[458] arXiv:2402.03698 [pdf, other]: Title: Estimating the Local Learning Coefficient at Scale

Zach Furman, Edmund Lau

Comments: This paper has been expanded and merged with arXiv:2308.12108 to form a more comprehensive study. Please refer to the latest version of that preprint for the most up-to-date manuscript

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[459] arXiv:2402.03701 [pdf, html, other]: Title: Unified Discrete Diffusion for Categorical Data

Lingxiao Zhao, Xueying Ding, Lijun Yu, Leman Akoglu

Comments: Unify Discrete Denoising Diffusion

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[460] arXiv:2402.03715 [pdf, html, other]: Title: Clarify: Improving Model Robustness With Natural Language Corrections

Yoonho Lee, Michelle S. Lam, Helena Vasconcelos, Michael S. Bernstein, Chelsea Finn

Comments: UIST 2024. Interface code available at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[461] arXiv:2402.03720 [pdf, other]: Title: Similarity-based Neighbor Selection for Graph LLMs

Rui Li, Jiwei Li, Jiawei Han, Guoyin Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[462] arXiv:2402.03726 [pdf, html, other]: Title: Learning Granger Causality from Instance-wise Self-attentive Hawkes Processes

Dongxia Wu, Tsuyoshi Idé, Aurélie Lozano, Georgios Kollias, Jiří Navrátil, Naoki Abe, Yi-An Ma, Rose Yu

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[463] arXiv:2402.03737 [pdf, html, other]: Title: Differentially Private High Dimensional Bandits

Apurv Shukla

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Systems and Control (eess.SY); Optimization and Control (math.OC); Machine Learning (stat.ML)
[464] arXiv:2402.03741 [pdf, html, other]: Title: SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems

Oubo Ma, Yuwen Pu, Linkang Du, Yang Dai, Ruo Wang, Xiaolei Liu, Yingcai Wu, Shouling Ji

Comments: To appear in the ACM Conference on Computer and Communications Security (CCS'24), October 14-18, 2024, Salt Lake City, UT, USA

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[465] arXiv:2402.03747 [pdf, other]: Title: An invariance constrained deep learning network for PDE discovery

Chao Chen, Hui Li, Xiaowei Jin

Subjects: Machine Learning (cs.LG)
[466] arXiv:2402.03750 [pdf, other]: Title: Digital Twin Mobility Profiling: A Spatio-Temporal Graph Learning Approach

Xin Chen, Mingliang Hou, Tao Tang, Achhardeep Kaur, Feng Xia

Comments: 10 pages, 7 figures

Journal-ref: The 7th IEEE International Conference on Data Science and Systems (DSS), Dec 20 - 22, 2021, Haikou, China

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[467] arXiv:2402.03753 [pdf, other]: Title: Enhanced sampling of robust molecular datasets with uncertainty-based collective variables

Aik Rui Tan, Johannes C. B. Dietschreit, Rafael Gomez-Bombarelli

Comments: 13 pages, 4 figures, 10 pages of Supplementary Information

Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[468] arXiv:2402.03770 [pdf, other]: Title: Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes

Xiaoxin Su, Yipeng Zhou, Laizhong Cui, John C.S. Lui, Jiangchuan Liu

Comments: To appear in 2024 IEEE International Conference on Computer Communications(INFOCOM 2024)

Subjects: Machine Learning (cs.LG)
[469] arXiv:2402.03771 [pdf, html, other]: Title: Reinforcement Learning from Bagged Reward

Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama

Subjects: Machine Learning (cs.LG)
[470] arXiv:2402.03774 [pdf, html, other]: Title: Learning a Decision Tree Algorithm with Transformers

Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[471] arXiv:2402.03784 [pdf, html, other]: Title: AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction

Kethmi Hirushini Hettige, Jiahao Ji, Shili Xiang, Cheng Long, Gao Cong, Jingyuan Wang

Comments: Accepted by the 12th International Conference on Learning Representations (ICLR 2024)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applied Physics (physics.app-ph)
[472] arXiv:2402.03785 [pdf, other]: Title: Weakly Supervised Anomaly Detection via Knowledge-Data Alignment

Haihong Zhao, Chenyi Zi, Yang Liu, Chen Zhang, Yan Zhou, Jia Li

Comments: Accepted by WWW 2024

Subjects: Machine Learning (cs.LG)
[473] arXiv:2402.03792 [pdf, other]: Title: No-Regret Reinforcement Learning in Smooth MDPs

Davide Maran, Alberto Maria Metelli, Matteo Papini, Marcello Restell

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[474] arXiv:2402.03804 [pdf, other]: Title: ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs

Zhengyan Zhang, Yixin Song, Guanghui Yu, Xu Han, Yankai Lin, Chaojun Xiao, Chenyang Song, Zhiyuan Liu, Zeyu Mi, Maosong Sun

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[475] arXiv:2402.03807 [pdf, html, other]: Title: SEABO: A Simple Search-Based Method for Offline Imitation Learning

Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu

Comments: To appear in ICLR2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[476] arXiv:2402.03814 [pdf, other]: Title: Masked Graph Autoencoder with Non-discrete Bandwidths

Ziwen Zhao, Yuhua Li, Yixiong Zou, Jiliang Tang, Ruixuan Li

Comments: Full version (17 pages, 8 figures, 12 tables), accepted by TheWebConf 2024 (WWW 2024)

Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[477] arXiv:2402.03815 [pdf, other]: Title: Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression

Xiaoxin Su, Yipeng Zhou, Laizhong Cui, Song Guo

Comments: To appear in 2024 IEEE International Conference on Computer Communications(INFOCOM 2024)

Subjects: Machine Learning (cs.LG)
[478] arXiv:2402.03818 [pdf, html, other]: Title: Asymptotic generalization error of a single-layer graph convolutional network

O. Duranthon, L. Zdeborová

Journal-ref: Proceedings of the Third Learning on Graphs Conference (LoG 2024), PMLR 269

Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn)
[479] arXiv:2402.03828 [pdf, other]: Title: Estimating Barycenters of Distributions with Neural Optimal Transport

Alexander Kolesov, Petr Mokrov, Igor Udovichenko, Milena Gazdieva, Gudmund Pammer, Evgeny Burnaev, Alexander Korotin

Subjects: Machine Learning (cs.LG)
[480] arXiv:2402.03845 [pdf, other]: Title: On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models

Christian Horvat, Jean-Pascal Pfister

Subjects: Machine Learning (cs.LG)
[481] arXiv:2402.03846 [pdf, html, other]: Title: Efficient Generation of Hidden Outliers for Improved Outlier Detection

Jose Cribeiro-Ramallo, Vadim Arzamasov, Klemens Böhm

Comments: Preprint. Full paper is scheduled to appear in TKDD; Updated results in table 4

Subjects: Machine Learning (cs.LG)
[482] arXiv:2402.03855 [pdf, html, other]: Title: Challenges in Mechanistically Interpreting Model Representations

Satvik Golechha, James Dao

Comments: 9 pages, ICML 2024 Workshop on Mechanistic Interpretability

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[483] arXiv:2402.03864 [pdf, html, other]: Title: The Challenges of the Nonlinear Regime for Physics-Informed Neural Networks

Andrea Bonfanti, Giuseppe Bruno, Cristina Cipriani

Comments: 10 pages, 4 figures, appendix of 12 additional pages

Subjects: Machine Learning (cs.LG)
[484] arXiv:2402.03885 [pdf, html, other]: Title: MOMENT: A Family of Open Time-series Foundation Models

Mononito Goswami, Konrad Szafer, Arjun Choudhry, Yifu Cai, Shuo Li, Artur Dubrawski

Comments: Accepted at ICML'24. This is a revision. See changelog in the Appendix

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[485] arXiv:2402.03902 [pdf, html, other]: Title: A phase transition between positional and semantic learning in a solvable model of dot-product attention

Hugo Cui, Freya Behrens, Florent Krzakala, Lenka Zdeborová

Journal-ref: Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

Subjects: Machine Learning (cs.LG)
[486] arXiv:2402.03903 [pdf, html, other]: Title: Averaging $n$-step Returns Reduces Variance in Reinforcement Learning

Brett Daley, Martha White, Marlos C. Machado

Comments: ICML 2024. 27 pages, 7 figures, 3 tables

Subjects: Machine Learning (cs.LG)
[487] arXiv:2402.03905 [pdf, other]: Title: Employee Turnover Analysis Using Machine Learning Algorithms

Mahyar Karimi, Kamyar Seyedkazem Viliyani

Comments: 6 pages, 11 feagures, 2 tables

Subjects: Machine Learning (cs.LG)
[488] arXiv:2402.03915 [pdf, html, other]: Title: Learning Metrics that Maximise Power for Accelerated A/B-Tests

Olivier Jeunen, Aleksei Ustimenko

Comments: To appear in the Applied Data Science track at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24)

Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Applications (stat.AP); Machine Learning (stat.ML)
[489] arXiv:2402.03921 [pdf, other]: Title: Large Language Models to Enhance Bayesian Optimization

Tennison Liu, Nicolás Astorga, Nabeel Seedat, Mihaela van der Schaar

Comments: Accepted as Poster at ICLR2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[490] arXiv:2402.03923 [pdf, html, other]: Title: Return-Aligned Decision Transformer

Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra

Subjects: Machine Learning (cs.LG)
[491] arXiv:2402.03941 [pdf, html, other]: Title: Discovery of the Hidden World with Large Language Models

Chenxi Liu, Yongqiang Chen, Tongliang Liu, Mingming Gong, James Cheng, Bo Han, Kun Zhang

Comments: NeurIPS 2024; Chenxi and Yongqiang contributed equally; 59 pages, 72 figures; Project page: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[492] arXiv:2402.03966 [pdf, other]: Title: On dimensionality of feature vectors in MPNNs

César Bravo, Alexander Kozachinskiy, Cristóbal Rojas

Comments: 15 pages, 2 figures. Changes to the previous version: added reference to Amir et al.~(NeurIPS'23)

Subjects: Machine Learning (cs.LG)
[493] arXiv:2402.03969 [pdf, other]: Title: In-context learning agents are asymmetric belief updaters

Johannes A. Schubert, Akshay K. Jagadish, Marcel Binz, Eric Schulz

Subjects: Machine Learning (cs.LG)
[494] arXiv:2402.03970 [pdf, html, other]: Title: Is Deep Learning finally better than Decision Trees on Tabular Data?

Guri Zabërgja, Arlind Kadra, Christian M. M. Frey, Josif Grabocka

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[495] arXiv:2402.03979 [pdf, html, other]: Title: Cross Entropy versus Label Smoothing: A Neural Collapse Perspective

Li Guo, George Andriopoulos, Zifan Zhao, Shuyang Ling, Zixuan Dong, Keith Ross

Journal-ref: Published in Transactions on Machine Learning Research(05/2025)

Subjects: Machine Learning (cs.LG)
[496] arXiv:2402.03985 [pdf, html, other]: Title: A Bias-Variance Decomposition for Ensembles over Multiple Synthetic Datasets

Ossi Räisä, Antti Honkela

Comments: AISTATS 2025

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[497] arXiv:2402.03991 [pdf, html, other]: Title: Provable Emergence of Deep Neural Collapse and Low-Rank Bias in $L^2$-Regularized Nonlinear Networks

Emanuele Zangrando, Piero Deidda, Simone Brugiapaglia, Nicola Guglielmi, Francesco Tudisco

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[498] arXiv:2402.03992 [pdf, html, other]: Title: Space Group Constrained Crystal Generation

Rui Jiao, Wenbing Huang, Yu Liu, Deli Zhao, Yang Liu

Comments: ICLR 2024 poster

Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[499] arXiv:2402.03994 [pdf, html, other]: Title: Efficient Sketches for Training Data Attribution and Studying the Loss Landscape

Andrea Schioppa

Journal-ref: Neurips 2024

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[500] arXiv:2402.04004 [pdf, html, other]: Title: Understanding the Effect of Noise in LLM Training Data with Algorithmic Chains of Thought

Alex Havrilla, Maia Iyer

Subjects: Machine Learning (cs.LG)

Total of 3960 entries : 1-100 101-200 201-300 301-400 401-500 501-600 601-700 701-800 ... 3901-3960

Showing up to 100 entries per page: fewer | more | all