Machine Learning

Authors and titles for February 2024

Total of 3960 entries : 1-50 ... 251-300 301-350 351-400 401-450 451-500 501-550 551-600 ... 3951-3960
[401] arXiv:2402.03264 [pdf, html, other]
Title: MobilityGPT: Enhanced Human Mobility Modeling with a GPT model
Ammar Haydari, Dongjie Chen, Zhengfeng Lai, Michael Zhang, Chen-Nee Chuah
Subjects: Machine Learning (cs.LG)
[402] arXiv:2402.03268 [pdf, html, other]
Title: Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
Xinyi Wang, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[403] arXiv:2402.03270 [pdf, other]
Title: Multiclass Classification Procedure for Detecting Attacks on MQTT-IoT Protocol
Hector Alaiz-Moreton (1), Jose Aveleira-Mata (2), Jorge Ondicol-Garcia (2), Angel Luis Muñoz-Castañeda (2), Isaías García (1), Carmen Benavides (1) ((1) Escuela de Ingenierías, Universidad de León, (2) Research Institute of Applied Sciences in Cybersecurity, Universidad de León)
Journal-ref: Complexity (New York, N.Y.), 2019, Vol.2019, p.1-11
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[404] arXiv:2402.03282 [pdf, html, other]
Title: A Theoretical Framework for Partially Observed Reward-States in RLHF
Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari
Comments: 64 pages. 14 pages for main paper, 50 pages for references + appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[405] arXiv:2402.03287 [pdf, html, other]
Title: A Lennard-Jones Layer for Distribution Normalization
Mulun Na, Jonathan Klein, Biao Zhang, Wojtek Pałubicki, Sören Pirk, Dominik L. Michels
Comments: Upon request, we are happy to share the source code to generate the results presented in this paper. Please contact the first or the last author of this manuscript
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[406] arXiv:2402.03289 [pdf, other]
Title: Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS
Matthew DeLorenzo, Animesh Basak Chowdhury, Vasudev Gohil, Shailja Thakur, Ramesh Karri, Siddharth Garg, Jeyavijayan Rajendran
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[407] arXiv:2402.03292 [pdf, html, other]
Title: Detecting Out-of-Distribution Objects through Class-Conditioned Inpainting
Quang-Huy Nguyen, Jin Peng Zhou, Zhenzhen Liu, Khanh-Huyen Bui, Kilian Q. Weinberger, Wei-Lun Chao, Dung D. Le
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2402.03293 [pdf, html, other]
Title: Flora: Low-Rank Adapters Are Secretly Gradient Compressors
Yongchang Hao, Yanshuai Cao, Lili Mou
Comments: Accepted @ ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[409] arXiv:2402.03295 [pdf, other]
Title: Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks
Yongchang Hao, Yanshuai Cao, Lili Mou
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[410] arXiv:2402.03299 [pdf, other]
Title: GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models
Haibo Jin, Ruoxi Chen, Peiyan Zhang, Andy Zhou, Yang Zhang, Haohan Wang
Comments: 28 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2402.03305 [pdf, html, other]
Title: Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?
Qiyao Liang, Ziming Liu, Ila Fiete
Comments: 13 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2402.03386 [pdf, other]
Title: A generalized decision tree ensemble based on the NeuralNetworks architecture: Distributed Gradient Boosting Forest (DGBF)
Ángel Delgado-Panadero, José Alberto Benítez-Andrades, María Teresa García-Ordás
Journal-ref: Applied Intelligence, Volume 53, July 2023, pages 22991-23003
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[413] arXiv:2402.03448 [pdf, other]
Title: Decentralized Sporadic Federated Learning: A Unified Algorithmic Framework with Convergence Guarantees
Shahryar Zehtabi, Dong-Jun Han, Rohit Parasnis, Seyyedali Hosseinalipour, Christopher G. Brinton
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[414] arXiv:2402.03457 [pdf, html, other]
Title: Efficient and Interpretable Traffic Destination Prediction using Explainable Boosting Machines
Yasin Yousif, Jörg Müller
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[415] arXiv:2402.03467 [pdf, html, other]
Title: Stochastic Modified Flows for Riemannian Stochastic Gradient Descent
Benjamin Gess, Sebastian Kassing, Nimit Rana
Journal-ref: SIAM J. Control Optim. 62(6): 3288-3314 (2024)
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
[416] arXiv:2402.03468 [pdf, html, other]
Title: Exact Tensor Completion Powered by Slim Transforms
Li Ge, Lin Chen, Yudong Chen, Xue Jiang
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[417] arXiv:2402.03469 [pdf, html, other]
Title: Rethinking the Role of Proxy Rewards in Language Model Alignment
Sungdong Kim, Minjoon Seo
Comments: Accepted to EMNLP 2024 main conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[418] arXiv:2402.03471 [pdf, html, other]
Title: The Information of Large Language Model Geometry
Zhiquan Tan, Chenghai Li, Weiran Huang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
[419] arXiv:2402.03478 [pdf, html, other]
Title: Estimating Epistemic and Aleatoric Uncertainty with a Single Model
Matthew A. Chan, Maria J. Molina, Christopher A. Metzler
Comments: 19 pages, 11 figures. To be published in Conference on Neural Information Processing Systems (NeurIPS) 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[420] arXiv:2402.03479 [pdf, html, other]
Title: DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
Comments: To appear in ICML 2024. A preliminary version of this work (arXiv:2310.03494) was presented at the ALOE workshop, NeurIPS 2023. arXiv admin note: text overlap with arXiv:2310.03494
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[421] arXiv:2402.03480 [pdf, html, other]
Title: Trillion Parameter AI Serving Infrastructure for Scientific Discovery: A Survey and Vision
Nathaniel Hudson, J. Gregory Pauloski, Matt Baughman, Alok Kamatar, Mansi Sakarvadia, Logan Ward, Ryan Chard, André Bauer, Maksim Levental, Wenyi Wang, Will Engler, Owen Price Skelly, Ben Blaiszik, Rick Stevens, Kyle Chard, Ian Foster
Comments: 10 pages, 3 figures, accepted for publication in the proceedings of the 10th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[422] arXiv:2402.03486 [pdf, html, other]
Title: Early prediction of onset of sepsis in Clinical Setting
Fahim Mohammad, Lakshmi Arunachalam, Samanway Sadhu, Boudewijn Aasman, Shweta Garg, Adil Ahmed, Silvie Colman, Meena Arunachalam, Sudhir Kulkarni, Parsa Mirhaji
Comments: 16 pages, 6 figures and 7 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[423] arXiv:2402.03495 [pdf, html, other]
Title: Partially Stochastic Infinitely Deep Bayesian Neural Networks
Sergio Calvo-Ordonez, Matthieu Meunier, Francesco Piatti, Yuantao Shi
Comments: 17 pages including supplementary material. Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Probability (math.PR)
[424] arXiv:2402.03496 [pdf, html, other]
Title: Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
Wu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani
Comments: A long version of the ICML 2024 paper. Updated the caption of Fig 4 to emphasize the importance of the scale invariance of root-free methods
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[425] arXiv:2402.03502 [pdf, html, other]
Title: How Does Unlabeled Data Provably Help Out-of-Distribution Detection?
Xuefeng Du, Zhen Fang, Ilias Diakonikolas, Yixuan Li
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[426] arXiv:2402.03525 [pdf, html, other]
Title: Deep Reinforcement Learning for Picker Routing Problem in Warehousing
George Dunn, Hadi Charkhgard, Ali Eshragh, Sasan Mahmoudinazlou, Elizabeth Stojanovski
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[427] arXiv:2402.03531 [pdf, other]
Title: Fairness and Privacy Guarantees in Federated Contextual Bandits
Sambhav Solanki, Shweta Jain, Sujit Gujar
Comments: 16 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[428] arXiv:2402.03540 [pdf, html, other]
Title: Regulation Games for Trustworthy Machine Learning
Mohammad Yaghini, Patty Liu, Franziska Boenisch, Nicolas Papernot
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Machine Learning (stat.ML)
[429] arXiv:2402.03541 [pdf, html, other]
Title: HAMLET: Graph Transformer Neural Operator for Partial Differential Equations
Andrey Bryutkin, Jiahao Huang, Zhongying Deng, Guang Yang, Carola-Bibiane Schönlieb, Angelica Aviles-Rivero
Comments: 18 pages, 7 figures, 6 tables
Journal-ref: Proceedings of Machine Learning Research, Vol. 235, pp. 4624-4641, 2024
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[430] arXiv:2402.03545 [pdf, html, other]
Title: Online Feature Updates Improve Online (Generalized) Label Shift Adaptation
Ruihan Wu, Siddhartha Datta, Yi Su, Dheeraj Baby, Yu-Xiang Wang, Kilian Q. Weinberger
Subjects: Machine Learning (cs.LG)
[431] arXiv:2402.03548 [pdf, html, other]
Title: Single-GPU GNN Systems: Traps and Pitfalls
Yidong Gong, Arnab Tarafder, Saima Afrin, Pradeep Kumar
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[432] arXiv:2402.03558 [pdf, html, other]
Title: Path Signatures and Graph Neural Networks for Slow Earthquake Analysis: Better Together?
Hans Riess, Manolis Veveakis, Michael M. Zavlanos
Subjects: Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[433] arXiv:2402.03559 [pdf, html, other]
Title: Constrained Synthesis with Projected Diffusion Models
Jacob K Christopher, Stephen Baek, Ferdinando Fioretto
Comments: Published at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[434] arXiv:2402.03563 [pdf, html, other]
Title: Distinguishing the Knowable from the Unknowable with Language Models
Gustaf Ahdritz, Tian Qin, Nikhil Vyas, Boaz Barak, Benjamin L. Edelman
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[435] arXiv:2402.03564 [pdf, html, other]
Title: SkipPredict: When to Invest in Predictions for Scheduling
Rana Shahout, Michael Mitzenmacher
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[436] arXiv:2402.03570 [pdf, html, other]
Title: Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding, Amy Zhang, Yuandong Tian, Qinqing Zheng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[437] arXiv:2402.03576 [pdf, html, other]
Title: Generalization Properties of Adversarial Training for $\ell_0$-Bounded Adversarial Attacks
Payam Delgosha, Hamed Hassani, Ramtin Pedarsani
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[438] arXiv:2402.03577 [pdf, html, other]
Title: Revisiting the Dataset Bias Problem from a Statistical Perspective
Kien Do, Dung Nguyen, Hung Le, Thao Le, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh
Subjects: Machine Learning (cs.LG)
[439] arXiv:2402.03579 [pdf, html, other]
Title: Deconstructing the Goldilocks Zone of Neural Network Initialization
Artem Vysogorets, Anna Dawid, Julia Kempe
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, PMLR (2024) 235:49717-49732
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[440] arXiv:2402.03587 [pdf, html, other]
Title: Information-Theoretic Active Correlation Clustering
Linus Aronsson, Morteza Haghir Chehreghani
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[441] arXiv:2402.03588 [pdf, html, other]
Title: Continual Domain Adversarial Adaptation via Double-Head Discriminators
Yan Shen, Zhanghexuan Ji, Chunwei Ma, Mingchen Gao
Comments: AISTATS 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[442] arXiv:2402.03589 [pdf, html, other]
Title: A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System
Jiaqi Liang, Sanjay Dominik Jena, Defeng Liu, Andrea Lodi
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[443] arXiv:2402.03590 [pdf, html, other]
Title: Assessing the Impact of Distribution Shift on Reinforcement Learning Performance
Ted Fujimoto, Joshua Suetterlein, Samrat Chatterjee, Auroop Ganguly
Comments: Poster at the Workshop on Regulatable Machine Learning at the 37th Conference on Neural Information Processing Systems (RegML @ NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[444] arXiv:2402.03610 [pdf, html, other]
Title: RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
Tomoyuki Kagaya, Thong Jing Yuan, Yuxuan Lou, Jayashree Karlekar, Sugiri Pranata, Akira Kinose, Koki Oguri, Felix Wick, Yang You
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[445] arXiv:2402.03614 [pdf, html, other]
Title: Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs
He Zhao, Vassili Kitsios, Terence J. O'Kane, Edwin V. Bonilla
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[446] arXiv:2402.03621 [pdf, html, other]
Title: Neural Network Approximators for Marginal MAP in Probabilistic Circuits
Shivvrat Arya, Tahrima Rahman, Vibhav Gogate
Comments: Will appear in AAAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[447] arXiv:2402.03625 [pdf, html, other]
Title: Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time
Sungyoon Kim, Mert Pilanci
Comments: Version 2: Fixed proof of Thm 4.4, slight clarification on assumption 2. Version 3: Modified to ICML style and slight clarification on assumption 1
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[448] arXiv:2402.03629 [pdf, html, other]
Title: Disparate Impact on Group Accuracy of Linearization for Private Inference
Saswat Das, Marco Romanelli, Ferdinando Fioretto
Comments: Extended version of the paper accepted to appear at the Forty-first International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[449] arXiv:2402.03646 [pdf, other]
Title: Lens: A Foundation Model for Network Traffic
Qineng Wang, Chen Qian, Xiaochang Li, Ziyu Yao, Gang Zhou, Huajie Shao
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[450] arXiv:2402.03647 [pdf, html, other]
Title: CAMBranch: Contrastive Learning with Augmented MILPs for Branching
Jiacheng Lin, Meng Xu, Zhihua Xiong, Huangang Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)