Machine Learning

Authors and titles for February 2024

Total of 3960 entries (showing 501-550)
[501] arXiv:2402.04005 [pdf, html, other]
Title: Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning
Idan Achituve, Idit Diamant, Arnon Netzer, Gal Chechik, Ethan Fetaya
Subjects: Machine Learning (cs.LG)
[502] arXiv:2402.04010 [pdf, other]
Title: Efficient Availability Attacks against Supervised and Contrastive Learning Simultaneously
Yihan Wang, Yifan Zhu, Xiao-Shan Gao
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[503] arXiv:2402.04019 [pdf, other]
Title: Exploring the Effects of Population and Employment Characteristics on Truck Flows: An Analysis of NextGen NHTS Origin-Destination Data
Majbah Uddin, Yuandong Liu, Hyeonsup Lim
Journal-ref: In International Conference on Transportation and Development 2023 (pp. 503-513)
Subjects: Machine Learning (cs.LG)
[504] arXiv:2402.04029 [pdf, html, other]
Title: Positive concave deep equilibrium models
Mateusz Gabor, Tomasz Piotrowski, Renato L. G. Cavalcante
Subjects: Machine Learning (cs.LG)
[505] arXiv:2402.04030 [pdf, other]
Title: Reducing the Cost of Quantum Chemical Data By Backpropagating Through Density Functional Theory
Alexander Mathiasen, Hatem Helal, Paul Balanca, Adam Krzywaniak, Ali Parviz, Frederik Hvilshøj, Blazej Banaszewski, Carlo Luschi, Andrew William Fitzgibbon
Subjects: Machine Learning (cs.LG)
[506] arXiv:2402.04033 [pdf, html, other]
Title: On provable privacy vulnerabilities of graph representations
Ruofan Wu, Guanhua Fang, Qiying Pan, Mingyang Zhang, Tengfei Liu, Weiqiang Wang
Subjects: Machine Learning (cs.LG)
[507] arXiv:2402.04050 [pdf, html, other]
Title: Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models
Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[508] arXiv:2402.04051 [pdf, html, other]
Title: Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito, Masanori Yamada, Atsutoshi Kumagai
Comments: In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR 2025)
Subjects: Machine Learning (cs.LG)
[509] arXiv:2402.04054 [pdf, html, other]
Title: More Flexible PAC-Bayesian Meta-Learning by Learning Learning Algorithms
Hossein Zakerinia, Amin Behjati, Christoph H. Lampert
Comments: International Conference on Machine Learning (ICML), 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[510] arXiv:2402.04059 [pdf, html, other]
Title: Deep Learning for Multivariate Time Series Imputation: A Survey
Jun Wang, Wenjie Du, Yiyuan Yang, Linglong Qian, Wei Cao, Keli Zhang, Wenjia Wang, Yuxuan Liang, Qingsong Wen
Comments: Accepted by IJCAI 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[511] arXiv:2402.04062 [pdf, other]
Title: Link Prediction with Relational Hypergraphs
Xingyue Huang, Miguel Romero Orth, Pablo Barceló, Michael M. Bronstein, İsmail İlkan Ceylan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[512] arXiv:2402.04068 [pdf, html, other]
Title: Retrieve to Explain: Evidence-driven Predictions for Explainable Drug Target Identification
Ravi Patel, Angus Brayne, Rogier Hintzen, Daniel Jaroslawicz, Georgiana Neculae, Dane Corneil
Comments: Accepted at ACL 2025 (The 63rd Annual Meeting of the Association for Computational Linguistics)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[513] arXiv:2402.04080 [pdf, html, other]
Title: Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Ruoqi Zhang, Ziwei Luo, Jens Sjölund, Thomas B. Schön, Per Mattsson
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[514] arXiv:2402.04081 [pdf, html, other]
Title: Improved Generalization of Weight Space Networks via Augmentations
Aviv Shamsian, Aviv Navon, David W. Zhang, Yan Zhang, Ethan Fetaya, Gal Chechik, Haggai Maron
Comments: Under Review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[515] arXiv:2402.04082 [pdf, other]
Title: An Optimal House Price Prediction Algorithm: XGBoost
Hemlata Sharma, Hitesh Harsora, Bayode Ogunleye
Comments: 16 pages, Journal of Analytics
Journal-ref: Analytics, 3(1), 30-45 (2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Methodology (stat.ME)
[516] arXiv:2402.04084 [pdf, other]
Title: Provably learning a multi-head attention layer
Sitan Chen, Yuanzhi Li
Comments: 105 pages, comments welcome
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[517] arXiv:2402.04103 [pdf, other]
Title: An Exploration of Clustering Algorithms for Customer Segmentation in the UK Retail Market
Jeen Mary John, Olamilekan Shobayo, Bayode Ogunleye
Comments: 15 pages, Journal of Analytics
Journal-ref: Analytics, 2(4), 809-823 (2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Computation (stat.CO)
[518] arXiv:2402.04108 [pdf, other]
Title: Hierarchical Delay Attribution Classification using Unstructured Text in Train Management Systems
Anton Borg, Per Lingvall, Martin Svensson
Comments: 22 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[519] arXiv:2402.04119 [pdf, html, other]
Title: A quantitative analysis of knowledge-learning preferences in large language models in molecular science
Pengfei Liu, Jun Tao, Zhixiang Ren
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[520] arXiv:2402.04129 [pdf, html, other]
Title: OVOR: OnePrompt with Virtual Outlier Regularization for Rehearsal-Free Class-Incremental Learning
Wei-Cheng Huang, Chun-Fu Chen, Hsiang Hsu
Comments: Accepted by ICLR 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[521] arXiv:2402.04161 [pdf, html, other]
Title: Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains
Ashok Vardhan Makkuva, Marco Bondaschi, Adway Girish, Alliot Nagle, Martin Jaggi, Hyeji Kim, Michael Gastpar
Comments: Published at ICLR 2025 under the title "Attention with Markov: A Curious Case of Single-Layer Transformers"
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Theory (cs.IT); Machine Learning (stat.ML)
[522] arXiv:2402.04163 [pdf, other]
Title: Tempered Calculus for ML: Application to Hyperbolic Model Embedding
Richard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen, Manfred K. Warmuth
Comments: Subsumed by paper "Hyperbolic Embeddings of Supervised Models" by Richard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen and Manfred K. Warmuth, appearing at NeurIPS'24
Subjects: Machine Learning (cs.LG)
[523] arXiv:2402.04168 [pdf, html, other]
Title: Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptions
Daniel Bogdoll, Jing Qin, Moritz Nekolla, Ahmed Abouelazm, Tim Joseph, J. Marius Zöllner
Comments: Daniel Bogdoll and Jing Qin contributed equally. Accepted for publication at ICRA 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[524] arXiv:2402.04182 [pdf, other]
Title: Reinforcement Learning with Ensemble Model Predictive Safety Certification
Sven Gronauer, Tom Haider, Felippe Schmoeller da Roza, Klaus Diepold
Comments: Published in: Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[525] arXiv:2402.04193 [pdf, html, other]
Title: Gradient Coding in Decentralized Learning for Evading Stragglers
Chengxi Li, Mikael Skoglund
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[526] arXiv:2402.04209 [pdf, other]
Title: Acute kidney injury prediction for non-critical care patients: a retrospective external and internal validation study
Esra Adiyeke, Yuanfang Ren, Benjamin Shickel, Matthew M. Ruppert, Ziyuan Guan, Sandra L. Kane-Gill, Raghavan Murugan, Nabihah Amatullah, Britney A. Stottlemyer, Tiffany L. Tran, Dan Ricketts, Christopher M Horvat, Parisa Rashidi, Azra Bihorac, Tezcan Ozrazgat-Baslanti
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[527] arXiv:2402.04211 [pdf, other]
Title: Probabilistic Shapley Value Modeling and Inference
Mert Ketenci, Iñigo Urteaga, Victor Alfonso Rodriguez, Noémie Elhadad, Adler Perotte
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[528] arXiv:2402.04229 [pdf, other]
Title: MusicRL: Aligning Music Generation to Human Preferences
Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[529] arXiv:2402.04239 [pdf, html, other]
Title: CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers
Adjorn van Engelenhoven, Nicola Strisciuglio, Estefanía Talavera
Subjects: Machine Learning (cs.LG)
[530] arXiv:2402.04248 [pdf, html, other]
Title: Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
Jongho Park, Jaeseung Park, Zheyang Xiong, Nayoung Lee, Jaewoong Cho, Samet Oymak, Kangwook Lee, Dimitris Papailiopoulos
Comments: Changes in v2: experiments on formal language ICL and explorations of width vs. depth on ICL; code repo available (24 pages, 10 figures)
Subjects: Machine Learning (cs.LG)
[531] arXiv:2402.04249 [pdf, html, other]
Title: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Mantas Mazeika, Long Phan, Xuwang Yin, Andy Zou, Zifan Wang, Norman Mu, Elham Sakhaee, Nathaniel Li, Steven Basart, Bo Li, David Forsyth, Dan Hendrycks
Comments: Website: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[532] arXiv:2402.04284 [pdf, html, other]
Title: PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks
Junwei Su, Difan Zou, Chuan Wu
Subjects: Machine Learning (cs.LG)
[533] arXiv:2402.04290 [pdf, html, other]
Title: CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling
Junchao Gong, Lei Bai, Peng Ye, Wanghan Xu, Na Liu, Jianhua Dai, Xiaokang Yang, Wanli Ouyang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[534] arXiv:2402.04291 [pdf, html, other]
Title: BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Wei Huang, Yangdong Liu, Haotong Qin, Ying Li, Shiming Zhang, Xianglong Liu, Michele Magno, Xiaojuan Qi
Comments: 19 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[535] arXiv:2402.04292 [pdf, other]
Title: AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
Xixi Hu, Bo Liu, Xingchao Liu, Qiang Liu
Comments: NeurIPS 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[536] arXiv:2402.04296 [pdf, other]
Title: LightHGNN: Distilling Hypergraph Neural Networks into MLPs for $100\times$ Faster Inference
Yifan Feng, Yihe Luo, Shihui Ying, Yue Gao
Comments: Some details are missing. The method of this paper is not complete
Subjects: Machine Learning (cs.LG)
[537] arXiv:2402.04298 [pdf, html, other]
Title: Multi-View Symbolic Regression
Etienne Russeil, Fabrício Olivetti de França, Konstantin Malanchev, Bogdan Burlacu, Emille E. O. Ishida, Marion Leroux, Clément Michelin, Guillaume Moinard, Emmanuel Gangler
Comments: Published in GECCO-2024. 11 pages, 5 figures
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); Applications (stat.AP)
[538] arXiv:2402.04325 [pdf, html, other]
Title: Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons
Zhenyu Liu, Garrett Gagnon, Swagath Venkataramani, Liu Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[539] arXiv:2402.04344 [pdf, html, other]
Title: Does confidence calibration improve conformal prediction?
Huajun Xi, Jianguo Huang, Kangdao Liu, Lei Feng, Hongxin Wei
Subjects: Machine Learning (cs.LG)
[540] arXiv:2402.04347 [pdf, html, other]
Title: The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
Michael Zhang, Kush Bhatia, Hermann Kumbong, Christopher Ré
Comments: 30 pages, 20 figures, 15 tables, ICLR 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[541] arXiv:2402.04359 [pdf, html, other]
Title: Adaptive Inference: Theoretical Limits and Unexplored Opportunities
Soheil Hor, Ying Qian, Mert Pilanci, Amin Arbabian
Subjects: Machine Learning (cs.LG)
[542] arXiv:2402.04362 [pdf, html, other]
Title: Neural Networks Learn Statistics of Increasing Complexity
Nora Belrose, Quintin Pope, Lucia Quirke, Alex Mallen, Xiaoli Fern
Subjects: Machine Learning (cs.LG)
[543] arXiv:2402.04375 [pdf, html, other]
Title: Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic Data
Yvonne Zhou, Mingyu Liang, Ivan Brugere, Dana Dachman-Soled, Danial Dervovic, Antigoni Polychroniadou, Min Wu
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[544] arXiv:2402.04376 [pdf, html, other]
Title: Scaling laws for learning with real and surrogate data
Ayush Jain, Andrea Montanari, Eren Sasoglu
Comments: Added new experiment and minor changes
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[545] arXiv:2402.04377 [pdf, html, other]
Title: NeRCC: Nested-Regression Coded Computing for Resilient Distributed Prediction Serving Systems
Parsa Moradi, Mohammad Ali Maddah-Ali
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[546] arXiv:2402.04379 [pdf, html, other]
Title: Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Nate Gruver, Anuroop Sriram, Andrea Madotto, Andrew Gordon Wilson, C. Lawrence Zitnick, Zachary Ulissi
Comments: ICLR 2024. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[547] arXiv:2402.04383 [pdf, html, other]
Title: FairWire: Fair Graph Generation
O. Deniz Kose, Yanning Shen
Comments: 16 pages, 1 figure, 7 tables
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[548] arXiv:2402.04384 [pdf, other]
Title: Denoising Diffusion Probabilistic Models in Six Simple Steps
Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[549] arXiv:2402.04390 [pdf, other]
Title: Densely Multiplied Physics Informed Neural Networks
Feilong Jiang, Xiaonan Hou, Min Xia
Comments: 15 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[550] arXiv:2402.04396 [pdf, html, other]
Title: QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks
Albert Tseng, Jerry Chee, Qingyao Sun, Volodymyr Kuleshov, Christopher De Sa
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)