Machine Learning

Authors and titles for February 2024

Total of 3960 entries : 1-50 ... 351-400 401-450 451-500 501-550 551-600 601-650 651-700 ... 3951-3960


[501] arXiv:2402.04005 [pdf, html, other]: Title: Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning

Idan Achituve, Idit Diamant, Arnon Netzer, Gal Chechik, Ethan Fetaya

Subjects: Machine Learning (cs.LG)
[502] arXiv:2402.04010 [pdf, other]: Title: Efficient Availability Attacks against Supervised and Contrastive Learning Simultaneously

Yihan Wang, Yifan Zhu, Xiao-Shan Gao

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[503] arXiv:2402.04019 [pdf, other]: Title: Exploring the Effects of Population and Employment Characteristics on Truck Flows: An Analysis of NextGen NHTS Origin-Destination Data

Majbah Uddin, Yuandong Liu, Hyeonsup Lim

Journal-ref: In International Conference on Transportation and Development 2023 (pp. 503-513)

Subjects: Machine Learning (cs.LG)
[504] arXiv:2402.04029 [pdf, html, other]: Title: Positive concave deep equilibrium models

Mateusz Gabor, Tomasz Piotrowski, Renato L. G. Cavalcante

Subjects: Machine Learning (cs.LG)
[505] arXiv:2402.04030 [pdf, other]: Title: Reducing the Cost of Quantum Chemical Data By Backpropagating Through Density Functional Theory

Alexander Mathiasen, Hatem Helal, Paul Balanca, Adam Krzywaniak, Ali Parviz, Frederik Hvilshøj, Blazej Banaszewski, Carlo Luschi, Andrew William Fitzgibbon

Subjects: Machine Learning (cs.LG)
[506] arXiv:2402.04033 [pdf, html, other]: Title: On provable privacy vulnerabilities of graph representations

Ruofan Wu, Guanhua Fang, Qiying Pan, Mingyang Zhang, Tengfei Liu, Weiqiang Wang

Subjects: Machine Learning (cs.LG)
[507] arXiv:2402.04050 [pdf, html, other]: Title: Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models

Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

Comments: Accepted by ICML 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[508] arXiv:2402.04051 [pdf, html, other]: Title: Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods

Akira Ito, Masanori Yamada, Atsutoshi Kumagai

Comments: In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR 2025)

Subjects: Machine Learning (cs.LG)
[509] arXiv:2402.04054 [pdf, html, other]: Title: More Flexible PAC-Bayesian Meta-Learning by Learning Learning Algorithms

Hossein Zakerinia, Amin Behjati, Christoph H. Lampert

Comments: International Conference on Machine Learning (ICML), 2024

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[510] arXiv:2402.04059 [pdf, html, other]: Title: Deep Learning for Multivariate Time Series Imputation: A Survey

Jun Wang, Wenjie Du, Yiyuan Yang, Linglong Qian, Wei Cao, Keli Zhang, Wenjia Wang, Yuxuan Liang, Qingsong Wen

Comments: Accepted by IJCAI 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[511] arXiv:2402.04062 [pdf, other]: Title: Link Prediction with Relational Hypergraphs

Xingyue Huang, Miguel Romero Orth, Pablo Barceló, Michael M. Bronstein, İsmail İlkan Ceylan

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[512] arXiv:2402.04068 [pdf, html, other]: Title: Retrieve to Explain: Evidence-driven Predictions for Explainable Drug Target Identification

Ravi Patel, Angus Brayne, Rogier Hintzen, Daniel Jaroslawicz, Georgiana Neculae, Dane Corneil

Comments: Accepted at ACL 2025 (The 63rd Annual Meeting of the Association for Computational Linguistics)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[513] arXiv:2402.04080 [pdf, html, other]: Title: Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning

Ruoqi Zhang, Ziwei Luo, Jens Sjölund, Thomas B. Schön, Per Mattsson

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[514] arXiv:2402.04081 [pdf, html, other]: Title: Improved Generalization of Weight Space Networks via Augmentations

Aviv Shamsian, Aviv Navon, David W. Zhang, Yan Zhang, Ethan Fetaya, Gal Chechik, Haggai Maron

Comments: Under Review

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[515] arXiv:2402.04082 [pdf, other]: Title: An Optimal House Price Prediction Algorithm: XGBoost

Hemlata Sharma, Hitesh Harsora, Bayode Ogunleye

Comments: 16 pages, Journal of Analytics

Journal-ref: Analytics, 3(1), 30-45 (2024)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Methodology (stat.ME)
[516] arXiv:2402.04084 [pdf, other]: Title: Provably learning a multi-head attention layer

Sitan Chen, Yuanzhi Li

Comments: 105 pages, comments welcome

Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[517] arXiv:2402.04103 [pdf, other]: Title: An Exploration of Clustering Algorithms for Customer Segmentation in the UK Retail Market

Jeen Mary John, Olamilekan Shobayo, Bayode Ogunleye

Comments: 15 pages, Journal of Analytics

Journal-ref: Analytics, 2(4), 809-823 (2023)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Computation (stat.CO)
[518] arXiv:2402.04108 [pdf, other]: Title: Hierarchical Delay Attribution Classification using Unstructured Text in Train Management Systems

Anton Borg, Per Lingvall, Martin Svensson

Comments: 22 pages, 7 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[519] arXiv:2402.04119 [pdf, html, other]: Title: A quantitative analysis of knowledge-learning preferences in large language models in molecular science

Pengfei Liu, Jun Tao, Zhixiang Ren

Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[520] arXiv:2402.04129 [pdf, html, other]: Title: OVOR: OnePrompt with Virtual Outlier Regularization for Rehearsal-Free Class-Incremental Learning

Wei-Cheng Huang, Chun-Fu Chen, Hsiang Hsu

Comments: Accepted by ICLR 2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[521] arXiv:2402.04161 [pdf, html, other]: Title: Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains

Ashok Vardhan Makkuva, Marco Bondaschi, Adway Girish, Alliot Nagle, Martin Jaggi, Hyeji Kim, Michael Gastpar

Comments: Published at ICLR 2025 under the title "Attention with Markov: A Curious Case of Single-Layer Transformers"

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Theory (cs.IT); Machine Learning (stat.ML)
[522] arXiv:2402.04163 [pdf, other]: Title: Tempered Calculus for ML: Application to Hyperbolic Model Embedding

Richard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen, Manfred K. Warmuth

Comments: Subsumed by paper "Hyperbolic Embeddings of Supervised Models" by Richard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen and Manfred K. Warmuth, appearing at NeurIPS'24

Subjects: Machine Learning (cs.LG)
[523] arXiv:2402.04168 [pdf, html, other]: Title: Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptions

Daniel Bogdoll, Jing Qin, Moritz Nekolla, Ahmed Abouelazm, Tim Joseph, J. Marius Zöllner

Comments: Daniel Bogdoll and Jing Qin contributed equally. Accepted for publication at ICRA 2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[524] arXiv:2402.04182 [pdf, other]: Title: Reinforcement Learning with Ensemble Model Predictive Safety Certification

Sven Gronauer, Tom Haider, Felippe Schmoeller da Roza, Klaus Diepold

Comments: Published in: Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)

Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[525] arXiv:2402.04193 [pdf, html, other]: Title: Gradient Coding in Decentralized Learning for Evading Stragglers

Chengxi Li, Mikael Skoglund

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[526] arXiv:2402.04209 [pdf, other]: Title: Acute kidney injury prediction for non-critical care patients: a retrospective external and internal validation study

Esra Adiyeke, Yuanfang Ren, Benjamin Shickel, Matthew M. Ruppert, Ziyuan Guan, Sandra L. Kane-Gill, Raghavan Murugan, Nabihah Amatullah, Britney A. Stottlemyer, Tiffany L. Tran, Dan Ricketts, Christopher M Horvat, Parisa Rashidi, Azra Bihorac, Tezcan Ozrazgat-Baslanti

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[527] arXiv:2402.04211 [pdf, other]: Title: Probabilistic Shapley Value Modeling and Inference

Mert Ketenci, Iñigo Urteaga, Victor Alfonso Rodriguez, Noémie Elhadad, Adler Perotte

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[528] arXiv:2402.04229 [pdf, other]: Title: MusicRL: Aligning Music Generation to Human Preferences

Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli

Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[529] arXiv:2402.04239 [pdf, html, other]: Title: CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers

Adjorn van Engelenhoven, Nicola Strisciuglio, Estefanía Talavera

Subjects: Machine Learning (cs.LG)
[530] arXiv:2402.04248 [pdf, html, other]: Title: Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

Jongho Park, Jaeseung Park, Zheyang Xiong, Nayoung Lee, Jaewoong Cho, Samet Oymak, Kangwook Lee, Dimitris Papailiopoulos

Comments: Changes in v2: experiments on formal language ICL and explorations of width vs. depth on ICL; code repo available (24 pages, 10 figures)

Subjects: Machine Learning (cs.LG)
[531] arXiv:2402.04249 [pdf, html, other]: Title: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Mantas Mazeika, Long Phan, Xuwang Yin, Andy Zou, Zifan Wang, Norman Mu, Elham Sakhaee, Nathaniel Li, Steven Basart, Bo Li, David Forsyth, Dan Hendrycks

Comments: Website: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[532] arXiv:2402.04284 [pdf, html, other]: Title: PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks

Junwei Su, Difan Zou, Chuan Wu

Subjects: Machine Learning (cs.LG)
[533] arXiv:2402.04290 [pdf, html, other]: Title: CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling

Junchao Gong, Lei Bai, Peng Ye, Wanghan Xu, Na Liu, Jianhua Dai, Xiaokang Yang, Wanli Ouyang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[534] arXiv:2402.04291 [pdf, html, other]: Title: BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Wei Huang, Yangdong Liu, Haotong Qin, Ying Li, Shiming Zhang, Xianglong Liu, Michele Magno, Xiaojuan Qi

Comments: 19 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[535] arXiv:2402.04292 [pdf, other]: Title: AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies

Xixi Hu, Bo Liu, Xingchao Liu, Qiang Liu

Comments: NeurIPS 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[536] arXiv:2402.04296 [pdf, other]: Title: LightHGNN: Distilling Hypergraph Neural Networks into MLPs for $100\times$ Faster Inference

Yifan Feng, Yihe Luo, Shihui Ying, Yue Gao

Comments: Some details are missing. The method of this paper is not complete

Subjects: Machine Learning (cs.LG)
[537] arXiv:2402.04298 [pdf, html, other]: Title: Multi-View Symbolic Regression

Etienne Russeil, Fabrício Olivetti de França, Konstantin Malanchev, Bogdan Burlacu, Emille E. O. Ishida, Marion Leroux, Clément Michelin, Guillaume Moinard, Emmanuel Gangler

Comments: Published in GECCO-2024. 11 pages, 5 figures

Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); Applications (stat.AP)
[538] arXiv:2402.04325 [pdf, html, other]: Title: Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons

Zhenyu Liu, Garrett Gagnon, Swagath Venkataramani, Liu Liu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[539] arXiv:2402.04344 [pdf, html, other]: Title: Does confidence calibration improve conformal prediction?

Huajun Xi, Jianguo Huang, Kangdao Liu, Lei Feng, Hongxin Wei

Subjects: Machine Learning (cs.LG)
[540] arXiv:2402.04347 [pdf, html, other]: Title: The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry

Michael Zhang, Kush Bhatia, Hermann Kumbong, Christopher Ré

Comments: 30 pages, 20 figures, 15 tables, ICLR 2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[541] arXiv:2402.04359 [pdf, html, other]: Title: Adaptive Inference: Theoretical Limits and Unexplored Opportunities

Soheil Hor, Ying Qian, Mert Pilanci, Amin Arbabian

Subjects: Machine Learning (cs.LG)
[542] arXiv:2402.04362 [pdf, html, other]: Title: Neural Networks Learn Statistics of Increasing Complexity

Nora Belrose, Quintin Pope, Lucia Quirke, Alex Mallen, Xiaoli Fern

Subjects: Machine Learning (cs.LG)
[543] arXiv:2402.04375 [pdf, html, other]: Title: Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic Data

Yvonne Zhou, Mingyu Liang, Ivan Brugere, Dana Dachman-Soled, Danial Dervovic, Antigoni Polychroniadou, Min Wu

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[544] arXiv:2402.04376 [pdf, html, other]: Title: Scaling laws for learning with real and surrogate data

Ayush Jain, Andrea Montanari, Eren Sasoglu

Comments: Added new experiment and minor changes

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[545] arXiv:2402.04377 [pdf, html, other]: Title: NeRCC: Nested-Regression Coded Computing for Resilient Distributed Prediction Serving Systems

Parsa Moradi, Mohammad Ali Maddah-Ali

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[546] arXiv:2402.04379 [pdf, html, other]: Title: Fine-Tuned Language Models Generate Stable Inorganic Materials as Text

Nate Gruver, Anuroop Sriram, Andrea Madotto, Andrew Gordon Wilson, C. Lawrence Zitnick, Zachary Ulissi

Comments: ICLR 2024. Code available at: this https URL

Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[547] arXiv:2402.04383 [pdf, html, other]: Title: FairWire: Fair Graph Generation

O. Deniz Kose, Yanning Shen

Comments: 16 pages, 1 figure, 7 tables

Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[548] arXiv:2402.04384 [pdf, other]: Title: Denoising Diffusion Probabilistic Models in Six Simple Steps

Richard E. Turner, Cristiana-Diana Diaconu, Stratis Markou, Aliaksandra Shysheya, Andrew Y. K. Foong, Bruno Mlodozeniec

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[549] arXiv:2402.04390 [pdf, other]: Title: Densely Multiplied Physics Informed Neural Networks

Feilong Jiang, Xiaonan Hou, Min Xia

Comments: 15 pages, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[550] arXiv:2402.04396 [pdf, html, other]: Title: QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks

Albert Tseng, Jerry Chee, Qingyao Sun, Volodymyr Kuleshov, Christopher De Sa

Comments: ICML 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
