Machine Learning

Authors and titles for October 2024

Total of 4845 entries; showing entries 901-1000
[901] arXiv:2410.07994 [pdf, html, other]
Title: Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu, Johan Obando-Ceron, Aaron Courville, Ling Pan
Subjects: Machine Learning (cs.LG)
[902] arXiv:2410.08000 [pdf, html, other]
Title: AHA: Human-Assisted Out-of-Distribution Generalization and Detection
Haoyue Bai, Jifan Zhang, Robert Nowak
Comments: NeurIPS 2024
Subjects: Machine Learning (cs.LG)
[903] arXiv:2410.08003 [pdf, html, other]
Title: More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing
Sagi Shaier, Francisco Pereira, Katharina von der Wense, Lawrence E Hunter, Matt Jones
Comments: Published as a conference paper at ICLR 2025
Subjects: Machine Learning (cs.LG)
[904] arXiv:2410.08007 [pdf, html, other]
Title: Time Can Invalidate Algorithmic Recourse
Giovanni De Toni, Stefano Teso, Bruno Lepri, Andrea Passerini
Comments: This is a preprint of a paper accepted at FAccT 2025. The content is identical to the published version, apart from minor cosmetic changes
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[905] arXiv:2410.08015 [pdf, html, other]
Title: Non-transferable Pruning
Ruyi Ding, Lili Su, Aidong Adam Ding, Yunsi Fei
Comments: Accepted in ECCV 2024
Subjects: Machine Learning (cs.LG)
[906] arXiv:2410.08020 [pdf, other]
Title: Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
Jonas Hübotter, Sascha Bongni, Ido Hakimi, Andreas Krause
Comments: Accepted at ICLR 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[907] arXiv:2410.08024 [pdf, html, other]
Title: Pretraining Graph Transformers with Atom-in-a-Molecule Quantum Properties for Improved ADMET Modeling
Alessio Fallani, Ramil Nugmanov, Jose Arjona-Medina, Jörg Kurt Wegner, Alexandre Tkatchenko, Kostiantyn Chernichenko
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[908] arXiv:2410.08026 [pdf, html, other]
Title: Generalization Bounds and Model Complexity for Kolmogorov-Arnold Networks
Xianyang Zhang, Huijuan Zhou
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[909] arXiv:2410.08037 [pdf, html, other]
Title: Composite Learning Units: Generalized Learning Beyond Parameter Updates to Transform LLMs into Adaptive Reasoners
Santosh Kumar Radha, Oktay Goktas
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multiagent Systems (cs.MA)
[910] arXiv:2410.08041 [pdf, html, other]
Title: On the Convergence of (Stochastic) Gradient Descent for Kolmogorov-Arnold Networks
Yihang Gao, Vincent Y. F. Tan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[911] arXiv:2410.08048 [pdf, html, other]
Title: VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers
Jianing Qi, Hao Tang, Zhigang Zhu
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[912] arXiv:2410.08067 [pdf, html, other]
Title: Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxiang Yang, Yongfei Liu, Liyu Chen, Tao Sun, Zhaoran Wang
Comments: Published at ICML 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[913] arXiv:2410.08069 [pdf, html, other]
Title: Unlearning-based Neural Interpretations
Ching Lam Choi, Alexandre Duplessis, Serge Belongie
Comments: Accepted to ICLR 2025
Journal-ref: Choi, Ching Lam, Alexandre Duplessis, and Serge Belongie. 'Unlearning-Based Neural Interpretations'. In The Thirteenth International Conference on Learning Representations, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[914] arXiv:2410.08071 [pdf, html, other]
Title: Gaussian Process Thompson Sampling via Rootfinding
Taiwo A. Adebiyi, Bach Do, Ruda Zhang
Comments: Paper accepted at the NeurIPS 2024 Workshop on Bayesian Decision-making and Uncertainty for an oral presentation
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[915] arXiv:2410.08074 [pdf, html, other]
Title: Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models
Vinith M. Suriyakumar, Rohan Alur, Ayush Sekhari, Manish Raghavan, Ashia C. Wilson
Comments: 20 pages, 13 figures
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[916] arXiv:2410.08081 [pdf, html, other]
Title: Packing Analysis: Packing Is More Appropriate for Large Models or Datasets in Supervised Fine-tuning
Shuhe Wang, Guoyin Wang, Yizhong Wang, Jiwei Li, Eduard Hovy, Chen Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[917] arXiv:2410.08087 [pdf, html, other]
Title: Noether's razor: Learning Conserved Quantities
Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de Haan
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[918] arXiv:2410.08111 [pdf, html, other]
Title: Active Fourier Auditor for Estimating Distributional Properties of ML Models
Ayoub Ajarra, Bishwamittra Ghosh, Debabrota Basu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (stat.ML)
[919] arXiv:2410.08117 [pdf, html, other]
Title: On Barycenter Computation: Semi-Unbalanced Optimal Transport-based Method on Gaussians
Ngoc-Hai Nguyen, Dung Le, Hoang-Phi Nguyen, Tung Pham, Nhat Ho
Comments: Ngoc-Hai Nguyen and Dung Le contributed equally to this work. 44 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[920] arXiv:2410.08121 [pdf, html, other]
Title: Heterogeneous Graph Auto-Encoder for CreditCard Fraud Detection
Moirangthem Tiken Singh, Rabinder Kumar Prasad, Gurumayum Robert Michael, N K Kaphungkui, N.Hemarjit Singh
Journal-ref: International Journal of Computers and Their Applications, vol. 32, no. 2, pp 123-138, 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[921] arXiv:2410.08125 [pdf, html, other]
Title: Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation
Felix Petersen, Christian Borgelt, Aashwin Mishra, Stefano Ermon
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[922] arXiv:2410.08126 [pdf, html, other]
Title: Mars: Situated Inductive Reasoning in an Open-World Environment
Xiaojuan Tang, Jiaqi Li, Yitao Liang, Song-chun Zhu, Muhan Zhang, Zilong Zheng
Comments: Accepted by NeurIPS 2024 Track Datasets and Benchmarks. Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[923] arXiv:2410.08130 [pdf, html, other]
Title: Think Beyond Size: Adaptive Prompting for More Effective Reasoning
Kamesh R
Comments: Submitted to ICLR 2025. This is a preprint version. Future revisions will include additional evaluations and refinements
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[924] arXiv:2410.08134 [pdf, html, other]
Title: Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction
Jarrid Rector-Brooks, Mohsin Hasan, Zhangzhi Peng, Zachary Quinn, Chenghao Liu, Sarthak Mittal, Nouha Dziri, Michael Bronstein, Yoshua Bengio, Pranam Chatterjee, Alexander Tong, Avishek Joey Bose
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[925] arXiv:2410.08146 [pdf, other]
Title: Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Amrith Setlur, Chirag Nagpal, Adam Fisch, Xinyang Geng, Jacob Eisenstein, Rishabh Agarwal, Alekh Agarwal, Jonathan Berant, Aviral Kumar
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[926] arXiv:2410.08165 [pdf, html, other]
Title: Chain-of-Sketch: Enabling Global Visual Reasoning
Aryo Lotfi, Enrico Fini, Samy Bengio, Moin Nabi, Emmanuel Abbe
Comments: additional experiments added, title changed from "Visual Scratchpads: Enabling Global Reasoning in Vision" to "Chain-of-Sketch: Enabling Global Visual Reasoning"
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[927] arXiv:2410.08198 [pdf, html, other]
Title: Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity
Shuo Xie, Mohamad Amin Mohamadi, Zhiyuan Li
Subjects: Machine Learning (cs.LG)
[928] arXiv:2410.08201 [pdf, html, other]
Title: Efficient Dictionary Learning with Switch Sparse Autoencoders
Anish Mudide, Joshua Engels, Eric J. Michaud, Max Tegmark, Christian Schroeder de Witt
Comments: Code available at this https URL
Subjects: Machine Learning (cs.LG)
[929] arXiv:2410.08243 [pdf, other]
Title: Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow
Cyrile Delestre, Yoann Sola
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[930] arXiv:2410.08245 [pdf, html, other]
Title: Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts
Sukwon Yun, Inyoung Choi, Jie Peng, Yangfan Wu, Jingxuan Bao, Qiyiwen Zhang, Jiayi Xin, Qi Long, Tianlong Chen
Comments: NeurIPS 2024 Spotlight
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[931] arXiv:2410.08247 [pdf, html, other]
Title: Forecasting mortality associated emergency department crowding
Jalmari Nevanlinna, Anna Eidstø, Jari Ylä-Mattila, Teemu Koivistoinen, Niku Oksala, Juho Kanniainen, Ari Palomäki, Antti Roine
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP)
[932] arXiv:2410.08249 [pdf, html, other]
Title: Federated Graph Learning for Cross-Domain Recommendation
Ziqi Yang, Zhaopeng Peng, Zihui Wang, Jianzhong Qi, Chaochao Chen, Weike Pan, Chenglu Wen, Cheng Wang, Xiaoliang Fan
Comments: Accepted by NeurIPS'24
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[933] arXiv:2410.08255 [pdf, html, other]
Title: Generalization from Starvation: Hints of Universality in LLM Knowledge Graph Learning
David D. Baek, Yuxiao Li, Max Tegmark
Comments: 14 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[934] arXiv:2410.08256 [pdf, html, other]
Title: AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments
Cheng Fang, Sicong Liu, Zimu Zhou, Bin Guo, Jiaqi Tang, Ke Ma, Zhiwen Yu
Comments: This paper is accepted by SenSys 2024. Copyright may be transferred without notice
Journal-ref: The 22nd ACM Conference on Embedded Networked Sensor Systems, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[935] arXiv:2410.08288 [pdf, other]
Title: Towards Foundation Models for Mixed Integer Linear Programming
Sirui Li, Janardhan Kulkarni, Ishai Menache, Cathy Wu, Beibin Li
Subjects: Machine Learning (cs.LG)
[936] arXiv:2410.08292 [pdf, html, other]
Title: Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?
Khashayar Gatmiry, Nikunj Saunshi, Sashank J. Reddi, Stefanie Jegelka, Sanjiv Kumar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[937] arXiv:2410.08295 [pdf, other]
Title: Impact of Missing Values in Machine Learning: A Comprehensive Analysis
Abu Fuad Ahmad, Md Shohel Sayeed, Khaznah Alshammari, Istiaque Ahmed
Subjects: Machine Learning (cs.LG)
[938] arXiv:2410.08299 [pdf, html, other]
Title: Privately Learning from Graphs with Applications in Fine-tuning Large Language Models
Haoteng Yin, Rongzhe Wei, Eli Chien, Pan Li
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[939] arXiv:2410.08300 [pdf, html, other]
Title: A Framework to Enable Algorithmic Design Choice Exploration in DNNs
Timothy L. Cronin IV, Sanmukh Kuppannagari
Comments: IEEE HPEC 2024
Subjects: Machine Learning (cs.LG)
[940] arXiv:2410.08304 [pdf, html, other]
Title: Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers
Alberto Alfarano, François Charton, Amaury Hayat
Subjects: Machine Learning (cs.LG)
[941] arXiv:2410.08305 [pdf, html, other]
Title: Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation
Grigory Malinovsky, Umberto Michieli, Hasan Abed Al Kader Hammoud, Taha Ceritli, Hayder Elesedy, Mete Ozay, Peter Richtárik
Comments: 36 pages, 4 figures, 2 algorithms
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[942] arXiv:2410.08307 [pdf, html, other]
Title: UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations
Huy Hoang, Tien Mai, Pradeep Varakantham
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[943] arXiv:2410.08308 [pdf, other]
Title: Machine Learning for Missing Value Imputation
Abu Fuad Ahmad, Khaznah Alshammari, Istiaque Ahmed, MD Shohel Sayed
Subjects: Machine Learning (cs.LG)
[944] arXiv:2410.08309 [pdf, html, other]
Title: Swing-by Dynamics in Concept Learning and Compositional Generalization
Yongyi Yang, Core Francisco Park, Ekdeep Singh Lubana, Maya Okawa, Wei Hu, Hidenori Tanaka
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[945] arXiv:2410.08316 [pdf, html, other]
Title: COS-DPO: Conditioned One-Shot Multi-Objective Fine-Tuning Framework
Yinuo Ren, Tesi Xiao, Michael Shavlovsky, Lexing Ying, Holakou Rahmanian
Comments: Published at UAI 2025
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Optimization and Control (math.OC)
[946] arXiv:2410.08329 [pdf, html, other]
Title: Physics and Deep Learning in Computational Wave Imaging
Youzuo Lin, Shihang Feng, James Theiler, Yinpeng Chen, Umberto Villa, Jing Rao, John Greenhall, Cristian Pantea, Mark A. Anastasio, Brendt Wohlberg
Comments: 29 pages, 11 figures
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[947] arXiv:2410.08336 [pdf, html, other]
Title: Kernel Banzhaf: A Fast and Robust Estimator for Banzhaf Values
Yurong Liu, R. Teal Witter, Flip Korn, Tarfah Alrashed, Dimitris Paparas, Christopher Musco, Juliana Freire
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[948] arXiv:2410.08339 [pdf, html, other]
Title: Simultaneous Weight and Architecture Optimization for Neural Networks
Zitong Huang, Mansooreh Montazerin, Ajitesh Srivastava
Comments: Accepted to NeurIPS 2024 FITML (Fine-Tuning in Modern Machine Learning) Workshop
Subjects: Machine Learning (cs.LG)
[949] arXiv:2410.08355 [pdf, html, other]
Title: Metalic: Meta-Learning In-Context with Protein Language Models
Jacob Beck, Shikha Surana, Manus McAuliffe, Oliver Bent, Thomas D. Barrett, Juan Jose Garau Luis, Paul Duckworth
Comments: Published at The Thirteenth International Conference on Learning Representations (ICLR 2025). Code is provided at this https URL. Also relevant to searches for "metallic", "meta-learning in-context", "LLM", and "protein language model"
Journal-ref: The Thirteenth International Conference on Learning Representations (ICLR 2025)
Subjects: Machine Learning (cs.LG)
[950] arXiv:2410.08360 [pdf, html, other]
Title: Minimax Hypothesis Testing for the Bradley-Terry-Luce Model
Anuran Makur, Japneet Singh
Comments: 54 pages, 6 figures
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Statistics Theory (math.ST)
[951] arXiv:2410.08362 [pdf, html, other]
Title: Towards Optimal Environmental Policies: Policy Learning under Arbitrary Bipartite Network Interference
Raphael C. Kim, Falco J. Bargagli-Stoffi, Kevin L. Chen, Rachel C. Nethery
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[952] arXiv:2410.08368 [pdf, html, other]
Title: ElasticTok: Adaptive Tokenization for Image and Video
Wilson Yan, Volodymyr Mnih, Aleksandra Faust, Matei Zaharia, Pieter Abbeel, Hao Liu
Subjects: Machine Learning (cs.LG)
[953] arXiv:2410.08385 [pdf, html, other]
Title: Language model developers should report train-test overlap
Andy K Zhang, Kevin Klyman, Yifan Mai, Yoav Levine, Yian Zhang, Rishi Bommasani, Percy Liang
Comments: ICML 2025 Spotlight; 23 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Software Engineering (cs.SE)
[954] arXiv:2410.08389 [pdf, html, other]
Title: Heating Up Quasi-Monte Carlo Graph Random Features: A Diffusion Kernel Perspective
Brooke Feinberg, Aiwen Li
Comments: 18 pages, 16 figures
Subjects: Machine Learning (cs.LG); Combinatorics (math.CO)
[955] arXiv:2410.08394 [pdf, html, other]
Title: Identifying Money Laundering Subgraphs on the Blockchain
Kiwhan Song, Mohamed Ali Dhraief, Muhua Xu, Locke Cai, Xuhao Chen, Arvind, Jie Chen
Comments: ICAIF 2024. Code is available at this https URL
Subjects: Machine Learning (cs.LG); General Finance (q-fin.GN)
[956] arXiv:2410.08407 [pdf, html, other]
Title: What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias
Aida Mohammadshahi, Yani Ioannou
Comments: Published in Transactions on Machine Learning Research (TMLR), March 2025. this https URL
Journal-ref: Transactions on Machine Learning Research, 2835-8856, March 2025. https://openreview.net/forum?id=xBbj46Y2fN
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[957] arXiv:2410.08417 [pdf, html, other]
Title: Bilinear MLPs enable weight-based mechanistic interpretability
Michael T. Pearce, Thomas Dooms, Alice Rigg, Jose M. Oramas, Lee Sharkey
Comments: Accepted to ICLR'25
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[958] arXiv:2410.08421 [pdf, html, other]
Title: Generalizable autoregressive modeling of time series through functional narratives
Ran Liu, Wenrui Ma, Ellen Zippi, Hadi Pouransari, Jingyun Xiao, Chris Sandino, Behrooz Mahasseni, Juri Minxha, Erdrin Azemi, Eva L. Dyer, Ali Moin
Subjects: Machine Learning (cs.LG)
[959] arXiv:2410.08423 [pdf, html, other]
Title: A phase transition in sampling from Restricted Boltzmann Machines
Youngwoo Kwon, Qian Qin, Guanyang Wang, Yuchen Wei
Comments: 43 pages, 4 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Mathematical Physics (math-ph); Probability (math.PR); Computation (stat.CO)
[960] arXiv:2410.08432 [pdf, html, other]
Title: MYCROFT: Towards Effective and Efficient External Data Augmentation
Zain Sarwar, Van Tran, Arjun Nitin Bhagoji, Nick Feamster, Ben Y. Zhao, Supriyo Chakraborty
Comments: 10 pages, 3 figures, 3 tables
Subjects: Machine Learning (cs.LG)
[961] arXiv:2410.08439 [pdf, html, other]
Title: Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics
Josiah C. Kratz, Jacob Adamczyk
Comments: Accepted at ICLR 2025
Subjects: Machine Learning (cs.LG); Populations and Evolution (q-bio.PE)
[962] arXiv:2410.08442 [pdf, other]
Title: JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles
Dom Nasrabadi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[963] arXiv:2410.08447 [pdf, html, other]
Title: Slow Convergence of Interacting Kalman Filters in Word-of-Mouth Social Learning
Vikram Krishnamurthy, Cristian Rojas
Subjects: Machine Learning (cs.LG); Theoretical Economics (econ.TH); Signal Processing (eess.SP)
[964] arXiv:2410.08449 [pdf, html, other]
Title: Finite Sample and Large Deviations Analysis of Stochastic Gradient Algorithm with Correlated Noise
George Yin, Vikram Krishnamurthy
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[965] arXiv:2410.08453 [pdf, html, other]
Title: AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion
Yuting Xie, Xianda Guo, Cong Wang, Kunhua Liu, Long Chen
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[966] arXiv:2410.08455 [pdf, html, other]
Title: Why pre-training is beneficial for downstream classification tasks?
Xin Jiang, Xu Cheng, Zechao Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[967] arXiv:2410.08458 [pdf, html, other]
Title: Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
Abhijnan Nath, Changsoo Jung, Ethan Seefried, Nikhil Krishnaswamy
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[968] arXiv:2410.08469 [pdf, html, other]
Title: Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP
Eunji Kim, Kyuhong Shim, Simyung Chang, Sungroh Yoon
Comments: Accepted at EMNLP 2024 Findings
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[969] arXiv:2410.08473 [pdf, html, other]
Title: Deeper Insights into Deep Graph Convolutional Networks: Stability and Generalization
Guangrui Yang, Ming Li, Han Feng, Xiaosheng Zhuang
Comments: 44 pages, 3 figures, submitted to IEEE Trans. Pattern Anal. Mach. Intell. on 18-Jun-2024, under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[970] arXiv:2410.08497 [pdf, html, other]
Title: Towards Sharper Risk Bounds for Minimax Problems
Bowei Zhu, Shaojie Li, Yong Liu
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[971] arXiv:2410.08498 [pdf, html, other]
Title: On a Hidden Property in Computational Imaging
Yinan Feng, Yinpeng Chen, Yueh Lee, Youzuo Lin
Subjects: Machine Learning (cs.LG)
[972] arXiv:2410.08503 [pdf, html, other]
Title: Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data
Binghui Li, Yuanzhi Li
Comments: Published as a conference paper at ICLR 2025; 36 pages
Subjects: Machine Learning (cs.LG)
[973] arXiv:2410.08511 [pdf, html, other]
Title: Distributionally robust self-supervised learning for tabular data
Shantanu Ghosh, Tiankang Xie, Mikhail Kuznetsov
Comments: TRL Workshop@NeurIPS2024
Subjects: Machine Learning (cs.LG)
[974] arXiv:2410.08522 [pdf, html, other]
Title: Evaluating the effects of Data Sparsity on the Link-level Bicycling Volume Estimation: A Graph Convolutional Neural Network Approach
Mohit Gupta, Debjit Bhowmick, Meead Saberi, Shirui Pan, Ben Beck
Subjects: Machine Learning (cs.LG)
[975] arXiv:2410.08524 [pdf, html, other]
Title: IGNN-Solver: A Graph Neural Solver for Implicit Graph Neural Networks
Junchao Lin, Zenan Ling, Zhanbo Feng, Jingwen Xu, Minxuan Liao, Feng Zhou, Tianqi Hou, Zhenyu Liao, Robert C. Qiu
Subjects: Machine Learning (cs.LG)
[976] arXiv:2410.08537 [pdf, html, other]
Title: Robust Offline Policy Learning with Observational Data from Multiple Sources
Aldo Gael Carranza, Susan Athey
Comments: arXiv admin note: substantial text overlap with arXiv:2305.12407
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[977] arXiv:2410.08540 [pdf, html, other]
Title: Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning
Xinran Li, Ling Pan, Jun Zhang
Comments: Accepted by the Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[978] arXiv:2410.08549 [pdf, html, other]
Title: Score Neural Operator: A Generative Model for Learning and Generalizing Across Multiple Probability Distributions
Xinyu Liao, Aoyang Qin, Jacob Seidman, Junqi Wang, Wei Wang, Paris Perdikaris
Subjects: Machine Learning (cs.LG)
[979] arXiv:2410.08557 [pdf, html, other]
Title: MUSO: Achieving Exact Machine Unlearning in Over-Parameterized Regimes
Ruikai Yang, Mingzhen He, Zhengbao He, Youmei Qiu, Xiaolin Huang
Comments: Accepted by Machine Learning Journal
Subjects: Machine Learning (cs.LG)
[980] arXiv:2410.08559 [pdf, html, other]
Title: Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture
Sehun Kim
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[981] arXiv:2410.08578 [pdf, html, other]
Title: Logarithmic Regret for Unconstrained Submodular Maximization Stochastic Bandit
Julien Zhou (Thoth, STATIFY), Pierre Gaillard (Thoth), Thibaud Rahier, Julyan Arbel (STATIFY)
Comments: Camera-ready version for ALT 2025
Subjects: Machine Learning (cs.LG); Combinatorics (math.CO); Optimization and Control (math.OC); Machine Learning (stat.ML)
[982] arXiv:2410.08589 [pdf, html, other]
Title: Retraining-Free Merging of Sparse MoE via Hierarchical Clustering
I-Chun Chen, Hsu-Shen Liu, Wei-Fang Sun, Chen-Hao Chao, Yen-Chang Hsu, Chun-Yi Lee
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG)
[983] arXiv:2410.08629 [pdf, html, other]
Title: Towards Cross-domain Few-shot Graph Anomaly Detection
Jiazhen Chen, Sichao Fu, Zhibin Zhang, Zheng Ma, Mingbin Feng, Tony S. Wirjanto, Qinmu Peng
Comments: Accepted by 24th IEEE International Conference on Data Mining (ICDM 2024)
Subjects: Machine Learning (cs.LG)
[984] arXiv:2410.08633 [pdf, html, other]
Title: Transformers Provably Solve Parity Efficiently with Chain of Thought
Juno Kim, Taiji Suzuki
Comments: ICLR 2025 Oral
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[985] arXiv:2410.08634 [pdf, html, other]
Title: GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning
Yubo Peng, Feibo Jiang, Li Dong, Kezhi Wang, Kun Yang
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT)
[986] arXiv:2410.08635 [pdf, html, other]
Title: Efficient line search for optimizing Area Under the ROC Curve in gradient descent
Jadon Fowler, Toby Dylan Hocking
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[987] arXiv:2410.08641 [pdf, html, other]
Title: Multi-Source Temporal Attention Network for Precipitation Nowcasting
Rafael Pablos Sarabia, Joachim Nyborg, Morten Birk, Jeppe Liborius Sjørup, Anders Lillevang Vesterholt, Ira Assent
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[988] arXiv:2410.08651 [pdf, html, other]
Title: Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation
Gleb Radchenko, Victoria Andrea Fill
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA)
[989] arXiv:2410.08654 [pdf, html, other]
Title: Finite Sample Complexity Analysis of Binary Segmentation
Toby Dylan Hocking
Subjects: Machine Learning (cs.LG); Computation (stat.CO)
[990] arXiv:2410.08659 [pdf, html, other]
Title: Carefully Structured Compression: Efficiently Managing StarCraft II Data
Bryce Ferenczi, Rhys Newbury, Michael Burke, Tom Drummond
Comments: 14 pages, 7 figures
Subjects: Machine Learning (cs.LG)
[991] arXiv:2410.08665 [pdf, html, other]
Title: DistDD: Distributed Data Distillation Aggregation through Gradient Matching
Peiran Wang, Haohan Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[992] arXiv:2410.08666 [pdf, html, other]
Title: DeltaDQ: Ultra-High Delta Compression for Fine-Tuned LLMs via Group-wise Dropout and Separate Quantization
Yanfeng Jiang, Zelan Yang, Bohua Chen, Shen Li, Yong Li, Tao Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[993] arXiv:2410.08681 [pdf, html, other]
Title: Efficiently Scanning and Resampling Spatio-Temporal Tasks with Irregular Observations
Bryce Ferenczi, Michael Burke, Tom Drummond
Comments: 11 pages, 10 figures
Subjects: Machine Learning (cs.LG)
[994] arXiv:2410.08687 [pdf, html, other]
Title: Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation
Hanieh Shojaei, Qianqian Zou, Max Mehltretter
Comments: Accepted for publication in the Proceedings of the European Conference on Computer Vision (ECCV) 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[995] arXiv:2410.08709 [pdf, html, other]
Title: Distillation of Discrete Diffusion through Dimensional Correlations
Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi, Hiromi Wakaki, Yuki Mitsufuji
Comments: 39 pages, ICML 2025 accepted
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[996] arXiv:2410.08710 [pdf, html, other]
Title: Preferential Normalizing Flows
Petrus Mikkola, Luigi Acerbi, Arto Klami
Comments: 29 pages, 18 figures, Accepted at NeurIPS2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[997] arXiv:2410.08734 [pdf, html, other]
Title: Gradients Stand-in for Defending Deep Leakage in Federated Learning
H. Yi, H. Ren, C. Hu, Y. Li, J. Deng, X. Xie
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[998] arXiv:2410.08751 [pdf, html, other]
Title: Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf, Marco Bagatella, Nico Gürtler, Jonas Frey, Georg Martius
Subjects: Machine Learning (cs.LG)
[999] arXiv:2410.08759 [pdf, html, other]
Title: Enhancing GNNs with Architecture-Agnostic Graph Transformations: A Systematic Analysis
Zhifei Li, Gerrit Großmann, Verena Wolf
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[1000] arXiv:2410.08760 [pdf, html, other]
Title: Unlocking FedNL: Self-Contained Compute-Optimized Implementation
Konstantin Burlachenko, Peter Richtárik
Comments: 55 pages, 12 figures, 12 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Mathematical Software (cs.MS); Performance (cs.PF); Optimization and Control (math.OC)