Machine Learning

Authors and titles for October 2024

Total of 4845 entries : 1-100 ... 4501-4600 4601-4700 4701-4800 4801-4845
[4801] arXiv:2410.23511 (cross-list from cs.CL) [pdf, html, other]
Title: Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh, Pradyot Prakash, Alexander Radovic, Akshay Shekher, Denis Savenkov
Comments: Accepted at NAACL 2025 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4802] arXiv:2410.23574 (cross-list from math.OC) [pdf, html, other]
Title: Online Convex Optimization with Memory and Limited Predictions
Lintao Ye, Zhengmiao Wang, Zhi-Wei Liu, Ming Chi, Xiaoling Wang, Housheng Su
Comments: 28 pages, 2 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[4803] arXiv:2410.23583 (cross-list from cs.CL) [pdf, html, other]
Title: BioNCERE: Non-Contrastive Enhancement For Relation Extraction In Biomedical Texts
Farshad Noravesh
Comments: 4 figures, 2 tables, 10 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[4804] arXiv:2410.23595 (cross-list from stat.ML) [pdf, html, other]
Title: Disentangling Interpretable Factors with Supervised Independent Subspace Principal Component Analysis
Jiayu Su, David A. Knowles, Raul Rabadan
Comments: 10 pages and 6 figures in the main text; To be published in the Proceedings of the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Genomics (q-bio.GN)
[4805] arXiv:2410.23602 (cross-list from stat.ML) [pdf, html, other]
Title: Linearized Wasserstein Barycenters: Synthesis, Analysis, Representational Capacity, and Applications
Matthew Werenski, Brendan Mallery, Shuchin Aeron, James M. Murphy
Comments: 40 pages, 6 figures. Minor revisions and proof fixes, accepted to AISTATS 2025
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[4806] arXiv:2410.23610 (cross-list from stat.ML) [pdf, other]
Title: Global Convergence in Training Large-Scale Transformers
Cheng Gao, Yuan Cao, Zihao Li, Yihan He, Mengdi Wang, Han Liu, Jason Matthew Klusowski, Jianqing Fan
Comments: to be published in 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[4807] arXiv:2410.23618 (cross-list from quant-ph) [pdf, html, other]
Title: Learning quantum states prepared by shallow circuits in polynomial time
Zeph Landau, Yunchao Liu
Comments: 19 pages
Journal-ref: In Proceedings of the 57th Annual ACM Symposium on Theory of Computing (STOC 2025)
Subjects: Quantum Physics (quant-ph); Computational Complexity (cs.CC); Machine Learning (cs.LG)
[4808] arXiv:2410.23725 (cross-list from cs.CY) [pdf, html, other]
Title: Artificial intelligence to improve clinical coding practice in Scandinavia: a crossover randomized controlled trial
Taridzo Chomutare, Therese Olsen Svenning, Miguel Ángel Tejedor Hernández, Phuong Dinh Ngo, Andrius Budrionis, Kaisa Markljung, Lill Irene Hind, Torbjørn Torsvik, Karl Øyvind Mikalsen, Aleksandar Babic, Hercules Dalianis
Comments: 13 pages, 4 figures, 4 tables
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[4809] arXiv:2410.23726 (cross-list from cs.AI) [pdf, html, other]
Title: Towards Reliable Alignment: Uncertainty-aware RLHF
Debangshu Banerjee, Aditya Gopalan
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4810] arXiv:2410.23743 (cross-list from cs.CL) [pdf, html, other]
Title: What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Ming Li, Yanhong Li, Tianyi Zhou
Comments: ACL2025 main, Camera-ready
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4811] arXiv:2410.23751 (cross-list from cs.CV) [pdf, html, other]
Title: EXACFS -- A CIL Method to mitigate Catastrophic Forgetting
S Balasubramanian, M Sai Subramaniam, Sai Sriram Talasu, Yedu Krishna P, Manepalli Pranav Phanindra Sai, Ravi Mukkamala, Darshan Gera
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4812] arXiv:2410.23771 (cross-list from cs.CL) [pdf, html, other]
Title: What is Wrong with Perplexity for Long-context Language Modeling?
Lizhe Fang, Yifei Wang, Zhaoyang Liu, Chenheng Zhang, Stefanie Jegelka, Jinyang Gao, Bolin Ding, Yisen Wang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[4813] arXiv:2410.23790 (cross-list from cs.LO) [pdf, html, other]
Title: Neural Model Checking
Mirco Giacobbe, Daniel Kroening, Abhinandan Pal, Michael Tautschnig
Comments: To appear in NeurIPS 2024
Subjects: Logic in Computer Science (cs.LO); Machine Learning (cs.LG)
[4814] arXiv:2410.23846 (cross-list from cs.DB) [pdf, other]
Title: Case ID detection based on time series data -- the mining use case
Edyta Brzychczy, Tomasz Pełech-Pilichowski, Ziemowit Dworakowski
Comments: Presented at EdbA'24 - Fifth International Workshop on Event Data and Behavioral Analytics, ICPM 2024, Kopenhagen, Denmark
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[4815] arXiv:2410.23854 (cross-list from cs.CV) [pdf, html, other]
Title: Reflecting Topology Consistency and Abnormality via Learnable Attentions for Airway Labeling
Chenyu Li, Minghui Zhang, Chuyan Zhang, Yun Gu
Journal-ref: International Journal of Computer Assisted Radiology and Surgery (2025): 1-9
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4816] arXiv:2410.23856 (cross-list from cs.CL) [pdf, html, other]
Title: Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?
Zhanke Zhou, Rong Tao, Jianing Zhu, Yiwen Luo, Zengmao Wang, Bo Han
Comments: Accepted by NeurIPS 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[4817] arXiv:2410.23870 (cross-list from cs.CR) [pdf, html, other]
Title: Noise as a Double-Edged Sword: Reinforcement Learning Exploits Randomized Defenses in Neural Networks
Steve Bakos, Pooria Madani, Heidar Davoudi
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[4818] arXiv:2410.23881 (cross-list from cs.DC) [pdf, html, other]
Title: DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge
Daniel May, Alessandro Tundo, Shashikant Ilager, Ivona Brandic
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Software Engineering (cs.SE)
[4819] arXiv:2410.23883 (cross-list from cs.CL) [pdf, html, other]
Title: 'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue
Rena Gao, Xuetong Wu, Siwen Luo, Caren Han, Feng Liu
Comments: 16 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[4820] arXiv:2410.23894 (cross-list from cs.CR) [pdf, html, other]
Title: Metamorphic Malware Evolution: The Potential and Peril of Large Language Models
Pooria Madani
Journal-ref: 2023 5th IEEE International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA)
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[4821] arXiv:2410.23903 (cross-list from cs.AI) [pdf, html, other]
Title: Neural Network Verification with PyRAT
Augustin Lemesle, Julien Lehmann, Tristan Le Gall
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4822] arXiv:2410.23912 (cross-list from cs.AI) [pdf, html, other]
Title: RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner
Fu-Chieh Chang, Yu-Ting Lee, Hui-Ying Shih, Yi Hsuan Tseng, Pei-Yuan Wu
Journal-ref: ICLR 2025 Workshop on Reasoning and Planning for Large Language Models
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4823] arXiv:2410.23918 (cross-list from cs.CL) [pdf, html, other]
Title: BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
Xinghao Wang, Pengyu Wang, Bo Wang, Dong Zhang, Yunhua Zhou, Xipeng Qiu
Comments: ICLR 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4824] arXiv:2410.23949 (cross-list from cs.NI) [pdf, html, other]
Title: Deep Learning Frameworks for Cognitive Radio Networks: Review and Open Research Challenges
Senthil Kumar Jagatheesaperumal, Ijaz Ahmad, Marko Höyhtyä, Suleman Khan, Andrei Gurtov
Comments: The article has been accepted for publication in "Journal of Network and Computer Applications" during October 2024
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[4825] arXiv:2410.23955 (cross-list from eess.AS) [pdf, html, other]
Title: An Empirical Analysis of Speech Self-Supervised Learning at Multiple Resolutions
Theo Clark, Benedetta Cevoli, Eloy de Jong, Timofey Abramski, Jamie Dougherty
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[4826] arXiv:2410.23969 (cross-list from quant-ph) [pdf, html, other]
Title: Interactive proofs for verifying (quantum) learning and testing
Matthias C. Caro, Jens Eisert, Marcel Hinsche, Marios Ioannou, Alexander Nietner, Ryan Sweke
Comments: 13 + 33 + 13 pages; 1 table; 2 figures; some added clarifications in Sec 1
Subjects: Quantum Physics (quant-ph); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[4827] arXiv:2410.23971 (cross-list from physics.flu-dyn) [pdf, html, other]
Title: Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models
Tianyi Li, Luca Biferale, Fabio Bonaccorso, Michele Buzzicotti, Luca Centurioni
Subjects: Fluid Dynamics (physics.flu-dyn); Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD); Atmospheric and Oceanic Physics (physics.ao-ph)
[4828] arXiv:2410.24006 (cross-list from cs.CV) [pdf, html, other]
Title: DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination
Jia Fu, Xiao Zhang, Sepideh Pashami, Fatemeh Rahimian, Anders Holst
Comments: Accepted to 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4829] arXiv:2410.24022 (cross-list from q-bio.QM) [pdf, html, other]
Title: SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation
Liang He, Peiran Jin, Yaosen Min, Shufang Xie, Lijun Wu, Tao Qin, Xiaozhuan Liang, Kaiyuan Gao, Yuliang Jiang, Tie-Yan Liu
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[4830] arXiv:2410.24029 (cross-list from cs.CL) [pdf, html, other]
Title: Joint Training for Selective Prediction
Zhaohui Li, Rebecca J. Passonneau
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[4831] arXiv:2410.24035 (cross-list from cs.RO) [pdf, html, other]
Title: State- and context-dependent robotic manipulation and grasping via uncertainty-aware imitation learning
Tim R. Winter, Ashok M. Sundaram, Werner Friedl, Maximo A. Roa, Freek Stulp, João Silvério
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4832] arXiv:2410.24046 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning with HM-VGG: AI Strategies for Multi-modal Image Analysis
Junliang Du, Yiru Cang, Tong Zhou, Jiacheng Hu, Weijie He
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4833] arXiv:2410.24052 (cross-list from math.OC) [pdf, html, other]
Title: Attention is All You Need to Optimize Wind Farm Operations and Maintenance
Iman Kazemian, Murat Yildirim, Paritosh Ramanan
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[4834] arXiv:2410.24054 (cross-list from stat.ML) [pdf, html, other]
Title: EigenVI: score-based variational inference with orthogonal function expansions
Diana Cai, Chirag Modi, Charles C. Margossian, Robert M. Gower, David M. Blei, Lawrence K. Saul
Comments: 25 pages, 9 figures. Advances in Neural Information Processing Systems (NeurIPS), 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[4835] arXiv:2410.24058 (cross-list from quant-ph) [pdf, html, other]
Title: Natural gradient and parameter estimation for quantum Boltzmann machines
Dhrumil Patel, Mark M. Wilde
Comments: 23 pages, 4 figures
Subjects: Quantum Physics (quant-ph); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Optimization and Control (math.OC)
[4836] arXiv:2410.24089 (cross-list from stat.ML) [pdf, other]
Title: Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Joongkyu Lee, Min-hwan Oh
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[4837] arXiv:2410.24091 (cross-list from cs.RO) [pdf, html, other]
Title: 3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing
Binghao Huang, Yixuan Wang, Xinyi Yang, Yiyue Luo, Yunzhu Li
Comments: Accepted at Conference on Robot Learning (CoRL) 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4838] arXiv:2410.24104 (cross-list from cs.DS) [pdf, other]
Title: Clustering to Minimize Cluster-Aware Norm Objectives
Martin G. Herold, Evangelos Kipouridis, Joachim Spoerhase
Comments: accepted at SODA 2025
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[4839] arXiv:2410.24116 (cross-list from cs.CV) [pdf, html, other]
Title: AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization
Amir Kazemi, Qurat ul ain Fatima, Volodymyr Kindratenko, Christopher Tessum
Comments: 19 pages, 4 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4840] arXiv:2410.24117 (cross-list from cs.SE) [pdf, html, other]
Title: AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and Validation
Ali Reza Ibrahimzada, Kaiyao Ke, Mrigank Pawagi, Muhammad Salman Abid, Rangeet Pan, Saurabh Sinha, Reyhaneh Jabbarvand
Comments: Published in FSE 2025
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
[4841] arXiv:2410.24145 (cross-list from stat.ML) [pdf, html, other]
Title: Projected random forests and conformal prediction of circular data
Paulo C. Marques F., Rinaldo Artes, Helton Graziadei
Comments: 7 pages; 4 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[4842] arXiv:2410.24177 (cross-list from eess.AS) [pdf, html, other]
Title: DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models
Heng-Jui Chang, Hongyu Gong, Changhan Wang, James Glass, Yu-An Chung
Comments: Preprint
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[4843] arXiv:2410.24185 (cross-list from cs.RO) [pdf, html, other]
Title: DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning
Zhenyu Jiang, Yuqi Xie, Kevin Lin, Zhenjia Xu, Weikang Wan, Ajay Mandlekar, Linxi Fan, Yuke Zhu
Comments: ICRA 2025. Project website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4844] arXiv:2410.24198 (cross-list from cs.CL) [pdf, html, other]
Title: SelfCodeAlign: Self-Alignment for Code Generation
Yuxiang Wei, Federico Cassano, Jiawei Liu, Yifeng Ding, Naman Jain, Zachary Mueller, Harm de Vries, Leandro von Werra, Arjun Guha, Lingming Zhang
Comments: Accepted to NeurIPS 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[4845] arXiv:2410.24218 (cross-list from cs.CL) [pdf, html, other]
Title: Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai
Comments: EMNLP 2024 Main. Project website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)