Machine Learning

Authors and titles for October 2024

Total of 4845 entries : 1-100 101-200 201-300 301-400 401-500 501-600 ... 4801-4845

Showing up to 100 entries per page: fewer | more | all

[201] arXiv:2410.02064 [pdf, html, other]: Title: Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct

Christopher Ackerman, Nina Panickssery

Comments: 10 pages, 13 figs, 2 tables, accepted as conference paper to ICLR 2025

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[202] arXiv:2410.02068 [pdf, html, other]: Title: Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits

Jiabin Lin, Shana Moothedath, Namrata Vaswani

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[203] arXiv:2410.02070 [pdf, html, other]: Title: MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series Forecasting

Aitian Ma, Dongsheng Luo, Mo Sha

Subjects: Machine Learning (cs.LG)
[204] arXiv:2410.02077 [pdf, html, other]: Title: Kolmogorov-Arnold Network Autoencoders

Mohammadamin Moradi, Shirin Panahi, Erik Bollt, Ying-Cheng Lai

Comments: 12 pages, 5 figures, 1 table

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[205] arXiv:2410.02079 [pdf, html, other]: Title: Deep Generative Modeling for Identification of Noisy, Non-Stationary Dynamical Systems

Doris Voina, Steven Brunton, J. Nathan Kutz

Comments: 19 pages + 7 figures + Supplementary Materials (and supplementary figures)

Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[206] arXiv:2410.02081 [pdf, html, other]: Title: MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters

Aitian Ma, Dongsheng Luo, Mo Sha

Subjects: Machine Learning (cs.LG)
[207] arXiv:2410.02082 [pdf, html, other]: Title: FARM: Functional Group-Aware Representations for Small Molecules

Thao Nguyen, Kuan-Hao Huang, Ge Liu, Martin D. Burke, Ying Diao, Heng Ji

Comments: Preprint. The code is available at: this https URL

Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[208] arXiv:2410.02085 [pdf, html, other]: Title: Multi-Omic and Quantum Machine Learning Integration for Lung Subtypes Classification

Mandeep Kaur Saggi, Amandeep Singh Bhatia, Mensah Isaiah, Humaira Gowher, Sabre Kais

Comments: 27 pages, 17 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantum Physics (quant-ph)
[209] arXiv:2410.02086 [pdf, html, other]: Title: Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations

Minoh Jeong, Min Namgung, Zae Myung Kim, Dongyeop Kang, Yao-Yi Chiang, Alfred Hero

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[210] arXiv:2410.02087 [pdf, html, other]: Title: HyperBrain: Anomaly Detection for Temporal Hypergraph Brain Networks

Sadaf Sadeghian, Xiaoxiao Li, Margo Seltzer

Subjects: Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[211] arXiv:2410.02113 [pdf, html, other]: Title: Mamba Neural Operator: Who Wins? Transformers vs. State-Space Models for PDEs

Chun-Wun Cheng, Jiahao Huang, Yi Zhang, Guang Yang, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[212] arXiv:2410.02116 [pdf, html, other]: Title: Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks

Siddharth Joshi, Jiayi Ni, Baharan Mirzasoleiman

Comments: ICLR 2025. Code at this https URL

Subjects: Machine Learning (cs.LG)
[213] arXiv:2410.02117 [pdf, html, other]: Title: Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices

Andres Potapczynski, Shikai Qiu, Marc Finzi, Christopher Ferri, Zixi Chen, Micah Goldblum, Bayan Bruss, Christopher De Sa, Andrew Gordon Wilson

Comments: NeurIPS 2024. Code available at this https URL

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[214] arXiv:2410.02128 [pdf, html, other]: Title: Breaking the mold: The challenge of large scale MARL specialization

Stefan Juang, Hugh Cao, Arielle Zhou, Ruochen Liu, Nevin L. Zhang, Elvis Liu

Comments: 19 pages

Subjects: Machine Learning (cs.LG)
[215] arXiv:2410.02131 [pdf, html, other]: Title: Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners

Hung Manh Pham, Aaqib Saeed, Dong Ma

Comments: Accepted at ICML 2025

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[216] arXiv:2410.02132 [pdf, html, other]: Title: Nonuniform random feature models using derivative information

Konstantin Pieper, Zezhong Zhang, Guannan Zhang

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[217] arXiv:2410.02133 [pdf, html, other]: Title: TrajGPT: Irregular Time-Series Representation Learning for Health Trajectory Analysis

Ziyang Song, Qingcheng Lu, He Zhu, David Buckeridge, Yue Li

Comments: 9 pages

Subjects: Machine Learning (cs.LG)
[218] arXiv:2410.02136 [pdf, html, other]: Title: Disentangled Representation Learning for Parametric Partial Differential Equations

Ning Liu, Lu Zhang, Tian Gao, Yue Yu

Subjects: Machine Learning (cs.LG)
[219] arXiv:2410.02140 [pdf, other]: Title: A Formal Framework for Understanding Length Generalization in Transformers

Xinting Huang, Andy Yang, Satwik Bhattamishra, Yash Sarrof, Andreas Krebs, Hattie Zhou, Preetum Nakkiran, Michael Hahn

Comments: 85 pages, 9 figures, 11 tables. Accepted for publication at ICLR 2025

Subjects: Machine Learning (cs.LG)
[220] arXiv:2410.02143 [pdf, html, other]: Title: Plug-and-Play Controllable Generation for Discrete Masked Models

Wei Guo, Yuchen Zhu, Molei Tao, Yongxin Chen

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[221] arXiv:2410.02145 [pdf, html, other]: Title: Active Learning of Deep Neural Networks via Gradient-Free Cutting Planes

Erica Zhang, Fangzhao Zhang, Mert Pilanci

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[222] arXiv:2410.02147 [pdf, other]: Title: Efficient Source-Free Time-Series Adaptation via Parameter Subspace Disentanglement

Gaurav Patel, Christopher Sandino, Behrooz Mahasseni, Ellen L Zippi, Erdrin Azemi, Ali Moin, Juri Minxha

Comments: Accepted at ICLR 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[223] arXiv:2410.02151 [pdf, html, other]: Title: Quantitative Approximation for Neural Operators in Nonlinear Parabolic Equations

Takashi Furuya, Koichi Taniguchi, Satoshi Okuda

Comments: 31 pages

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[224] arXiv:2410.02158 [pdf, html, other]: Title: ClassContrast: Bridging the Spatial and Contextual Gaps for Node Representations

Md Joshem Uddin, Astrit Tola, Varin Sikand, Cuneyt Gurcan Akcora, Baris Coskunuzer

Comments: 16 pages, 5 figures

Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Machine Learning (stat.ML)
[225] arXiv:2410.02159 [pdf, html, other]: Title: Mitigating Memorization In Language Models

Mansi Sakarvadia, Aswathy Ajith, Arham Khan, Nathaniel Hudson, Caleb Geniesse, Kyle Chard, Yaoqing Yang, Ian Foster, Michael W. Mahoney

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[226] arXiv:2410.02164 [pdf, other]: Title: Universality in Transfer Learning for Linear Models

Reza Ghane, Danil Akhtiamov, Babak Hassibi

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[227] arXiv:2410.02167 [pdf, html, other]: Title: Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis

Hongkang Li, Songtao Lu, Pin-Yu Chen, Xiaodong Cui, Meng Wang

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[228] arXiv:2410.02168 [pdf, html, other]: Title: Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting

Siyang Li, Yize Chen, Hui Xiong

Subjects: Machine Learning (cs.LG)
[229] arXiv:2410.02172 [pdf, html, other]: Title: Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation

Shreyas Chaudhari, Ameet Deshpande, Bruno Castro da Silva, Philip S. Thomas

Comments: Accepted at the Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[230] arXiv:2410.02173 [pdf, html, other]: Title: Efficiently Deploying LLMs with Controlled Risk

Michael J. Zellinger, Matt Thomson

Comments: 10 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[231] arXiv:2410.02176 [pdf, html, other]: Title: Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks

Ke Chen, Chugang Yi, Haizhao Yang

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[232] arXiv:2410.02184 [pdf, html, other]: Title: CodeJudge: Evaluating Code Generation with Large Language Models

Weixi Tong, Tianyi Zhang

Comments: Accepted to EMNLP 2024 (Main, Long Paper)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Software Engineering (cs.SE)
[233] arXiv:2410.02195 [pdf, html, other]: Title: BACKTIME: Backdoor Attacks on Multivariate Time Series Forecasting

Xiao Lin, Zhining Liu, Dongqi Fu, Ruizhong Qiu, Hanghang Tong

Comments: 23 pages. Neurips 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[234] arXiv:2410.02198 [pdf, html, other]: Title: G2T-LLM: Graph-to-Tree Text Encoding for Molecule Generation with Fine-Tuned Large Language Models

Zhaoning Yu, Xiangyang Xu, Hongyang Gao

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[235] arXiv:2410.02199 [pdf, html, other]: Title: Deep Koopman-layered Model with Universal Property Based on Toeplitz Matrices

Yuka Hashimoto, Tomoharu Iwata

Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS); Functional Analysis (math.FA); Machine Learning (stat.ML)
[236] arXiv:2410.02200 [pdf, html, other]: Title: Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts

Minh Le, Chau Nguyen, Huy Nguyen, Quyen Tran, Trung Le, Nhat Ho

Comments: Accepted to ICLR 2025. 42 pages, 8 tables, 3 figures

Subjects: Machine Learning (cs.LG)
[237] arXiv:2410.02217 [pdf, html, other]: Title: Stochastic Sampling from Deterministic Flow Models

Saurabh Singh, Ian Fischer

Comments: Submitted to ICLR 2025

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[238] arXiv:2410.02226 [pdf, html, other]: Title: Doubly Optimal Policy Evaluation for Reinforcement Learning

Shuze Daniel Liu, Claire Chen, Shangtong Zhang

Subjects: Machine Learning (cs.LG)
[239] arXiv:2410.02230 [pdf, html, other]: Title: Mitigating Downstream Model Risks via Model Provenance

Keyu Wang, Abdullah Norozi Iranzad, Scott Schaffter, Meg Risdal, Doina Precup, Jonathan Lebensold

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[240] arXiv:2410.02236 [pdf, html, other]: Title: C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front

Ruohong Liu, Yuxin Pan, Linjie Xu, Lei Song, Jiang Bian, Pengcheng You, Yize Chen

Comments: Published as a conference paper at ICLR 2025. Code available at this https URL

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[241] arXiv:2410.02242 [pdf, html, other]: Title: Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis

Hyunwoo Lee, Hayoung Choi, Hyunju Kim

Comments: ICLR 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[242] arXiv:2410.02246 [pdf, html, other]: Title: PFGuard: A Generative Framework with Privacy and Fairness Safeguards

Soyeon Kim, Yuji Roh, Geon Heo, Steven Euijong Whang

Comments: In Proceedings of the 13th International Conference on Learning Representations (ICLR), 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[243] arXiv:2410.02247 [pdf, html, other]: Title: Theoretical Insights into Fine-Tuning Attention Mechanism: Generalization and Optimization

Xinhao Yao, Hongjin Qian, Xiaolin Hu, Gengze Xu, Wei Liu, Jian Luan, Bin Wang, Yong Liu

Comments: IJCAI 2025

Subjects: Machine Learning (cs.LG)
[244] arXiv:2410.02260 [pdf, html, other]: Title: FedScalar: A Communication efficient Federated Learning

M. Rostami, S. S. Kia

Subjects: Machine Learning (cs.LG)
[245] arXiv:2410.02267 [pdf, other]: Title: Unsupervised Meta-Learning via Dynamic Head and Heterogeneous Task Construction for Few-Shot Classification

Yunchuan Guan, Yu Liu, Ketong Liu, Ke Zhou, Zhiqi Shen

Subjects: Machine Learning (cs.LG)
[246] arXiv:2410.02268 [pdf, html, other]: Title: Structural-Entropy-Based Sample Selection for Efficient and Effective Learning

Tianchi Xie, Jiangning Zhu, Guozu Ma, Minzhi Lin, Wei Chen, Weikai Yang, Shixia Liu

Comments: Published as a conference paper at ICLR 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[247] arXiv:2410.02269 [pdf, html, other]: Title: Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback

Francesco Emanuele Stradi, Anna Lunghi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

Subjects: Machine Learning (cs.LG)
[248] arXiv:2410.02273 [pdf, other]: Title: Perfect Counterfactuals in Imperfect Worlds: Modelling Noisy Implementation of Actions in Sequential Algorithmic Recourse

Yueqing Xuan, Kacper Sokol, Mark Sanderson, Jeffrey Chan

Comments: Accepted to ECML-PKDD 2025 Journal Track

Journal-ref: Machine Learning 114, no. 8 (2025): 187

Subjects: Machine Learning (cs.LG)
[249] arXiv:2410.02275 [pdf, html, other]: Title: Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

Comments: arXiv admin note: text overlap with arXiv:2405.14372

Subjects: Machine Learning (cs.LG)
[250] arXiv:2410.02290 [pdf, html, other]: Title: Density based Spatial Clustering of Lines via Probabilistic Generation of Neighbourhood

Akanksha Das, Malay Bhattacharyya

Subjects: Machine Learning (cs.LG)
[251] arXiv:2410.02293 [pdf, html, other]: Title: Efficient Second-Order Neural Network Optimization via Adaptive Trust Region Methods

James Vo

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[252] arXiv:2410.02321 [pdf, html, other]: Title: Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis

Zikun Zhang, Zixiang Chen, Quanquan Gu

Comments: 26 pages, 1 figure

Journal-ref: The Thirteenth International Conference on Learning Representations, 2025

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[253] arXiv:2410.02324 [pdf, html, other]: Title: Automated Tone Transcription and Clustering with Tone2Vec

Yi Yang, Yiming Wang, ZhiQiang Tang, Jiahong Yuan

Comments: Accepted by EMNLP 2024 Findings

Subjects: Machine Learning (cs.LG)
[254] arXiv:2410.02335 [pdf, html, other]: Title: Data Optimisation of Machine Learning Models for Smart Irrigation in Urban Parks

Nasser Ghadiri, Bahman Javadi, Oliver Obst, Sebastian Pfautsch

Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[255] arXiv:2410.02344 [pdf, html, other]: Title: EntryPrune: Neural Network Feature Selection using First Impressions

Felix Zimmer, Patrik Okanovic, Torsten Hoefler

Subjects: Machine Learning (cs.LG)
[256] arXiv:2410.02348 [pdf, html, other]: Title: Simplicity bias and optimization threshold in two-layer ReLU networks

Etienne Boursier, Nicolas Flammarion

Comments: ICML camera ready version

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[257] arXiv:2410.02367 [pdf, other]: Title: SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Jintao Zhang, Jia Wei, Haofeng Huang, Pengle Zhang, Jun Zhu, Jianfei Chen

Comments: @inproceedings{zhang2025sageattention, title={SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration}, author={Zhang, Jintao and Wei, Jia and Zhang, Pengle and Zhu, Jun and Chen, Jianfei}, booktitle={International Conference on Learning Representations (ICLR)}, year={2025} }

Journal-ref: The Thirteenth International Conference on Learning Representations (ICLR 2025)

Subjects: Machine Learning (cs.LG)
[258] arXiv:2410.02384 [pdf, html, other]: Title: Unveiling AI's Blind Spots: An Oracle for In-Domain, Out-of-Domain, and Adversarial Errors

Shuangpeng Han, Mengmi Zhang

Subjects: Machine Learning (cs.LG)
[259] arXiv:2410.02387 [pdf, html, other]: Title: BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization

Gustav Wagner Zakarias, Lars Kai Hansen, Zheng-Hua Tan

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[260] arXiv:2410.02392 [pdf, html, other]: Title: MANTRA: The Manifold Triangulations Assemblage

Rubén Ballester, Ernst Röell, Daniel Bīn Schmid, Mathieu Alain, Sergio Escalera, Carles Casacuberta, Bastian Rieck

Comments: Accepted at ICLR 2025 (this https URL)

Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT)
[261] arXiv:2410.02394 [pdf, html, other]: Title: Online Multi-Label Classification under Noisy and Changing Label Distribution

Yizhang Zou, Xuegang Hu, Peipei Li, Jun Hu, You Wu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[262] arXiv:2410.02400 [pdf, html, other]: Title: An Online Feasible Point Method for Benign Generalized Nash Equilibrium Problems

Sarah Sachs, Hedi Hadiji, Tim van Erven, Mathias Staudigl

Subjects: Machine Learning (cs.LG)
[263] arXiv:2410.02416 [pdf, other]: Title: Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models

Seyedmorteza Sadat, Otmar Hilliges, Romann M. Weber

Comments: Published as a conference paper at ICLR 2025

Journal-ref: The Thirteenth International Conference on Learning Representations (ICLR 2025)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2410.02438 [pdf, html, other]: Title: Learning K-U-Net with constant complexity: An Application to time series forecasting

Jiang You, Arben Cela, René Natowicz, Jacob Ouanounou, Patrick Siarry

Subjects: Machine Learning (cs.LG)
[265] arXiv:2410.02450 [pdf, html, other]: Title: Personalized Federated Learning for Generative AI-Assisted Semantic Communications

Yubo Peng, Feibo Jiang, Li Dong, Kezhi Wang, Kun Yang

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[266] arXiv:2410.02467 [pdf, html, other]: Title: SIDE: Surrogate Conditional Data Extraction from Diffusion Models

Yunhao Chen, Shujie Wang, Difan Zou, Xingjun Ma

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[267] arXiv:2410.02472 [pdf, html, other]: Title: Meta-Models: An Architecture for Decoding LLM Behaviors Through Interpreted Embeddings and Natural Language

Anthony Costarelli, Mat Allen, Severin Field

Comments: 11 pages, 2 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[268] arXiv:2410.02476 [pdf, html, other]: Title: Online Convex Optimization with a Separation Oracle

Zakaria Mhammedi

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[269] arXiv:2410.02490 [pdf, html, other]: Title: Stochastic variance-reduced Gaussian variational inference on the Bures-Wasserstein manifold

Hoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams, Marcelo Hartmann, Arto Klami

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[270] arXiv:2410.02496 [pdf, html, other]: Title: Efficient learning of differential network in multi-source non-paranormal graphical models

Mojtaba Nikahd, Seyed Abolfazl Motahari

Subjects: Machine Learning (cs.LG)
[271] arXiv:2410.02498 [pdf, html, other]: Title: Dynamic Gradient Alignment for Online Data Mixing

Simin Fan, David Grangier, Pierre Ablin

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[272] arXiv:2410.02512 [pdf, html, other]: Title: SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation

Mucong Ding, Bang An, Yuancheng Xu, Anirudh Satheesh, Furong Huang

Comments: ICLR 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[273] arXiv:2410.02513 [pdf, html, other]: Title: Minimax Group Fairness in Strategic Classification

Emily Diana, Saeed Sharifi-Malvajerdi, Ali Vakilian

Subjects: Machine Learning (cs.LG)
[274] arXiv:2410.02519 [pdf, html, other]: Title: Semantic-Guided RL for Interpretable Feature Engineering

Mohamed Bouadi, Arta Alavi, Salima Benbernou, Mourad Ouziri

Comments: arXiv admin note: substantial text overlap with arXiv:2406.00544

Subjects: Machine Learning (cs.LG)
[275] arXiv:2410.02541 [pdf, html, other]: Title: Fair Decentralized Learning

Sayan Biswas, Anne-Marie Kermarrec, Rishi Sharma, Thibaud Trinca, Martijn de Vos

Comments: To appear in the proceedings of "3rd IEEE Conference on Secure and Trustworthy Machine Learning" (SatML'25)

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[276] arXiv:2410.02551 [pdf, html, other]: Title: ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration

Zixiang Wang, Yinghao Zhu, Huiya Zhao, Xiaochen Zheng, Dehao Sui, Tianlong Wang, Wen Tang, Yasha Wang, Ewen Harrison, Chengwei Pan, Junyi Gao, Liantao Ma

Comments: ACM TheWebConf 2025 Conference (WWW 2025) Research Track

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[277] arXiv:2410.02566 [pdf, html, other]: Title: Deep Learning-Based Prediction of Suspension Dynamics Performance in Multi-Axle Vehicles

Kai Chun Lin, Bo-Yi Lin

Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Numerical Analysis (math.NA)
[278] arXiv:2410.02581 [pdf, html, other]: Title: Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance

Joshua McClellan, Naveed Haghani, John Winder, Furong Huang, Pratap Tokekar

Comments: accepted as a poster at NeurIPS 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[279] arXiv:2410.02596 [pdf, html, other]: Title: Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Rui Hu, Yifan Zhang, Zhuoran Li, Longbo Huang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[280] arXiv:2410.02597 [pdf, html, other]: Title: HAINAN: Fast and Accurate Transducer for Hybrid-Autoregressive ASR

Hainan Xu, Travis M. Bartley, Vladimir Bataev, Boris Ginsburg

Subjects: Machine Learning (cs.LG)
[281] arXiv:2410.02601 [pdf, html, other]: Title: Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting

Sergei Kholkin, Grigoriy Ksenofontov, David Li, Nikita Kornilov, Nikita Gushchin, Alexandra Suvorikova, Alexey Kroshnin, Evgeny Burnaev, Alexander Korotin

Subjects: Machine Learning (cs.LG)
[282] arXiv:2410.02605 [pdf, html, other]: Title: A Prospect-Theoretic Policy Gradient Algorithm for Behavioral Alignment in Reinforcement Learning

Olivier Lepel, Anas Barakat

Comments: revised version

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[283] arXiv:2410.02615 [pdf, html, other]: Title: EXGRA-MED: Extended Context Graph Alignment for Medical Vision- Language Models

Duy M. H. Nguyen, Nghiem T. Diep, Trung Q. Nguyen, Hoang-Bao Le, Tai Nguyen, Tien Nguyen, TrungTin Nguyen, Nhat Ho, Pengtao Xie, Roger Wattenhofer, James Zou, Daniel Sonntag, Mathias Niepert

Comments: Version 2

Subjects: Machine Learning (cs.LG)
[284] arXiv:2410.02622 [pdf, html, other]: Title: Diss-l-ECT: Dissecting Graph Data with Local Euler Characteristic Transforms

Julius von Rohrscheidt, Bastian Rieck

Comments: Accepted at ICML 2025

Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT)
[285] arXiv:2410.02628 [pdf, html, other]: Title: Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization

Mikhail Persiianov, Arip Asadulaev, Nikita Andreev, Nikita Starodubcev, Dmitry Baranchuk, Anastasis Kratsios, Evgeny Burnaev, Alexander Korotin

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[286] arXiv:2410.02639 [pdf, html, other]: Title: Labor Migration Modeling through Large-scale Job Query Data

Zhuoning Guo, Le Zhang, Hengshu Zhu, Weijia Zhang, Hui Xiong, Hao Liu

Subjects: Machine Learning (cs.LG)
[287] arXiv:2410.02647 [pdf, html, other]: Title: Immunogenicity Prediction with Dual Attention Enables Vaccine Target Selection

Song Li, Yang Tan, Song Ke, Liang Hong, Bingxin Zhou

Comments: 20 pages, 17 tables, 6 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Biomolecules (q-bio.BM)
[288] arXiv:2410.02651 [pdf, html, other]: Title: CAX: Cellular Automata Accelerated in JAX

Maxence Faldor, Antoine Cully

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[289] arXiv:2410.02654 [pdf, html, other]: Title: Deconstructing Recurrence, Attention, and Gating: Investigating the transferability of Transformers and Gated Recurrent Neural Networks in forecasting of dynamical systems

Hunter S. Heidenreich, Pantelis R. Vlachas, Petros Koumoutsakos

Subjects: Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD); Computational Physics (physics.comp-ph)
[290] arXiv:2410.02656 [pdf, html, other]: Title: Scalable Simulation-free Entropic Unbalanced Optimal Transport

Jaemoo Choi, Jaewoong Choi

Comments: 26 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[291] arXiv:2410.02666 [pdf, html, other]: Title: AlphaIntegrator: Transformer Action Search for Symbolic Integration Proofs

Mert Ünsal, Timon Gehr, Martin Vechev

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Symbolic Computation (cs.SC)
[292] arXiv:2410.02667 [pdf, html, other]: Title: GUD: Generation with Unified Diffusion

Mathis Gerdes, Max Welling, Miranda C. N. Cheng

Comments: 11 pages, 8 figures

Subjects: Machine Learning (cs.LG); High Energy Physics - Theory (hep-th); Machine Learning (stat.ML)
[293] arXiv:2410.02675 [pdf, html, other]: Title: FAN: Fourier Analysis Networks

Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, Jinliang Deng, Jing Su, Jun Zhang, Jingjing Xu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[294] arXiv:2410.02681 [pdf, html, other]: Title: Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models

Shuoyuan Wang, Yixuan Li, Hongxin Wei

Comments: Accepted by ICML 2025

Subjects: Machine Learning (cs.LG)
[295] arXiv:2410.02698 [pdf, html, other]: Title: Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups

Zakhar Shumaylov, Peter Zaika, James Rowbottom, Ferdia Sherry, Melanie Weber, Carola-Bibiane Schönlieb

Comments: 44 pages; accepted at ICLR 2025

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[296] arXiv:2410.02711 [pdf, html, other]: Title: NETS: A Non-Equilibrium Transport Sampler

Michael S. Albergo, Eric Vanden-Eijnden

Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); High Energy Physics - Lattice (hep-lat)
[297] arXiv:2410.02718 [pdf, html, other]: Title: SynthFormer: Equivariant Pharmacophore-based Generation of Synthesizable Molecules for Ligand-Based Drug Design

Zygimantas Jocys, Zhanxing Zhu, Henriette M.G. Willems, Katayoun Farrahi

Subjects: Machine Learning (cs.LG)
[298] arXiv:2410.02733 [pdf, html, other]: Title: Data Similarity-Based One-Shot Clustering for Multi-Task Hierarchical Federated Learning

Abdulmoneam Ali, Ahmed Arafa

Comments: To appear in Asilomar 2024

Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)
[299] arXiv:2410.02735 [pdf, html, other]: Title: OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?

Liangze Jiang, Damien Teney

Comments: ICML 2025

Subjects: Machine Learning (cs.LG)
[300] arXiv:2410.02749 [pdf, html, other]: Title: Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

Comments: ICLR 2025

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

Total of 4845 entries : 1-100 101-200 201-300 301-400 401-500 501-600 ... 4801-4845

Showing up to 100 entries per page: fewer | more | all