Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.LG

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for October 2024

Total of 4845 entries : 1-100 101-200 201-300 301-400 401-500 501-600 ... 4801-4845
Showing up to 100 entries per page: fewer | more | all
[201] arXiv:2410.02064 [pdf, html, other]
Title: Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
Christopher Ackerman, Nina Panickssery
Comments: 10 pages, 13 figs, 2 tables, accepted as conference paper to ICLR 2025
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[202] arXiv:2410.02068 [pdf, html, other]
Title: Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits
Jiabin Lin, Shana Moothedath, Namrata Vaswani
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[203] arXiv:2410.02070 [pdf, html, other]
Title: MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series Forecasting
Aitian Ma, Dongsheng Luo, Mo Sha
Subjects: Machine Learning (cs.LG)
[204] arXiv:2410.02077 [pdf, html, other]
Title: Kolmogorov-Arnold Network Autoencoders
Mohammadamin Moradi, Shirin Panahi, Erik Bollt, Ying-Cheng Lai
Comments: 12 pages, 5 figures, 1 table
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[205] arXiv:2410.02079 [pdf, html, other]
Title: Deep Generative Modeling for Identification of Noisy, Non-Stationary Dynamical Systems
Doris Voina, Steven Brunton, J. Nathan Kutz
Comments: 19 pages + 7 figures + Supplementary Materials (and supplementary figures)
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[206] arXiv:2410.02081 [pdf, html, other]
Title: MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters
Aitian Ma, Dongsheng Luo, Mo Sha
Subjects: Machine Learning (cs.LG)
[207] arXiv:2410.02082 [pdf, html, other]
Title: FARM: Functional Group-Aware Representations for Small Molecules
Thao Nguyen, Kuan-Hao Huang, Ge Liu, Martin D. Burke, Ying Diao, Heng Ji
Comments: Preprint. The code is available at: this https URL
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[208] arXiv:2410.02085 [pdf, html, other]
Title: Multi-Omic and Quantum Machine Learning Integration for Lung Subtypes Classification
Mandeep Kaur Saggi, Amandeep Singh Bhatia, Mensah Isaiah, Humaira Gowher, Sabre Kais
Comments: 27 pages, 17 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantum Physics (quant-ph)
[209] arXiv:2410.02086 [pdf, html, other]
Title: Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Minoh Jeong, Min Namgung, Zae Myung Kim, Dongyeop Kang, Yao-Yi Chiang, Alfred Hero
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[210] arXiv:2410.02087 [pdf, html, other]
Title: HyperBrain: Anomaly Detection for Temporal Hypergraph Brain Networks
Sadaf Sadeghian, Xiaoxiao Li, Margo Seltzer
Subjects: Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[211] arXiv:2410.02113 [pdf, html, other]
Title: Mamba Neural Operator: Who Wins? Transformers vs. State-Space Models for PDEs
Chun-Wun Cheng, Jiahao Huang, Yi Zhang, Guang Yang, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[212] arXiv:2410.02116 [pdf, html, other]
Title: Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks
Siddharth Joshi, Jiayi Ni, Baharan Mirzasoleiman
Comments: ICLR 2025. Code at this https URL
Subjects: Machine Learning (cs.LG)
[213] arXiv:2410.02117 [pdf, html, other]
Title: Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices
Andres Potapczynski, Shikai Qiu, Marc Finzi, Christopher Ferri, Zixi Chen, Micah Goldblum, Bayan Bruss, Christopher De Sa, Andrew Gordon Wilson
Comments: NeurIPS 2024. Code available at this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[214] arXiv:2410.02128 [pdf, html, other]
Title: Breaking the mold: The challenge of large scale MARL specialization
Stefan Juang, Hugh Cao, Arielle Zhou, Ruochen Liu, Nevin L. Zhang, Elvis Liu
Comments: 19 pages
Subjects: Machine Learning (cs.LG)
[215] arXiv:2410.02131 [pdf, html, other]
Title: Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners
Hung Manh Pham, Aaqib Saeed, Dong Ma
Comments: Accepted at ICML 2025
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[216] arXiv:2410.02132 [pdf, html, other]
Title: Nonuniform random feature models using derivative information
Konstantin Pieper, Zezhong Zhang, Guannan Zhang
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[217] arXiv:2410.02133 [pdf, html, other]
Title: TrajGPT: Irregular Time-Series Representation Learning for Health Trajectory Analysis
Ziyang Song, Qingcheng Lu, He Zhu, David Buckeridge, Yue Li
Comments: 9 pages
Subjects: Machine Learning (cs.LG)
[218] arXiv:2410.02136 [pdf, html, other]
Title: Disentangled Representation Learning for Parametric Partial Differential Equations
Ning Liu, Lu Zhang, Tian Gao, Yue Yu
Subjects: Machine Learning (cs.LG)
[219] arXiv:2410.02140 [pdf, other]
Title: A Formal Framework for Understanding Length Generalization in Transformers
Xinting Huang, Andy Yang, Satwik Bhattamishra, Yash Sarrof, Andreas Krebs, Hattie Zhou, Preetum Nakkiran, Michael Hahn
Comments: 85 pages, 9 figures, 11 tables. Accepted for publication at ICLR 2025
Subjects: Machine Learning (cs.LG)
[220] arXiv:2410.02143 [pdf, html, other]
Title: Plug-and-Play Controllable Generation for Discrete Masked Models
Wei Guo, Yuchen Zhu, Molei Tao, Yongxin Chen
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[221] arXiv:2410.02145 [pdf, html, other]
Title: Active Learning of Deep Neural Networks via Gradient-Free Cutting Planes
Erica Zhang, Fangzhao Zhang, Mert Pilanci
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[222] arXiv:2410.02147 [pdf, other]
Title: Efficient Source-Free Time-Series Adaptation via Parameter Subspace Disentanglement
Gaurav Patel, Christopher Sandino, Behrooz Mahasseni, Ellen L Zippi, Erdrin Azemi, Ali Moin, Juri Minxha
Comments: Accepted at ICLR 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[223] arXiv:2410.02151 [pdf, html, other]
Title: Quantitative Approximation for Neural Operators in Nonlinear Parabolic Equations
Takashi Furuya, Koichi Taniguchi, Satoshi Okuda
Comments: 31 pages
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[224] arXiv:2410.02158 [pdf, html, other]
Title: ClassContrast: Bridging the Spatial and Contextual Gaps for Node Representations
Md Joshem Uddin, Astrit Tola, Varin Sikand, Cuneyt Gurcan Akcora, Baris Coskunuzer
Comments: 16 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Machine Learning (stat.ML)
[225] arXiv:2410.02159 [pdf, html, other]
Title: Mitigating Memorization In Language Models
Mansi Sakarvadia, Aswathy Ajith, Arham Khan, Nathaniel Hudson, Caleb Geniesse, Kyle Chard, Yaoqing Yang, Ian Foster, Michael W. Mahoney
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[226] arXiv:2410.02164 [pdf, other]
Title: Universality in Transfer Learning for Linear Models
Reza Ghane, Danil Akhtiamov, Babak Hassibi
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[227] arXiv:2410.02167 [pdf, html, other]
Title: Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
Hongkang Li, Songtao Lu, Pin-Yu Chen, Xiaodong Cui, Meng Wang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[228] arXiv:2410.02168 [pdf, html, other]
Title: Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting
Siyang Li, Yize Chen, Hui Xiong
Subjects: Machine Learning (cs.LG)
[229] arXiv:2410.02172 [pdf, html, other]
Title: Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation
Shreyas Chaudhari, Ameet Deshpande, Bruno Castro da Silva, Philip S. Thomas
Comments: Accepted at the Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[230] arXiv:2410.02173 [pdf, html, other]
Title: Efficiently Deploying LLMs with Controlled Risk
Michael J. Zellinger, Matt Thomson
Comments: 10 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[231] arXiv:2410.02176 [pdf, html, other]
Title: Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks
Ke Chen, Chugang Yi, Haizhao Yang
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[232] arXiv:2410.02184 [pdf, html, other]
Title: CodeJudge: Evaluating Code Generation with Large Language Models
Weixi Tong, Tianyi Zhang
Comments: Accepted to EMNLP 2024 (Main, Long Paper)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Software Engineering (cs.SE)
[233] arXiv:2410.02195 [pdf, html, other]
Title: BACKTIME: Backdoor Attacks on Multivariate Time Series Forecasting
Xiao Lin, Zhining Liu, Dongqi Fu, Ruizhong Qiu, Hanghang Tong
Comments: 23 pages. Neurips 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[234] arXiv:2410.02198 [pdf, html, other]
Title: G2T-LLM: Graph-to-Tree Text Encoding for Molecule Generation with Fine-Tuned Large Language Models
Zhaoning Yu, Xiangyang Xu, Hongyang Gao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[235] arXiv:2410.02199 [pdf, html, other]
Title: Deep Koopman-layered Model with Universal Property Based on Toeplitz Matrices
Yuka Hashimoto, Tomoharu Iwata
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS); Functional Analysis (math.FA); Machine Learning (stat.ML)
[236] arXiv:2410.02200 [pdf, html, other]
Title: Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts
Minh Le, Chau Nguyen, Huy Nguyen, Quyen Tran, Trung Le, Nhat Ho
Comments: Accepted to ICLR 2025. 42 pages, 8 tables, 3 figures
Subjects: Machine Learning (cs.LG)
[237] arXiv:2410.02217 [pdf, html, other]
Title: Stochastic Sampling from Deterministic Flow Models
Saurabh Singh, Ian Fischer
Comments: Submitted to ICLR 2025
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[238] arXiv:2410.02226 [pdf, html, other]
Title: Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Daniel Liu, Claire Chen, Shangtong Zhang
Subjects: Machine Learning (cs.LG)
[239] arXiv:2410.02230 [pdf, html, other]
Title: Mitigating Downstream Model Risks via Model Provenance
Keyu Wang, Abdullah Norozi Iranzad, Scott Schaffter, Meg Risdal, Doina Precup, Jonathan Lebensold
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[240] arXiv:2410.02236 [pdf, html, other]
Title: C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu, Yuxin Pan, Linjie Xu, Lei Song, Jiang Bian, Pengcheng You, Yize Chen
Comments: Published as a conference paper at ICLR 2025. Code available at this https URL
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[241] arXiv:2410.02242 [pdf, html, other]
Title: Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis
Hyunwoo Lee, Hayoung Choi, Hyunju Kim
Comments: ICLR 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[242] arXiv:2410.02246 [pdf, html, other]
Title: PFGuard: A Generative Framework with Privacy and Fairness Safeguards
Soyeon Kim, Yuji Roh, Geon Heo, Steven Euijong Whang
Comments: In Proceedings of the 13th International Conference on Learning Representations (ICLR), 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[243] arXiv:2410.02247 [pdf, html, other]
Title: Theoretical Insights into Fine-Tuning Attention Mechanism: Generalization and Optimization
Xinhao Yao, Hongjin Qian, Xiaolin Hu, Gengze Xu, Wei Liu, Jian Luan, Bin Wang, Yong Liu
Comments: IJCAI 2025
Subjects: Machine Learning (cs.LG)
[244] arXiv:2410.02260 [pdf, html, other]
Title: FedScalar: A Communication efficient Federated Learning
M. Rostami, S. S. Kia
Subjects: Machine Learning (cs.LG)
[245] arXiv:2410.02267 [pdf, other]
Title: Unsupervised Meta-Learning via Dynamic Head and Heterogeneous Task Construction for Few-Shot Classification
Yunchuan Guan, Yu Liu, Ketong Liu, Ke Zhou, Zhiqi Shen
Subjects: Machine Learning (cs.LG)
[246] arXiv:2410.02268 [pdf, html, other]
Title: Structural-Entropy-Based Sample Selection for Efficient and Effective Learning
Tianchi Xie, Jiangning Zhu, Guozu Ma, Minzhi Lin, Wei Chen, Weikai Yang, Shixia Liu
Comments: Published as a conference paper at ICLR 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[247] arXiv:2410.02269 [pdf, html, other]
Title: Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback
Francesco Emanuele Stradi, Anna Lunghi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti
Subjects: Machine Learning (cs.LG)
[248] arXiv:2410.02273 [pdf, other]
Title: Perfect Counterfactuals in Imperfect Worlds: Modelling Noisy Implementation of Actions in Sequential Algorithmic Recourse
Yueqing Xuan, Kacper Sokol, Mark Sanderson, Jeffrey Chan
Comments: Accepted to ECML-PKDD 2025 Journal Track
Journal-ref: Machine Learning 114, no. 8 (2025): 187
Subjects: Machine Learning (cs.LG)
[249] arXiv:2410.02275 [pdf, html, other]
Title: Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti
Comments: arXiv admin note: text overlap with arXiv:2405.14372
Subjects: Machine Learning (cs.LG)
[250] arXiv:2410.02290 [pdf, html, other]
Title: Density based Spatial Clustering of Lines via Probabilistic Generation of Neighbourhood
Akanksha Das, Malay Bhattacharyya
Subjects: Machine Learning (cs.LG)
[251] arXiv:2410.02293 [pdf, html, other]
Title: Efficient Second-Order Neural Network Optimization via Adaptive Trust Region Methods
James Vo
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[252] arXiv:2410.02321 [pdf, html, other]
Title: Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis
Zikun Zhang, Zixiang Chen, Quanquan Gu
Comments: 26 pages, 1 figure
Journal-ref: The Thirteenth International Conference on Learning Representations, 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[253] arXiv:2410.02324 [pdf, html, other]
Title: Automated Tone Transcription and Clustering with Tone2Vec
Yi Yang, Yiming Wang, ZhiQiang Tang, Jiahong Yuan
Comments: Accepted by EMNLP 2024 Findings
Subjects: Machine Learning (cs.LG)
[254] arXiv:2410.02335 [pdf, html, other]
Title: Data Optimisation of Machine Learning Models for Smart Irrigation in Urban Parks
Nasser Ghadiri, Bahman Javadi, Oliver Obst, Sebastian Pfautsch
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[255] arXiv:2410.02344 [pdf, html, other]
Title: EntryPrune: Neural Network Feature Selection using First Impressions
Felix Zimmer, Patrik Okanovic, Torsten Hoefler
Subjects: Machine Learning (cs.LG)
[256] arXiv:2410.02348 [pdf, html, other]
Title: Simplicity bias and optimization threshold in two-layer ReLU networks
Etienne Boursier, Nicolas Flammarion
Comments: ICML camera ready version
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[257] arXiv:2410.02367 [pdf, other]
Title: SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Jintao Zhang, Jia Wei, Haofeng Huang, Pengle Zhang, Jun Zhu, Jianfei Chen
Comments: @inproceedings{zhang2025sageattention, title={SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration}, author={Zhang, Jintao and Wei, Jia and Zhang, Pengle and Zhu, Jun and Chen, Jianfei}, booktitle={International Conference on Learning Representations (ICLR)}, year={2025} }
Journal-ref: The Thirteenth International Conference on Learning Representations (ICLR 2025)
Subjects: Machine Learning (cs.LG)
[258] arXiv:2410.02384 [pdf, html, other]
Title: Unveiling AI's Blind Spots: An Oracle for In-Domain, Out-of-Domain, and Adversarial Errors
Shuangpeng Han, Mengmi Zhang
Subjects: Machine Learning (cs.LG)
[259] arXiv:2410.02387 [pdf, html, other]
Title: BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization
Gustav Wagner Zakarias, Lars Kai Hansen, Zheng-Hua Tan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[260] arXiv:2410.02392 [pdf, html, other]
Title: MANTRA: The Manifold Triangulations Assemblage
Rubén Ballester, Ernst Röell, Daniel Bīn Schmid, Mathieu Alain, Sergio Escalera, Carles Casacuberta, Bastian Rieck
Comments: Accepted at ICLR 2025 (this https URL)
Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT)
[261] arXiv:2410.02394 [pdf, html, other]
Title: Online Multi-Label Classification under Noisy and Changing Label Distribution
Yizhang Zou, Xuegang Hu, Peipei Li, Jun Hu, You Wu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[262] arXiv:2410.02400 [pdf, html, other]
Title: An Online Feasible Point Method for Benign Generalized Nash Equilibrium Problems
Sarah Sachs, Hedi Hadiji, Tim van Erven, Mathias Staudigl
Subjects: Machine Learning (cs.LG)
[263] arXiv:2410.02416 [pdf, other]
Title: Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models
Seyedmorteza Sadat, Otmar Hilliges, Romann M. Weber
Comments: Published as a conference paper at ICLR 2025
Journal-ref: The Thirteenth International Conference on Learning Representations (ICLR 2025)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2410.02438 [pdf, html, other]
Title: Learning K-U-Net with constant complexity: An Application to time series forecasting
Jiang You, Arben Cela, René Natowicz, Jacob Ouanounou, Patrick Siarry
Subjects: Machine Learning (cs.LG)
[265] arXiv:2410.02450 [pdf, html, other]
Title: Personalized Federated Learning for Generative AI-Assisted Semantic Communications
Yubo Peng, Feibo Jiang, Li Dong, Kezhi Wang, Kun Yang
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[266] arXiv:2410.02467 [pdf, html, other]
Title: SIDE: Surrogate Conditional Data Extraction from Diffusion Models
Yunhao Chen, Shujie Wang, Difan Zou, Xingjun Ma
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[267] arXiv:2410.02472 [pdf, html, other]
Title: Meta-Models: An Architecture for Decoding LLM Behaviors Through Interpreted Embeddings and Natural Language
Anthony Costarelli, Mat Allen, Severin Field
Comments: 11 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[268] arXiv:2410.02476 [pdf, html, other]
Title: Online Convex Optimization with a Separation Oracle
Zakaria Mhammedi
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[269] arXiv:2410.02490 [pdf, html, other]
Title: Stochastic variance-reduced Gaussian variational inference on the Bures-Wasserstein manifold
Hoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams, Marcelo Hartmann, Arto Klami
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[270] arXiv:2410.02496 [pdf, html, other]
Title: Efficient learning of differential network in multi-source non-paranormal graphical models
Mojtaba Nikahd, Seyed Abolfazl Motahari
Subjects: Machine Learning (cs.LG)
[271] arXiv:2410.02498 [pdf, html, other]
Title: Dynamic Gradient Alignment for Online Data Mixing
Simin Fan, David Grangier, Pierre Ablin
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[272] arXiv:2410.02512 [pdf, html, other]
Title: SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation
Mucong Ding, Bang An, Yuancheng Xu, Anirudh Satheesh, Furong Huang
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[273] arXiv:2410.02513 [pdf, html, other]
Title: Minimax Group Fairness in Strategic Classification
Emily Diana, Saeed Sharifi-Malvajerdi, Ali Vakilian
Subjects: Machine Learning (cs.LG)
[274] arXiv:2410.02519 [pdf, html, other]
Title: Semantic-Guided RL for Interpretable Feature Engineering
Mohamed Bouadi, Arta Alavi, Salima Benbernou, Mourad Ouziri
Comments: arXiv admin note: substantial text overlap with arXiv:2406.00544
Subjects: Machine Learning (cs.LG)
[275] arXiv:2410.02541 [pdf, html, other]
Title: Fair Decentralized Learning
Sayan Biswas, Anne-Marie Kermarrec, Rishi Sharma, Thibaud Trinca, Martijn de Vos
Comments: To appear in the proceedings of "3rd IEEE Conference on Secure and Trustworthy Machine Learning" (SatML'25)
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[276] arXiv:2410.02551 [pdf, html, other]
Title: ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration
Zixiang Wang, Yinghao Zhu, Huiya Zhao, Xiaochen Zheng, Dehao Sui, Tianlong Wang, Wen Tang, Yasha Wang, Ewen Harrison, Chengwei Pan, Junyi Gao, Liantao Ma
Comments: ACM TheWebConf 2025 Conference (WWW 2025) Research Track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[277] arXiv:2410.02566 [pdf, html, other]
Title: Deep Learning-Based Prediction of Suspension Dynamics Performance in Multi-Axle Vehicles
Kai Chun Lin, Bo-Yi Lin
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Numerical Analysis (math.NA)
[278] arXiv:2410.02581 [pdf, html, other]
Title: Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance
Joshua McClellan, Naveed Haghani, John Winder, Furong Huang, Pratap Tokekar
Comments: accepted as a poster at NeurIPS 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[279] arXiv:2410.02596 [pdf, html, other]
Title: Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Rui Hu, Yifan Zhang, Zhuoran Li, Longbo Huang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[280] arXiv:2410.02597 [pdf, html, other]
Title: HAINAN: Fast and Accurate Transducer for Hybrid-Autoregressive ASR
Hainan Xu, Travis M. Bartley, Vladimir Bataev, Boris Ginsburg
Subjects: Machine Learning (cs.LG)
[281] arXiv:2410.02601 [pdf, html, other]
Title: Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting
Sergei Kholkin, Grigoriy Ksenofontov, David Li, Nikita Kornilov, Nikita Gushchin, Alexandra Suvorikova, Alexey Kroshnin, Evgeny Burnaev, Alexander Korotin
Subjects: Machine Learning (cs.LG)
[282] arXiv:2410.02605 [pdf, html, other]
Title: A Prospect-Theoretic Policy Gradient Algorithm for Behavioral Alignment in Reinforcement Learning
Olivier Lepel, Anas Barakat
Comments: revised version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[283] arXiv:2410.02615 [pdf, html, other]
Title: EXGRA-MED: Extended Context Graph Alignment for Medical Vision- Language Models
Duy M. H. Nguyen, Nghiem T. Diep, Trung Q. Nguyen, Hoang-Bao Le, Tai Nguyen, Tien Nguyen, TrungTin Nguyen, Nhat Ho, Pengtao Xie, Roger Wattenhofer, James Zou, Daniel Sonntag, Mathias Niepert
Comments: Version 2
Subjects: Machine Learning (cs.LG)
[284] arXiv:2410.02622 [pdf, html, other]
Title: Diss-l-ECT: Dissecting Graph Data with Local Euler Characteristic Transforms
Julius von Rohrscheidt, Bastian Rieck
Comments: Accepted at ICML 2025
Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT)
[285] arXiv:2410.02628 [pdf, html, other]
Title: Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization
Mikhail Persiianov, Arip Asadulaev, Nikita Andreev, Nikita Starodubcev, Dmitry Baranchuk, Anastasis Kratsios, Evgeny Burnaev, Alexander Korotin
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[286] arXiv:2410.02639 [pdf, html, other]
Title: Labor Migration Modeling through Large-scale Job Query Data
Zhuoning Guo, Le Zhang, Hengshu Zhu, Weijia Zhang, Hui Xiong, Hao Liu
Subjects: Machine Learning (cs.LG)
[287] arXiv:2410.02647 [pdf, html, other]
Title: Immunogenicity Prediction with Dual Attention Enables Vaccine Target Selection
Song Li, Yang Tan, Song Ke, Liang Hong, Bingxin Zhou
Comments: 20 pages, 17 tables, 6 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Biomolecules (q-bio.BM)
[288] arXiv:2410.02651 [pdf, html, other]
Title: CAX: Cellular Automata Accelerated in JAX
Maxence Faldor, Antoine Cully
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[289] arXiv:2410.02654 [pdf, html, other]
Title: Deconstructing Recurrence, Attention, and Gating: Investigating the transferability of Transformers and Gated Recurrent Neural Networks in forecasting of dynamical systems
Hunter S. Heidenreich, Pantelis R. Vlachas, Petros Koumoutsakos
Subjects: Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD); Computational Physics (physics.comp-ph)
[290] arXiv:2410.02656 [pdf, html, other]
Title: Scalable Simulation-free Entropic Unbalanced Optimal Transport
Jaemoo Choi, Jaewoong Choi
Comments: 26 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[291] arXiv:2410.02666 [pdf, html, other]
Title: AlphaIntegrator: Transformer Action Search for Symbolic Integration Proofs
Mert Ünsal, Timon Gehr, Martin Vechev
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Symbolic Computation (cs.SC)
[292] arXiv:2410.02667 [pdf, html, other]
Title: GUD: Generation with Unified Diffusion
Mathis Gerdes, Max Welling, Miranda C. N. Cheng
Comments: 11 pages, 8 figures
Subjects: Machine Learning (cs.LG); High Energy Physics - Theory (hep-th); Machine Learning (stat.ML)
[293] arXiv:2410.02675 [pdf, html, other]
Title: FAN: Fourier Analysis Networks
Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, Jinliang Deng, Jing Su, Jun Zhang, Jingjing Xu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[294] arXiv:2410.02681 [pdf, html, other]
Title: Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models
Shuoyuan Wang, Yixuan Li, Hongxin Wei
Comments: Accepted by ICML 2025
Subjects: Machine Learning (cs.LG)
[295] arXiv:2410.02698 [pdf, html, other]
Title: Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups
Zakhar Shumaylov, Peter Zaika, James Rowbottom, Ferdia Sherry, Melanie Weber, Carola-Bibiane Schönlieb
Comments: 44 pages; accepted at ICLR 2025
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[296] arXiv:2410.02711 [pdf, html, other]
Title: NETS: A Non-Equilibrium Transport Sampler
Michael S. Albergo, Eric Vanden-Eijnden
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); High Energy Physics - Lattice (hep-lat)
[297] arXiv:2410.02718 [pdf, html, other]
Title: SynthFormer: Equivariant Pharmacophore-based Generation of Synthesizable Molecules for Ligand-Based Drug Design
Zygimantas Jocys, Zhanxing Zhu, Henriette M.G. Willems, Katayoun Farrahi
Subjects: Machine Learning (cs.LG)
[298] arXiv:2410.02733 [pdf, html, other]
Title: Data Similarity-Based One-Shot Clustering for Multi-Task Hierarchical Federated Learning
Abdulmoneam Ali, Ahmed Arafa
Comments: To appear in Asilomar 2024
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)
[299] arXiv:2410.02735 [pdf, html, other]
Title: OOD-Chameleon: Is Algorithm Selection for OOD Generalization Learnable?
Liangze Jiang, Damien Teney
Comments: ICML 2025
Subjects: Machine Learning (cs.LG)
[300] arXiv:2410.02749 [pdf, html, other]
Title: Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
Ulyana Piterbarg, Lerrel Pinto, Rob Fergus
Comments: ICLR 2025
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
Total of 4845 entries : 1-100 101-200 201-300 301-400 401-500 501-600 ... 4801-4845
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack