Machine Learning

Authors and titles for February 2024

Total of 3960 entries : 1-100 101-200 201-300 301-400 401-500 501-600 ... 3901-3960
Showing up to 100 entries per page: fewer | more | all
[201] arXiv:2402.01922 [pdf, html, other]
Title: A General Framework for Learning from Weak Supervision
Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj
Comments: 24 pages, 20 tables, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[202] arXiv:2402.01928 [pdf, html, other]
Title: Robust Counterfactual Explanations in Machine Learning: A Survey
Junqi Jiang, Francesco Leofante, Antonio Rago, Francesca Toni
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[203] arXiv:2402.01929 [pdf, html, other]
Title: Sample, estimate, aggregate: A recipe for causal discovery foundation models
Menghua Wu, Yujia Bao, Regina Barzilay, Tommi Jaakkola
Comments: Our code is available at this https URL
Journal-ref: Transactions on Machine Learning Research (03/2025)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[204] arXiv:2402.01931 [pdf, html, other]
Title: Digits micro-model for accurate and secure transactions
Chirag Chhablani, Nikhita Sharma, Jordan Hosier, Vijay K. Gurbani
Comments: 7 pages, 1 figure, 5 tables
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[205] arXiv:2402.01943 [pdf, html, other]
Title: Precedence-Constrained Winter Value for Effective Graph Data Valuation
Hongliang Chi, Wei Jin, Charu Aggarwal, Yao Ma
Comments: 17 pages in total
Subjects: Machine Learning (cs.LG)
[206] arXiv:2402.01955 [pdf, html, other]
Title: OPSurv: Orthogonal Polynomials Quadrature Algorithm for Survival Analysis
Lilian W. Bialokozowicz, Hoang M. Le, Tristan Sylvain, Peter A. I. Forsyth, Vineel Nagisetty, Greg Mori
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Functional Analysis (math.FA)
[207] arXiv:2402.01960 [pdf, other]
Title: Calibrated Uncertainty Quantification for Operator Learning via Conformal Prediction
Ziqi Ma, Kamyar Azizzadenesheli, Anima Anandkumar
Comments: 14 pages, 7 figures
Subjects: Machine Learning (cs.LG)
[208] arXiv:2402.01963 [pdf, html, other]
Title: Improving Large-Scale k-Nearest Neighbor Text Categorization with Label Autoencoders
Francisco J. Ribadas-Pena, Shuyuan Cao, Víctor M. Darriba Bilbao
Comments: 22 pages, 4 figures
Journal-ref: Mathematics 2022, 10(16), 2867
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[209] arXiv:2402.01964 [pdf, html, other]
Title: Scalable and Efficient Temporal Graph Representation Learning via Forward Recent Sampling
Yuhong Luo, Pan Li
Comments: Learning on Graphs Conference (LoG 2024)
Subjects: Machine Learning (cs.LG)
[210] arXiv:2402.01965 [pdf, html, other]
Title: Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization
Fangzhao Zhang, Mert Pilanci
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[211] arXiv:2402.01969 [pdf, html, other]
Title: Simulation-Enhanced Data Augmentation for Machine Learning Pathloss Prediction
Ahmed P. Mohamed, Byunghyun Lee, Yaguang Zhang, Max Hollingsworth, C. Robert Anderson, James V. Krogmeier, David J. Love
Comments: 6 pages, 5 figures, Accepted at ICC 2024
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[212] arXiv:2402.01975 [pdf, html, other]
Title: Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks
Duy M. H. Nguyen, Nina Lukashina, Tai Nguyen, An T. Le, TrungTin Nguyen, Nhat Ho, Jan Peters, Daniel Sonntag, Viktor Zaverkin, Mathias Niepert
Comments: Accepted at ICML 2024 (updated version)
Subjects: Machine Learning (cs.LG)
[213] arXiv:2402.01987 [pdf, html, other]
Title: Online Transfer Learning for RSV Case Detection
Yiming Sun, Yuhe Gao, Runxue Bao, Gregory F. Cooper, Jessi Espino, Harry Hochheiser, Marian G. Michaels, John M. Aronis, Chenxi Song, Ye Ye
Comments: 10 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[214] arXiv:2402.01995 [pdf, html, other]
Title: Online Uniform Sampling: Randomized Learning-Augmented Approximation Algorithms with Application to Digital Health
Xueqing Liu, Kyra Gan, Esmaeil Keyvanshokooh, Susan Murphy
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[215] arXiv:2402.01999 [pdf, html, other]
Title: A Novel Hyperdimensional Computing Framework for Online Time Series Forecasting on the Edge
Mohamed Mejri, Chandramouli Amarnath, Abhijit Chatterjee
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[216] arXiv:2402.02000 [pdf, html, other]
Title: A Survey on Graph Condensation
Hongjia Xu, Liangliang Zhang, Yao Ma, Sheng Zhou, Zhuonan Zheng, Bu Jiajun
Subjects: Machine Learning (cs.LG)
[217] arXiv:2402.02005 [pdf, html, other]
Title: Topology-Informed Graph Transformer
Yun Young Choi, Sun Woo Park, Minho Lee, Youngho Woo
Comments: Proceedings of the Geometry-grounded Representation Learning and Generative Modeling Workshop (GRaM) at ICML 2024
Subjects: Machine Learning (cs.LG)
[218] arXiv:2402.02006 [pdf, html, other]
Title: PresAIse, A Prescriptive AI Solution for Enterprises
Wei Sun, Scott McFaddin, Linh Ha Tran, Shivaram Subramanian, Kristjan Greenewald, Yeshi Tenzin, Zack Xue, Youssef Drissi, Markus Ettl
Comments: 14 pages
Subjects: Machine Learning (cs.LG)
[219] arXiv:2402.02007 [pdf, html, other]
Title: Understanding Time Series Anomaly State Detection through One-Class Classification
Hanxu Zhou, Yuan Zhang, Guangjie Leng, Ruofan Wang, Zhi-Qin John Xu
Subjects: Machine Learning (cs.LG)
[220] arXiv:2402.02009 [pdf, html, other]
Title: Robust Multi-Task Learning with Excess Risks
Yifei He, Shiji Zhou, Guojun Zhang, Hyokun Yun, Yi Xu, Belinda Zeng, Trishul Chilimbi, Han Zhao
Comments: ICML 2024 camera-ready version
Subjects: Machine Learning (cs.LG)
[221] arXiv:2402.02010 [pdf, html, other]
Title: GenFormer: A Deep-Learning-Based Approach for Generating Multivariate Stochastic Processes
Haoran Zhao, Wayne Isaac Tan Uy
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[222] arXiv:2402.02017 [pdf, html, other]
Title: Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning
Jeonghye Kim, Suyoung Lee, Woojun Kim, Youngchul Sung
Comments: Accepted to NeurIPS2024. The project page is available at this https URL
Subjects: Machine Learning (cs.LG)
[223] arXiv:2402.02018 [pdf, html, other]
Title: The Landscape and Challenges of HPC Research and LLMs
Le Chen, Nesreen K. Ahmed, Akash Dutta, Arijit Bhattacharjee, Sixing Yu, Quazi Ishtiaque Mahmud, Waqwoya Abebe, Hung Phan, Aishwarya Sarkar, Branden Butler, Niranjan Hasabnis, Gal Oren, Vy A. Vo, Juan Pablo Munoz, Theodore L. Willke, Tim Mattson, Ali Jannesari
Subjects: Machine Learning (cs.LG)
[224] arXiv:2402.02021 [pdf, html, other]
Title: Transfer Learning in ECG Diagnosis: Is It Effective?
Cuong V. Nguyen, Cuong D. Do
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[225] arXiv:2402.02023 [pdf, html, other]
Title: Self-Supervised Contrastive Learning for Long-term Forecasting
Junwoo Park, Daehoon Gwak, Jaegul Choo, Edward Choi
Comments: Accepted at International Conference on Learning Representations (ICLR) 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[226] arXiv:2402.02025 [pdf, html, other]
Title: A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi, Xun Shen, Yanan Sui
Comments: Accepted at IJCAI-24 survey track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[227] arXiv:2402.02028 [pdf, html, other]
Title: Unlearnable Examples For Time Series
Yujing Jiang, Xingjun Ma, Sarah Monazam Erfani, James Bailey
Subjects: Machine Learning (cs.LG)
[228] arXiv:2402.02031 [pdf, html, other]
Title: Multi-fidelity physics constrained neural networks for dynamical systems
Hao Zhou, Sibo Cheng, Rossella Arcucci
Journal-ref: Computer Methods in Applied Mechanics and Engineering. 2024 Feb 15;420:116758
Subjects: Machine Learning (cs.LG); Fluid Dynamics (physics.flu-dyn)
[229] arXiv:2402.02032 [pdf, html, other]
Title: RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies
Hao Cheng, Qingsong Wen, Yang Liu, Liang Sun
Comments: Accepted by the 12th International Conference on Learning Representations (ICLR 2024)
Subjects: Machine Learning (cs.LG)
[230] arXiv:2402.02036 [pdf, html, other]
Title: Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks
Zhuomin Chen, Jiaxing Zhang, Jingchao Ni, Xiaoting Li, Yuchen Bian, Md Mezbahul Islam, Ananda Mohan Mondal, Hua Wei, Dongsheng Luo
Comments: Accepted to International Conference on Machine Learning (ICML 2024)
Subjects: Machine Learning (cs.LG)
[231] arXiv:2402.02042 [pdf, html, other]
Title: Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm
Qinbo Bai, Washim Uddin Mondal, Vaneet Aggarwal
Journal-ref: NeurIPS 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[232] arXiv:2402.02043 [pdf, other]
Title: A Plug-in Tiny AI Module for Intelligent and Selective Sensor Data Transmission
Wenjun Huang, Arghavan Rezvani, Hanning Chen, Yang Ni, Sanggeon Yun, Sungheon Jeong, Mohsen Imani
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI)
[233] arXiv:2402.02044 [pdf, other]
Title: Locally-Adaptive Quantization for Streaming Vector Search
Cecilia Aguerrebere, Mark Hildebrand, Ishwar Singh Bhati, Theodore Willke, Mariano Tepper
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR)
[234] arXiv:2402.02051 [pdf, html, other]
Title: Nonlinear subspace clustering by functional link neural networks
Long Shi, Lei Cao, Zhongpu Chen, Badong Chen, Yu Zhao
Subjects: Machine Learning (cs.LG)
[235] arXiv:2402.02052 [pdf, other]
Title: Feature Selection using the concept of Peafowl Mating in IDS
Partha Ghosh, Joy Sharma, Nilesh Pandey
Journal-ref: International Journal of Computer Networks & Communications (IJCNC) Vol.16, No.1, January 2024
Subjects: Machine Learning (cs.LG)
[236] arXiv:2402.02054 [pdf, html, other]
Title: Towards Neural Scaling Laws on Graphs
Jingzhe Liu, Haitao Mao, Zhikai Chen, Tong Zhao, Neil Shah, Jiliang Tang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[237] arXiv:2402.02055 [pdf, html, other]
Title: Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning
Yiping Wang, Yifang Chen, Wendan Yan, Kevin Jamieson, Simon Shaolei Du
Comments: 17 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[238] arXiv:2402.02057 [pdf, html, other]
Title: Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[239] arXiv:2402.02065 [pdf, html, other]
Title: Training Implicit Networks for Image Deblurring using Jacobian-Free Backpropagation
Linghai Liu, Shuaicheng Tong, Lisa Zhao
Subjects: Machine Learning (cs.LG)
[240] arXiv:2402.02081 [pdf, html, other]
Title: Risk-Sensitive Diffusion: Robustly Optimizing Diffusion Models with Noisy Samples
Yangming Li, Max Ruiz Luyten, Mihaela van der Schaar
Comments: Paper under review
Subjects: Machine Learning (cs.LG)
[241] arXiv:2402.02095 [pdf, html, other]
Title: Contrasting Adversarial Perturbations: The Space of Harmless Perturbations
Lu Chen, Shaofeng Li, Benhao Huang, Fan Yang, Zheng Li, Jie Li, Yuan Luo
Subjects: Machine Learning (cs.LG)
[242] arXiv:2402.02104 [pdf, other]
Title: Learning Structure-Aware Representations of Dependent Types
Konstantinos Kogkalidis, Orestis Melkonian, Jean-Philippe Bernardy
Comments: NeurIPS 2024
Subjects: Machine Learning (cs.LG); Programming Languages (cs.PL)
[243] arXiv:2402.02110 [pdf, html, other]
Title: Composite Active Learning: Towards Multi-Domain Active Learning with Theoretical Guarantees
Guang-Yuan Hao, Hengguan Huang, Haotian Wang, Jie Gao, Hao Wang
Journal-ref: AAAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[244] arXiv:2402.02114 [pdf, html, other]
Title: Handling Delayed Feedback in Distributed Online Optimization: A Projection-Free Approach
Tuan-Anh Nguyen, Nguyen Kim Thang, Denis Trystram
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[245] arXiv:2402.02124 [pdf, html, other]
Title: Grammar-based evolutionary approach for automated workflow composition with domain-specific operators and ensemble diversity
Rafael Barbudo, Aurora Ramírez, José Raúl Romero
Comments: 32 pages, 7 figures, 6 tables, journal paper
Journal-ref: Applied Soft Computing, 111292. 2024
Subjects: Machine Learning (cs.LG)
[246] arXiv:2402.02139 [pdf, other]
Title: Using Deep Ensemble Forest for High Resolution Mapping of PM2.5 from MODIS MAIAC AOD in Tehran, Iran
Hossein Bagheri
Journal-ref: Environ Monit Assess 195, 377 (2023)
Subjects: Machine Learning (cs.LG)
[247] arXiv:2402.02165 [pdf, html, other]
Title: Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li, Zicheng Zhang, Wang Luo, Congying Han, Yudong Hu, Tiande Guo, Shichen Liao
Journal-ref: ICML 2024 Oral
Subjects: Machine Learning (cs.LG)
[248] arXiv:2402.02168 [pdf, html, other]
Title: Enhancing Cross-domain Link Prediction via Evolution Process Modeling
Xuanwen Huang, Wei Chow, Yize Zhu, Yang Wang, Ziwei Chai, Chunping Wang, Lei Chen, Yang Yang
Comments: Accepted by WWW'25
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[249] arXiv:2402.02186 [pdf, html, other]
Title: Evolution Guided Generative Flow Networks
Zarif Ikram, Ling Pan, Dianbo Liu
Comments: Transactions on Machine Learning Research
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[250] arXiv:2402.02207 [pdf, html, other]
Title: Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models
Yongshuo Zong, Ondrej Bohdal, Tingyang Yu, Yongxin Yang, Timothy Hospedales
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[251] arXiv:2402.02211 [pdf, other]
Title: Query-decision Regression between Shortest Path and Minimum Steiner Tree
Guangmo Tong, Peng Zhao, Mina Samizadeh
Comments: PAKDD 2024
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[252] arXiv:2402.02216 [pdf, html, other]
Title: Position: Graph Foundation Models are Already Here
Haitao Mao, Zhikai Chen, Wenzhuo Tang, Jianan Zhao, Yao Ma, Tong Zhao, Neil Shah, Mikhail Galkin, Jiliang Tang
Comments: 23 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[253] arXiv:2402.02225 [pdf, html, other]
Title: Rethinking the Starting Point: Collaborative Pre-Training for Federated Downstream Tasks
Yun-Wei Chu, Dong-Jun Han, Seyyedali Hosseinalipour, Christopher G. Brinton
Comments: AAAI 2025
Subjects: Machine Learning (cs.LG)
[254] arXiv:2402.02229 [pdf, html, other]
Title: Vanilla Bayesian Optimization Performs Great in High Dimensions
Carl Hvarfner, Erik Orm Hellsten, Luigi Nardi
Journal-ref: International Conference on Machine Learning, 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[255] arXiv:2402.02230 [pdf, other]
Title: Federated Learning with Differential Privacy
Adrien Banse, Jan Kreischer, Xavier Oliva i Jürgens
Comments: Machine Learning (ML) & Federated Learning (FL); 4 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[256] arXiv:2402.02239 [pdf, html, other]
Title: Distributional Reduction: Unifying Dimensionality Reduction and Clustering with Gromov-Wasserstein
Hugues Van Assel, Cédric Vincent-Cuaz, Nicolas Courty, Rémi Flamary, Pascal Frossard, Titouan Vayer
Comments: 45 pages, 20 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[257] arXiv:2402.02249 [pdf, html, other]
Title: Don't Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget
Florian E. Dorner, Moritz Hardt
Comments: 34 pages, 3 Figures, Published at ICML 2024
Subjects: Machine Learning (cs.LG)
[258] arXiv:2402.02254 [pdf, other]
Title: Teacher-Student Learning based Low Complexity Relay Selection in Wireless Powered Communications
Aysun Gurur Onalan, Berkay Kopru, Sinem Coleri
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)
[259] arXiv:2402.02258 [pdf, html, other]
Title: XTSFormer: Cross-Temporal-Scale Transformer for Irregular-Time Event Prediction in Clinical Applications
Tingsong Xiao, Zelin Xu, Wenchong He, Zhengkun Xiao, Yupu Zhang, Zibo Liu, Shigang Chen, My T. Thai, Jiang Bian, Parisa Rashidi, Zhe Jiang
Comments: Accepted at AAAI 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[260] arXiv:2402.02263 [pdf, html, other]
Title: MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers
Yatong Bai, Mo Zhou, Vishal M. Patel, Somayeh Sojoudi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[261] arXiv:2402.02268 [pdf, html, other]
Title: Federated Learning with New Knowledge: Fundamentals, Advances, and Futures
Lixu Wang, Yang Zhao, Jiahua Dong, Ating Yin, Qinbin Li, Xiao Wang, Dusit Niyato, Qi Zhu
Comments: 10 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[262] arXiv:2402.02275 [pdf, other]
Title: SudokuSens: Enhancing Deep Learning Robustness for IoT Sensing Applications using a Generative Approach
Tianshi Wang, Jinyang Li, Ruijie Wang, Denizhan Kara, Shengzhong Liu, Davis Wertheimer, Antoni Viros-i-Martin, Raghu Ganti, Mudhakar Srivatsa, Tarek Abdelzaher
Comments: Published in ACM Conference on Embedded Networked Sensor Systems (SenSys 23), November, 2023, Istanbul, Turkiye. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. Publication rights licensed to the Association for Computing Machinery
Subjects: Machine Learning (cs.LG)
[263] arXiv:2402.02277 [pdf, html, other]
Title: Causal Bayesian Optimization via Exogenous Distribution Learning
Shaogang Ren, Xiaoning Qian
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[264] arXiv:2402.02287 [pdf, other]
Title: Future Directions in the Theory of Graph Machine Learning
Christopher Morris, Fabrizio Frasca, Nadav Dym, Haggai Maron, İsmail İlkan Ceylan, Ron Levie, Derek Lim, Michael Bronstein, Martin Grohe, Stefanie Jegelka
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[265] arXiv:2402.02309 [pdf, other]
Title: Jailbreaking Attack against Multimodal Large Language Model
Zhenxing Niu, Haodong Ren, Xinbo Gao, Gang Hua, Rong Jin
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[266] arXiv:2402.02314 [pdf, html, other]
Title: Selecting Large Language Model to Fine-tune via Rectified Scaling Law
Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, Zihao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang
Journal-ref: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[267] arXiv:2402.02316 [pdf, other]
Title: Your Diffusion Model is Secretly a Certifiably Robust Classifier
Huanran Chen, Yinpeng Dong, Shitong Shao, Zhongkai Hao, Xiao Yang, Hang Su, Jun Zhu
Comments: Accepted by NeurIPS 2024. Also named as "Diffusion Models are Certifiably Robust Classifiers"
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[268] arXiv:2402.02317 [pdf, html, other]
Title: INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer
Han Fang, Zhihao Song, Paul Weng, Yutong Ban
Comments: Accepted as poster of ICML-2024
Subjects: Machine Learning (cs.LG)
[269] arXiv:2402.02318 [pdf, other]
Title: Diversity Measurement and Subset Selection for Instruction Tuning Datasets
Peiqi Wang, Yikang Shen, Zhen Guo, Matthew Stallone, Yoon Kim, Polina Golland, Rameswar Panda
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[270] arXiv:2402.02321 [pdf, other]
Title: Active Learning for Graphs with Noisy Structures
Hongliang Chi, Cong Qi, Suhang Wang, Yao Ma
Subjects: Machine Learning (cs.LG)
[271] arXiv:2402.02322 [pdf, html, other]
Title: Dynamic Incremental Optimization for Best Subset Selection
Shaogang Ren, Xiaoning Qian
Comments: arXiv admin note: substantial text overlap with arXiv:2207.02058
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[272] arXiv:2402.02325 [pdf, other]
Title: Momentum Does Not Reduce Stochastic Noise in Stochastic Gradient Descent
Naoki Sato, Hideaki Iiduka
Comments: We retract this paper due to an irrecoverable and critical error in its content
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[273] arXiv:2402.02328 [pdf, html, other]
Title: Sample Complexity of Algorithm Selection Using Neural Networks and Its Applications to Branch-and-Cut
Hongyu Cheng, Sammy Khalife, Barbara Fiedorowicz, Amitabh Basu
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[274] arXiv:2402.02332 [pdf, html, other]
Title: Minusformer: Improving Time Series Forecasting by Progressively Learning Residuals
Daojun Liang, Haixia Zhang, Dongfeng Yuan, Bingzheng Zhang, Minggao Zhang
Subjects: Machine Learning (cs.LG)
[275] arXiv:2402.02334 [pdf, html, other]
Title: Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning
Yi Cheng, Renjun Hu, Haochao Ying, Xing Shi, Jian Wu, Wei Lin
Comments: 11 pages, 8 figures, to be published to AAAI2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[276] arXiv:2402.02342 [pdf, html, other]
Title: MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters
Arsalan Sharifnassab, Saber Salehkaleybar, Richard Sutton
Journal-ref: ICML 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[277] arXiv:2402.02345 [pdf, other]
Title: Stereographic Spherical Sliced Wasserstein Distances
Huy Tran, Yikun Bai, Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Rocio Diaz Martin, Soheil Kolouri
Comments: Published at ICML 2024 (Spotlight). Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[278] arXiv:2402.02347 [pdf, html, other]
Title: Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
Fangzhao Zhang, Mert Pilanci
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC)
[279] arXiv:2402.02354 [pdf, other]
Title: A Paradigm for Potential Model Performance Improvement in Classification and Regression Problems. A Proof of Concept
Francisco Javier Lobo-Cabrera
Subjects: Machine Learning (cs.LG)
[280] arXiv:2402.02355 [pdf, html, other]
Title: Symbol: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning
Jiacheng Chen, Zeyuan Ma, Hongshu Guo, Yining Ma, Jie Zhang, Yue-Jiao Gong
Comments: Published as a conference paper at ICLR 2024
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[281] arXiv:2402.02357 [pdf, other]
Title: Multi-modal Causal Structure Learning and Root Cause Analysis
Lecheng Zheng, Zhengzhang Chen, Jingrui He, Haifeng Chen
Comments: Accepted by the Web Conference 2024
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[282] arXiv:2402.02361 [pdf, html, other]
Title: Pruner: A Draft-then-Verify Exploration Mechanism to Accelerate Tensor Program Tuning
Liang Qiao, Jun Shi, Xiaoyu Hao, Xi Fang, Sen Zhang, Minfan Zhao, Ziqi Zhu, Junshi Chen, Hong An, Xulong Tang, Bing Li, Honghui Yuan, Xinyang Wang
Subjects: Machine Learning (cs.LG)
[283] arXiv:2402.02362 [pdf, html, other]
Title: Unification of Symmetries Inside Neural Networks: Transformer, Feedforward and Neural ODE
Koji Hashimoto, Yuji Hirono, Akiyoshi Sannai
Comments: 11 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); High Energy Physics - Theory (hep-th); Computational Physics (physics.comp-ph)
[284] arXiv:2402.02364 [pdf, html, other]
Title: Loss Landscape Degeneracy and Stagewise Development in Transformers
Jesse Hoogland, George Wang, Matthew Farrugia-Roberts, Liam Carroll, Susan Wei, Daniel Murfet
Comments: To appear, TMLR. Material on essential dynamics from v1 of this preprint has been removed and developed in arXiv:2501.17745
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[285] arXiv:2402.02366 [pdf, html, other]
Title: Transolver: A Fast Transformer Solver for PDEs on General Geometries
Haixu Wu, Huakun Luo, Haowen Wang, Jianmin Wang, Mingsheng Long
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[286] arXiv:2402.02368 [pdf, html, other]
Title: Timer: Generative Pre-trained Transformers Are Large Time Series Models
Yong Liu, Haoran Zhang, Chenyu Li, Xiangdong Huang, Jianmin Wang, Mingsheng Long
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[287] arXiv:2402.02370 [pdf, html, other]
Title: AutoTimes: Autoregressive Time Series Forecasters via Large Language Models
Yong Liu, Guo Qin, Xiangdong Huang, Jianmin Wang, Mingsheng Long
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[288] arXiv:2402.02399 [pdf, html, other]
Title: FreDF: Learning to Forecast in the Frequency Domain
Hao Wang, Licheng Pan, Zhichao Chen, Degui Yang, Sen Zhang, Yifei Yang, Xinggao Liu, Haoxuan Li, Dacheng Tao
Comments: Accepted by ICLR 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Machine Learning (stat.ML)
[289] arXiv:2402.02407 [pdf, html, other]
Title: Defining Neural Network Architecture through Polytope Structures of Dataset
Sangmin Lee, Abbas Mammadov, Jong Chul Ye
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[290] arXiv:2402.02423 [pdf, other]
Title: Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng
Comments: Published as a conference paper at ICLR 2024. The website is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[291] arXiv:2402.02425 [pdf, html, other]
Title: DeepLag: Discovering Deep Lagrangian Dynamics for Intuitive Fluid Prediction
Qilong Ma, Haixu Wu, Lanxiang Xing, Shangchen Miao, Mingsheng Long
Subjects: Machine Learning (cs.LG); Fluid Dynamics (physics.flu-dyn)
[292] arXiv:2402.02429 [pdf, html, other]
Title: Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li, Hai Zhang, Xinyu Zhang, Shatong Zhu, Yang Yu, Junqiao Zhao, Pheng-Ann Heng
Comments: 26 pages, 8 figures, 7 tables. TLDR: We propose a novel information theoretic framework of the context-based offline meta-RL paradigm, which unifies several mainstream methods and leads to two robust algorithm implementations
Journal-ref: 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG)
[293] arXiv:2402.02438 [pdf, other]
Title: Fast and interpretable Support Vector Classification based on the truncated ANOVA decomposition
Kseniya Akhalaya, Franziska Nestler, Daniel Potts
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[294] arXiv:2402.02439 [pdf, html, other]
Title: DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Guanghe Li, Yixiang Shan, Zhengbang Zhu, Ting Long, Weinan Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[295] arXiv:2402.02441 [pdf, html, other]
Title: TopoX: A Suite of Python Packages for Machine Learning on Topological Domains
Mustafa Hajij, Mathilde Papillon, Florian Frantzen, Jens Agerberg, Ibrahem AlJabea, Rubén Ballester, Claudio Battiloro, Guillermo Bernárdez, Tolga Birdal, Aiden Brent, Peter Chin, Sergio Escalera, Simone Fiorellino, Odin Hoff Gardaa, Gurusankar Gopalakrishnan, Devendra Govil, Josef Hoppe, Maneel Reddy Karri, Jude Khouja, Manuel Lecha, Neal Livesay, Jan Meißner, Soham Mukherjee, Alexander Nikitin, Theodore Papamarkou, Jaro Prílepok, Karthikeyan Natesan Ramamurthy, Paul Rosen, Aldo Guzmán-Sáenz, Alessandro Salatiello, Shreyas N. Samaga, Simone Scardapane, Michael T. Schaub, Luca Scofano, Indro Spinelli, Lev Telyatnikov, Quang Truong, Robin Walters, Maosheng Yang, Olga Zaghen, Ghada Zamzmi, Ali Zia, Nina Miolane
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Mathematical Software (cs.MS); Computation (stat.CO)
[296] arXiv:2402.02442 [pdf, html, other]
Title: A Momentum Accelerated Algorithm for ReLU-based Nonlinear Matrix Decomposition
Qingsong Wang, Chunfeng Cui, Deren Han
Comments: 5 pages, 7 figures
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[297] arXiv:2402.02446 [pdf, html, other]
Title: LQER: Low-Rank Quantization Error Reconstruction for LLMs
Cheng Zhang, Jianyi Cheng, George A. Constantinides, Yiren Zhao
Comments: Accepted at ICML2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[298] arXiv:2402.02447 [pdf, other]
Title: Breaking MLPerf Training: A Case Study on Optimizing BERT
Yongdeok Kim, Jaehyung Ahn, Myeongwoo Kim, Changin Choi, Heejae Kim, Narankhuu Tuvshinjargal, Seungwon Lee, Yanzi Zhang, Yuan Pei, Xiongzhan Linghu, Jingkun Ma, Lin Chen, Yuehua Dai, Sungjoo Yoo
Comments: Total 15 pages (Appendix 3 pages)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[299] arXiv:2402.02454 [pdf, other]
Title: On the Role of Initialization on the Implicit Bias in Deep Linear Networks
Oria Gruber, Haim Avron
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[300] arXiv:2402.02456 [pdf, other]
Title: tnGPS: Discovering Unknown Tensor Network Structure Search Algorithms via Large Language Models (LLMs)
Junhua Zeng, Chao Li, Zhun Sun, Qibin Zhao, Guoxu Zhou
Comments: Accepted by ICML2024, pre-printed version
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)