close this message
arXiv smileybones

arXiv Is Hiring a DevOps Engineer

Work on one of the world's most important websites and make an impact on open science.

View Jobs
Skip to main content
Cornell University

arXiv Is Hiring a DevOps Engineer

View Jobs
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > stat.ML

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for February 2024

Total of 673 entries : 1-100 101-200 201-300 301-400 401-500 501-600 601-673
Showing up to 100 entries per page: fewer | more | all
[401] arXiv:2402.07087 (cross-list from cs.LG) [pdf, html, other]
Title: Self-Correcting Self-Consuming Loops for Generative Model Training
Nate Gillman, Michael Freeman, Daksh Aggarwal, Chia-Hong Hsu, Calvin Luo, Yonglong Tian, Chen Sun
Comments: Camera ready version (ICML 2024). Code at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[402] arXiv:2402.07114 (cross-list from cs.LG) [pdf, other]
Title: Towards Quantifying the Preconditioning Effect of Adam
Rudrajit Das, Naman Agarwal, Sujay Sanghavi, Inderjit S. Dhillon
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC); Machine Learning (stat.ML)
[403] arXiv:2402.07193 (cross-list from cs.LG) [pdf, html, other]
Title: Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent
Liu Ziyin, Mingze Wang, Hongchao Li, Lei Wu
Comments: NeurIPS camera ready
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[404] arXiv:2402.07211 (cross-list from cs.LG) [pdf, html, other]
Title: Towards Fast Stochastic Sampling in Diffusion Generative Models
Kushagra Pandey, Maja Rudolph, Stephan Mandt
Comments: Accepted in the NeurIPS'23 Workshop on Diffusion Models. Full version of this work can be found at arXiv:2310.07894
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[405] arXiv:2402.07240 (cross-list from math.ST) [pdf, other]
Title: Oja's Algorithm for Streaming Sparse PCA
Syamantak Kumar, Purnamrita Sarkar
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
[406] arXiv:2402.07248 (cross-list from cs.LG) [pdf, other]
Title: Depth Separations in Neural Networks: Separating the Dimension from the Accuracy
Itay Safran, Daniel Reichman, Paul Valiant
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[407] arXiv:2402.07296 (cross-list from math.ST) [pdf, other]
Title: Estimating the Mixing Coefficients of Geometrically Ergodic Markov Processes
Steffen Grünewälder, Azadeh Khaleghi
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[408] arXiv:2402.07309 (cross-list from cs.LG) [pdf, other]
Title: HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs
Adrián Bazaga, Pietro Liò, Gos Micklem
Comments: EMNLP 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[409] arXiv:2402.07314 (cross-list from cs.LG) [pdf, html, other]
Title: Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
Chenlu Ye, Wei Xiong, Yuheng Zhang, Hanze Dong, Nan Jiang, Tong Zhang
Comments: RLHF, Preference Learning, Alignment for LLMs
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[410] arXiv:2402.07340 (cross-list from cs.LG) [pdf, html, other]
Title: Perfect Recovery for Random Geometric Graph Matching with Shallow Graph Neural Networks
Suqi Liu, Morgane Austern
Comments: 27 pages, 5 figures, 3 tables; to appear in the Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Social and Information Networks (cs.SI); Probability (math.PR); Statistics Theory (math.ST); Machine Learning (stat.ML)
[411] arXiv:2402.07355 (cross-list from math.ST) [pdf, html, other]
Title: Sampling from the Mean-Field Stationary Distribution
Yunbum Kook, Matthew S. Zhang, Sinho Chewi, Murat A. Erdogdu, Mufan Bill Li
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
[412] arXiv:2402.07356 (cross-list from cs.LG) [pdf, other]
Title: A Novel Gaussian Min-Max Theorem and its Applications
Danil Akhtiamov, David Bosch, Reza Ghane, K Nithin Varma, Babak Hassibi
Comments: Added more references to related works
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[413] arXiv:2402.07388 (cross-list from math.ST) [pdf, html, other]
Title: The Limits of Assumption-free Tests for Algorithm Performance
Yuetian Luo, Rina Foygel Barber
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
[414] arXiv:2402.07407 (cross-list from eess.SY) [pdf, html, other]
Title: Conformal Predictive Programming for Chance Constrained Optimization
Yiqi Zhao, Xinyi Yu, Matteo Sesia, Jyotirmoy V. Deshmukh, Lars Lindemann
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[415] arXiv:2402.07419 (cross-list from cs.LG) [pdf, other]
Title: Conditional Generative Models are Sufficient to Sample from Any Causal Effect Estimand
Md Musfiqur Rahman, Matt Jordan, Murat Kocaoglu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[416] arXiv:2402.07453 (cross-list from cs.LG) [pdf, html, other]
Title: Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs
Yuval Filmus, Steve Hanneke, Idan Mehalel, Shay Moran
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[417] arXiv:2402.07458 (cross-list from cs.LG) [pdf, html, other]
Title: On the Distance from Calibration in Sequential Prediction
Mingda Qiao, Letian Zheng
Comments: To appear at COLT 2024; v2 fixed minor typos
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[418] arXiv:2402.07465 (cross-list from cs.LG) [pdf, other]
Title: Score-Based Physics-Informed Neural Networks for High-Dimensional Fokker-Planck Equations
Zheyuan Hu, Zhongqiang Zhang, George Em Karniadakis, Kenji Kawaguchi
Comments: 22 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Dynamical Systems (math.DS); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[419] arXiv:2402.07521 (cross-list from stat.ME) [pdf, other]
Title: A step towards the integration of machine learning and small area estimation
Tomasz Żądło, Adam Chwila
Subjects: Methodology (stat.ME); Econometrics (econ.EM); Machine Learning (stat.ML)
[420] arXiv:2402.07568 (cross-list from cs.LG) [pdf, html, other]
Title: Weisfeiler-Leman at the margin: When more expressivity matters
Billy J. Franks, Christopher Morris, Ameya Velingker, Floris Geerts
Comments: Accepted at ICML 2024. arXiv admin note: text overlap with arXiv:2301.11039
Subjects: Machine Learning (cs.LG); Discrete Mathematics (cs.DM); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[421] arXiv:2402.07588 (cross-list from cs.GT) [pdf, html, other]
Title: Understanding Model Selection For Learning In Strategic Environments
Tinashe Handina, Eric Mazumdar
Comments: NeurIPS 2024
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Machine Learning (stat.ML)
[422] arXiv:2402.07598 (cross-list from cs.LG) [pdf, html, other]
Title: Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model
Mark Rowland, Li Kevin Wenliang, Rémi Munos, Clare Lyle, Yunhao Tang, Will Dabney
Comments: NeurIPS 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[423] arXiv:2402.07613 (cross-list from math.ST) [pdf, html, other]
Title: Global optimality under amenable symmetry constraints
Peter Orbanz
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
[424] arXiv:2402.07712 (cross-list from cs.LG) [pdf, html, other]
Title: Model Collapse Demystified: The Case of Regression
Elvis Dohmatob, Yunzhen Feng, Julia Kempe
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[425] arXiv:2402.07717 (cross-list from math.ST) [pdf, other]
Title: Computationally efficient reductions between some statistical models
Mengqi Lou, Guy Bresler, Ashwin Pananjady
Comments: v2 contains numerical illustrations and more exposition in narrative
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Probability (math.PR); Methodology (stat.ME); Machine Learning (stat.ML)
[426] arXiv:2402.07747 (cross-list from math.ST) [pdf, html, other]
Title: Optimal score estimation via empirical Bayes smoothing
Andre Wibisono, Yihong Wu, Kaylee Yingxi Yang
Comments: COLT 2024; added the new results on extending to beta-Holder scores with beta <= 1
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[427] arXiv:2402.07793 (cross-list from math.OC) [pdf, html, other]
Title: Tuning-Free Stochastic Optimization
Ahmed Khaled, Chi Jin
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[428] arXiv:2402.07821 (cross-list from cs.LG) [pdf, html, other]
Title: On Computationally Efficient Multi-Class Calibration
Parikshit Gopalan, Lunjia Hu, Guy N. Rothblum
Comments: In COLT 2024
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Statistics Theory (math.ST); Machine Learning (stat.ML)
[429] arXiv:2402.07846 (cross-list from cs.LG) [pdf, other]
Title: Generative Modeling of Discrete Joint Distributions by E-Geodesic Flow Matching on Assignment Manifolds
Bastian Boll, Daniel Gonzalez-Alvarado, Christoph Schnörr
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[430] arXiv:2402.07875 (cross-list from cs.LG) [pdf, html, other]
Title: Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
Noam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY); Machine Learning (stat.ML)
[431] arXiv:2402.08010 (cross-list from cs.LG) [pdf, html, other]
Title: Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning
Yuxiao Wen, Arthur Jacot
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[432] arXiv:2402.08018 (cross-list from cs.LG) [pdf, html, other]
Title: Nearest Neighbour Score Estimators for Diffusion Generative Models
Matthew Niedoba, Dylan Green, Saeid Naderiparizi, Vasileios Lioutas, Jonathan Wilder Lavington, Xiaoxuan Liang, Yunpeng Liu, Ke Zhang, Setareh Dabiri, Adam Ścibior, Berend Zwartsenberg, Frank Wood
Comments: 25 pages, 9 figures. To be published in ICML 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[433] arXiv:2402.08097 (cross-list from math.OC) [pdf, html, other]
Title: An Accelerated Gradient Method for Convex Smooth Simple Bilevel Optimization
Jincheng Cao, Ruichen Jiang, Erfan Yazdandoost Hamedani, Aryan Mokhtari
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[434] arXiv:2402.08105 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Cartesian Product Graphs with Laplacian Constraints
Changhao Shi, Gal Mishne
Comments: Accepted to AISTATS 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[435] arXiv:2402.08156 (cross-list from cs.LG) [pdf, html, other]
Title: Differentially Private Distributed Inference
Marios Papachristou, M. Amin Rahimian
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[436] arXiv:2402.08182 (cross-list from cs.LG) [pdf, html, other]
Title: Variational Continual Test-Time Adaptation
Fan Lyu, Kaile Du, Yuyang Li, Hanyu Zhao, Zhang Zhang, Guangcan Liu, Liang Wang
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[437] arXiv:2402.08193 (cross-list from cs.LG) [pdf, html, other]
Title: Gaussian Ensemble Belief Propagation for Efficient Inference in High-Dimensional Systems
Dan MacKinlay, Russell Tsuchida, Dan Pagendam, Petra Kuhnert
Comments: Under conference submission
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[438] arXiv:2402.08229 (cross-list from cs.LG) [pdf, html, other]
Title: Causal Discovery under Off-Target Interventions
Davin Choo, Kirankumar Shiragur, Caroline Uhler
Comments: Accepted into AISTATS 2024
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Methodology (stat.ME); Machine Learning (stat.ML)
[439] arXiv:2402.08283 (cross-list from stat.ME) [pdf, html, other]
Title: Classification Using Global and Local Mahalanobis Distances
Annesha Ghosh, Anil K. Ghosh, Rita SahaRay, Soham Sarkar
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[440] arXiv:2402.08321 (cross-list from cs.LG) [pdf, html, other]
Title: Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring
Taira Tsuchiya, Shinji Ito, Junya Honda
Comments: Published version in Proceedings of 41st International Conference on Machine Learning (ICML 2024), 23 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[441] arXiv:2402.08493 (cross-list from cs.LG) [pdf, other]
Title: Sparsity via Sparse Group $k$-max Regularization
Qinghua Tao, Xiangming Xi, Jun Xu, Johan A.K. Suykens
Comments: 7 pages, accepted to American Control Conference 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[442] arXiv:2402.08530 (cross-list from cs.LG) [pdf, html, other]
Title: A Distributional Analogue to the Successor Representation
Harley Wiltzer, Jesse Farebrother, Arthur Gretton, Yunhao Tang, André Barreto, Will Dabney, Marc G. Bellemare, Mark Rowland
Comments: Accepted to ICML 2024. First two authors contributed equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[443] arXiv:2402.08543 (cross-list from math.ST) [pdf, html, other]
Title: Theoretical Analysis of Leave-one-out Cross Validation for Non-differentiable Penalties under High-dimensional Settings
Haolin Zou, Arnab Auddy, Kamiar Rahnama Rad, Arian Maleki
Comments: 30 pages
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[444] arXiv:2402.08602 (cross-list from math.ST) [pdf, other]
Title: Globally-Optimal Greedy Experiment Selection for Active Sequential Estimation
Xiaoou Li, Hongru Zhao
Subjects: Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[445] arXiv:2402.08621 (cross-list from cs.LG) [pdf, html, other]
Title: A Unified Framework for Analyzing Meta-algorithms in Online Convex Optimization
Mohammad Pedramfar, Vaneet Aggarwal
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[446] arXiv:2402.08667 (cross-list from cs.LG) [pdf, other]
Title: Target Score Matching
Valentin De Bortoli, Michael Hutchinson, Peter Wirnsberger, Arnaud Doucet
Subjects: Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[447] arXiv:2402.08799 (cross-list from cs.LG) [pdf, html, other]
Title: Projection-Free Online Convex Optimization with Time-Varying Constraints
Dan Garber, Ben Kretzu
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[448] arXiv:2402.08808 (cross-list from cs.LG) [pdf, html, other]
Title: Depth Separation in Norm-Bounded Infinite-Width Neural Networks
Suzanna Parkinson, Greg Ongie, Rebecca Willett, Ohad Shamir, Nathan Srebro
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[449] arXiv:2402.08828 (cross-list from stat.ME) [pdf, html, other]
Title: Fusing Individualized Treatment Rules Using Secondary Outcomes
Daiqi Gao, Yuanjia Wang, Donglin Zeng
Journal-ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:712-720, 2024
Subjects: Methodology (stat.ME); Applications (stat.AP); Machine Learning (stat.ML)
[450] arXiv:2402.08856 (cross-list from cs.LG) [pdf, html, other]
Title: Approximation of relation functions and attention mechanisms
Awni Altabaa, John Lafferty
Comments: 24 pages; added discussion on curse of dimensionality in v2
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[451] arXiv:2402.08871 (cross-list from cs.LG) [pdf, html, other]
Title: Position: Topological Deep Learning is the New Frontier for Relational Learning
Theodore Papamarkou, Tolga Birdal, Michael Bronstein, Gunnar Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Liò, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Veličković, Bei Wang, Yusu Wang, Guo-Wei Wei, Ghada Zamzmi
Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[452] arXiv:2402.08922 (cross-list from cs.LG) [pdf, html, other]
Title: The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes
Myeongseob Ko, Feiyang Kang, Weiyan Shi, Ming Jin, Zhou Yu, Ruoxi Jia
Journal-ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[453] arXiv:2402.08929 (cross-list from cs.LG) [pdf, other]
Title: Second Order Methods for Bandit Optimization and Control
Arun Suggala, Y. Jennifer Sun, Praneeth Netrapalli, Elad Hazan
Comments: COLT 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[454] arXiv:2402.08992 (cross-list from math.OC) [pdf, html, other]
Title: Variance Reduction and Low Sample Complexity in Stochastic Optimization via Proximal Point Method
Jiaming Liang
Comments: 23 pages
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[455] arXiv:2402.08998 (cross-list from cs.LG) [pdf, other]
Title: Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu
Comments: 28 pages, 1 figure, In ICML 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[456] arXiv:2402.09033 (cross-list from econ.EM) [pdf, html, other]
Title: Cross-Temporal Forecast Reconciliation at Digital Platforms with Machine Learning
Jeroen Rombouts, Marie Ternes, Ines Wilms
Subjects: Econometrics (econ.EM); Applications (stat.AP); Methodology (stat.ME); Machine Learning (stat.ML)
[457] arXiv:2402.09201 (cross-list from cs.LG) [pdf, html, other]
Title: Better-than-KL PAC-Bayes Bounds
Ilja Kuzborskij, Kwang-Sung Jun, Yulian Wu, Kyoungseok Jang, Francesco Orabona
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[458] arXiv:2402.09226 (cross-list from cs.LG) [pdf, html, other]
Title: Directional Convergence Near Small Initializations and Saddles in Two-Homogeneous Neural Networks
Akshay Kumar, Jarvis Haupt
Comments: tmlr-final-version
Journal-ref: Transactions on Machine Learning Research (06/2024)
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[459] arXiv:2402.09236 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models
Goutham Rajendran, Simon Buchholz, Bryon Aragam, Bernhard Schölkopf, Pradeep Ravikumar
Comments: To appear in NeurIPS 2024 under the modified title 'From Causal to Concept-Based Representation Learning'
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)
[460] arXiv:2402.09373 (cross-list from cs.LG) [pdf, html, other]
Title: Loss Shaping Constraints for Long-Term Time Series Forecasting
Ignacio Hounie, Javier Porras-Valenzuela, Alejandro Ribeiro
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[461] arXiv:2402.09401 (cross-list from cs.LG) [pdf, html, other]
Title: Reinforcement Learning from Human Feedback with Active Queries
Kaixuan Ji, Jiafan He, Quanquan Gu
Comments: 28 pages, 1 figure, 4 table
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Optimization and Control (math.OC); Machine Learning (stat.ML)
[462] arXiv:2402.09456 (cross-list from cs.LG) [pdf, html, other]
Title: Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
Yingru Li, Liangqi Liu, Wenqiang Pu, Hao Liang, Zhi-Quan Luo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Machine Learning (stat.ML)
[463] arXiv:2402.09469 (cross-list from cs.LG) [pdf, html, other]
Title: Fourier Circuits in Neural Networks and Transformers: A Case Study of Modular Arithmetic with Multiple Inputs
Chenyang Li, Yingyu Liang, Zhenmei Shi, Zhao Song, Tianyi Zhou
Comments: AIStats 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[464] arXiv:2402.09470 (cross-list from cs.LG) [pdf, html, other]
Title: Rolling Diffusion Models
David Ruhe, Jonathan Heek, Tim Salimans, Emiel Hoogeboom
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[465] arXiv:2402.09473 (cross-list from cs.LG) [pdf, html, other]
Title: One-for-many Counterfactual Explanations by Column Generation
Andrea Lodi, Jasone Ramírez-Ayerbe
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[466] arXiv:2402.09553 (cross-list from cs.AI) [pdf, html, other]
Title: Statistical and Machine Learning Models for Predicting Fire and Other Emergency Events
Dilli Prasad Sharma, Nasim Beigi-Mohammadi, Hongxiang Geng, Dawn Dixon, Rob Madro, Phil Emmenegger, Carlos Tobar, Jeff Li, Alberto Leon-Garcia
Journal-ref: IEEE Access 12(2024) 56880-56909
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[467] arXiv:2402.09560 (cross-list from cs.LG) [pdf, html, other]
Title: Distribution-Free Rates in Neyman-Pearson Classification
Mohammadreza M. Kalan, Samory Kpotufe
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[468] arXiv:2402.09600 (cross-list from cs.LG) [pdf, html, other]
Title: Low-Rank Graph Contrastive Learning for Node Classification
Yancheng Wang, Yingzhen Yang
Comments: arXiv admin note: text overlap with arXiv:2205.14109
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[469] arXiv:2402.09608 (cross-list from cs.LG) [pdf, html, other]
Title: Exact, Fast and Expressive Poisson Point Processes via Squared Neural Families
Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic
Comments: AAAI 2024 camera ready submission
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[470] arXiv:2402.09654 (cross-list from cs.AI) [pdf, html, other]
Title: GPT-4's assessment of its performance in a USMLE-based case study
Uttam Dhakal, Aniket Kumar Singh, Suman Devkota, Yogesh Sapkota, Bishal Lamichhane, Suprinsa Paudyal, Chandra Dhakal
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[471] arXiv:2402.09698 (cross-list from stat.ME) [pdf, html, other]
Title: Combining Evidence Across Filtrations
Yo Joong Choe, Aaditya Ramdas
Comments: Under review. Previous title was "Combining Evidence Across Filtrations Using Adjusters". Code is available at this https URL
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Probability (math.PR); Statistics Theory (math.ST); Machine Learning (stat.ML)
[472] arXiv:2402.09702 (cross-list from cs.LG) [pdf, html, other]
Title: Sparse and Faithful Explanations Without Sparse Models
Yiyang Sun, Zhi Chen, Vittorio Orlandi, Tong Wang, Cynthia Rudin
Comments: Accepted in AISTATS 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[473] arXiv:2402.09758 (cross-list from stat.ME) [pdf, html, other]
Title: Extrapolation-Aware Nonparametric Statistical Inference
Niklas Pfister, Peter Bühlmann
Subjects: Methodology (stat.ME); Statistics Theory (math.ST); Machine Learning (stat.ML)
[474] arXiv:2402.09807 (cross-list from math.OC) [pdf, html, other]
Title: Two trust region type algorithms for solving nonconvex-strongly concave minimax problems
Tongliang Yao, Zi Xu
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[475] arXiv:2402.09849 (cross-list from cs.LG) [pdf, other]
Title: Recommendations for Baselines and Benchmarking Approximate Gaussian Processes
Sebastian W. Ober, Artem Artemev, Marcel Wagenländer, Rudolfs Grobins, Mark van der Wilk
Comments: Preprint. 25 pages, 16 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[476] arXiv:2402.09891 (cross-list from cs.LG) [pdf, other]
Title: Do causal predictors generalize better to new domains?
Vivian Y. Nastl, Moritz Hardt
Comments: 118 pages, 55 figures, accepted at NeurIPS'24
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[477] arXiv:2402.09941 (cross-list from cs.LG) [pdf, html, other]
Title: FedLion: Faster Adaptive Federated Optimization with Fewer Communication
Zhiwei Tang, Tsung-Hui Chang
Comments: ICASSP 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[478] arXiv:2402.09970 (cross-list from cs.LG) [pdf, html, other]
Title: Accelerating Parallel Sampling of Diffusion Models
Zhiwei Tang, Jiasheng Tang, Hao Luo, Fan Wang, Tsung-Hui Chang
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[479] arXiv:2402.10028 (cross-list from cs.LG) [pdf, other]
Title: Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
Comments: 26 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[480] arXiv:2402.10062 (cross-list from cs.LG) [pdf, other]
Title: Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection
Chao Chen, Zhihang Fu, Kai Liu, Ze Chen, Mingyuan Tao, Jieping Ye
Comments: Accepted by NeurIPS 2023. 19 pages
Journal-ref: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[481] arXiv:2402.10065 (cross-list from cs.LG) [pdf, html, other]
Title: Some Targets Are Harder to Identify than Others: Quantifying the Target-dependent Membership Leakage
Achraf Azize, Debabrota Basu
Comments: Appears in AISTATS 2025 (Oral)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Statistics Theory (math.ST); Machine Learning (stat.ML)
[482] arXiv:2402.10198 (cross-list from cs.LG) [pdf, html, other]
Title: SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention
Romain Ilbert, Ambroise Odonnat, Vasilii Feofanov, Aladin Virmaux, Giuseppe Paolo, Themis Palpanas, Ievgen Redko
Comments: Accepted as an Oral at ICML 2024, Vienna. The first two authors contributed equally
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[483] arXiv:2402.10210 (cross-list from cs.LG) [pdf, other]
Title: Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu
Comments: 28 pages, 8 figures, 10 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[484] arXiv:2402.10227 (cross-list from cs.LG) [pdf, html, other]
Title: Correlational Lagrangian Schrödinger Bridge: Learning Dynamics with Population-Level Regularization
Yuning You, Ruida Zhou, Yang Shen
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[485] arXiv:2402.10228 (cross-list from cs.LG) [pdf, html, other]
Title: Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li, Jiawei Xu, Lei Han, Zhi-Quan Luo
Comments: Proceedings of the $\mathit{41}^{st}$ International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024. Copyright 2024 by the author(s). Invited talk in Informs Optimization Conference 2024 and International Symposium on Mathematical Programming 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[486] arXiv:2402.10252 (cross-list from eess.SY) [pdf, html, other]
Title: Online Control of Linear Systems with Unbounded and Degenerate Noise
Kaito Ito, Taira Tsuchiya
Comments: 26 pages
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[487] arXiv:2402.10282 (cross-list from cs.LG) [pdf, html, other]
Title: Information Capacity Regret Bounds for Bandits with Mediator Feedback
Khaled Eldowa, Nicolò Cesa-Bianchi, Alberto Maria Metelli, Marcello Restelli
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[488] arXiv:2402.10291 (cross-list from cs.LG) [pdf, html, other]
Title: An Evaluation of Real-time Adaptive Sampling Change Point Detection Algorithm using KCUSUM
Vijayalakshmi Saravanan, Perry Siehien, Shinjae Yoo, Hubertus Van Dam, Thomas Flynn, Christopher Kelly, Khaled Z Ibrahim
Comments: 16 pages. arXiv admin note: text overlap with arXiv:1903.01661
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[489] arXiv:2402.10326 (cross-list from math.OC) [pdf, html, other]
Title: Mathematical Opportunities in Digital Twins (MATH-DT)
Harbir Antil
Subjects: Optimization and Control (math.OC); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[490] arXiv:2402.10357 (cross-list from math.ST) [pdf, other]
Title: Efficient Sampling on Riemannian Manifolds via Langevin MCMC
Xiang Cheng, Jingzhao Zhang, Suvrit Sra
Comments: This is an old paper from NeurIPS 2022. arXiv admin note: text overlap with arXiv:2204.13665
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Computation (stat.CO); Machine Learning (stat.ML)
[491] arXiv:2402.10360 (cross-list from cs.LG) [pdf, html, other]
Title: Transductive Learning Is Compact
Julian Asilis, Siddartha Devic, Shaddin Dughmi, Vatsal Sharan, Shang-Hua Teng
Comments: NeurIPS 2024, 18 pages
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Logic in Computer Science (cs.LO); Machine Learning (stat.ML)
[492] arXiv:2402.10445 (cross-list from cs.LG) [pdf, html, other]
Title: Collaborative Learning with Different Labeling Functions
Yuyang Deng, Mingda Qiao
Comments: To appear at ICML 2024; v2 and v3 included additional discussion on related work
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[493] arXiv:2402.10470 (cross-list from cs.LG) [pdf, other]
Title: Theoretical Understanding of Learning from Adversarial Perturbations
Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki
Comments: ICLR24
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[494] arXiv:2402.10474 (cross-list from cs.LG) [pdf, html, other]
Title: One-Bit Quantization and Sparsification for Multiclass Linear Classification with Strong Regularization
Reza Ghane, Danil Akhtiamov, Babak Hassibi
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[495] arXiv:2402.10482 (cross-list from cs.LG) [pdf, html, other]
Title: Rethinking Self-Distillation: Label Averaging and Enhanced Soft Label Refinement with Partial Labels
Hyeonsu Jeong, Hye Won Chung
Comments: ICLR 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[496] arXiv:2402.10504 (cross-list from math.PR) [pdf, html, other]
Title: Resilience of Rademacher chaos of low degree
Elad Aigner-Horev, Daniel Rosenberg, Roi Weiss
Comments: Small corrections from previous version
Subjects: Probability (math.PR); Information Theory (cs.IT); Machine Learning (cs.LG); Combinatorics (math.CO); Machine Learning (stat.ML)
[497] arXiv:2402.10574 (cross-list from econ.EM) [pdf, html, other]
Title: Nowcasting with Mixed Frequency Data Using Gaussian Processes
Niko Hauzenberger, Massimiliano Marcellino, Michael Pfarrhofer, Anna Stelzer
Comments: Keywords: prediction, MIDAS, machine learning, Bayesian additive regression trees; JEL: C11, C22, C53, E31, E37
Subjects: Econometrics (econ.EM); Machine Learning (stat.ML)
[498] arXiv:2402.10592 (cross-list from cs.LG) [pdf, html, other]
Title: Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification
Chao Qin, Daniel Russo
Subjects: Machine Learning (cs.LG); Econometrics (econ.EM); Machine Learning (stat.ML)
[499] arXiv:2402.10774 (cross-list from cs.LG) [pdf, other]
Title: Error Feedback Reloaded: From Quadratic to Arithmetic Mean of Smoothness Constants
Peter Richtárik, Elnur Gasanov, Konstantin Burlachenko
Comments: 70 pages, 14 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[500] arXiv:2402.10797 (cross-list from cs.MS) [pdf, html, other]
Title: BlackJAX: Composable Bayesian inference in JAX
Alberto Cabezas, Adrien Corenflos, Junpeng Lao, Rémi Louf, Antoine Carnec, Kaustubh Chaudhari, Reuben Cohn-Gordon, Jeremie Coullon, Wei Deng, Sam Duffield, Gerardo Durán-Martín, Marcin Elantkowski, Dan Foreman-Mackey, Michele Gregori, Carlos Iguaran, Ravin Kumar, Martin Lysy, Kevin Murphy, Juan Camilo Orduz, Karm Patel, Xi Wang, Rob Zinkov
Comments: Companion paper for the library this https URL Update: minor changes and updated the list of authors to include technical contributors
Subjects: Mathematical Software (cs.MS); Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
Total of 673 entries : 1-100 101-200 201-300 301-400 401-500 501-600 601-673
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack