Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > stat.ML

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for March 2024

Total of 407 entries : 1-100 101-200 201-300 301-400 401-407
Showing up to 100 entries per page: fewer | more | all
[201] arXiv:2403.04629 (cross-list from cs.LG) [pdf, html, other]
Title: Explaining Bayesian Optimization by Shapley Values Facilitates Human-AI Collaboration
Julian Rodemann, Federico Croppi, Philipp Arens, Yusuf Sale, Julia Herbinger, Bernd Bischl, Eyke Hüllermeier, Thomas Augustin, Conor J. Walsh, Giuseppe Casalicchio
Comments: Preprint. Copyright by the authors. 19 pages, 24 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO); Machine Learning (stat.ML)
[202] arXiv:2403.04726 (cross-list from cs.DS) [pdf, html, other]
Title: A Sub-Quadratic Time Algorithm for Robust Sparse Mean Estimation
Ankit Pensia
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[203] arXiv:2403.04744 (cross-list from cs.LG) [pdf, html, other]
Title: SQ Lower Bounds for Non-Gaussian Component Analysis with Weaker Assumptions
Ilias Diakonikolas, Daniel Kane, Lisheng Ren, Yuxin Sun
Comments: Conference version published in NeurIPS 2023
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Statistics Theory (math.ST); Machine Learning (stat.ML)
[204] arXiv:2403.04747 (cross-list from cs.LG) [pdf, html, other]
Title: GNN-VPA: A Variance-Preserving Aggregation Strategy for Graph Neural Networks
Lisa Schneckenreiter, Richard Freinschlag, Florian Sestak, Johannes Brandstetter, Günter Klambauer, Andreas Mayr
Comments: Accepted at ICLR 2024 (Tiny Papers Track)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[205] arXiv:2403.04764 (cross-list from cs.LG) [pdf, html, other]
Title: TS-RSR: A provably efficient approach for batch Bayesian Optimization
Zhaolin Ren, Na Li
Comments: Accepted by the SIAM Journal on Optimization
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[206] arXiv:2403.04805 (cross-list from cs.LG) [pdf, html, other]
Title: Pruning neural network models for gene regulatory dynamics using data and domain knowledge
Intekhab Hossain, Jonas Fischer, Rebekka Burkholz, John Quackenbush
Comments: Accepted to Conference on Neural Information Processing Systems (NeurIPS) 2024
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM); Applications (stat.AP); Machine Learning (stat.ML)
[207] arXiv:2403.04867 (cross-list from cs.CR) [pdf, other]
Title: Unified Mechanism-Specific Amplification by Subsampling and Group Privacy Amplification
Jan Schuchardt, Mihail Stoian, Arthur Kosmala, Stephan Günnemann
Comments: Accepted at NeurIPS 2024
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[208] arXiv:2403.04975 (cross-list from math.OC) [pdf, html, other]
Title: Deep Backward and Galerkin Methods for the Finite State Master Equation
Asaf Cohen, Mathieu Laurière, Ethan Zell
Subjects: Optimization and Control (math.OC); Machine Learning (stat.ML)
[209] arXiv:2403.04978 (cross-list from cs.LG) [pdf, html, other]
Title: Stacking as Accelerated Gradient Descent
Naman Agarwal, Pranjal Awasthi, Satyen Kale, Eric Zhao
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[210] arXiv:2403.05006 (cross-list from cs.LG) [pdf, html, other]
Title: Provable Multi-Party Reinforcement Learning with Diverse Human Feedback
Huiying Zhong, Zhun Deng, Weijie J. Su, Zhiwei Steven Wu, Linjun Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[211] arXiv:2403.05175 (cross-list from cs.LG) [pdf, html, other]
Title: Continual Learning and Catastrophic Forgetting
Gido M. van de Ven, Nicholas Soures, Dhireesha Kudithipudi
Comments: Preprint of a book chapter; 21 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC); Machine Learning (stat.ML)
[212] arXiv:2403.05293 (cross-list from cs.LG) [pdf, html, other]
Title: Leveraging Continuous Time to Understand Momentum When Training Diagonal Linear Networks
Hristo Papazov, Scott Pesme, Nicolas Flammarion
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[213] arXiv:2403.05358 (cross-list from cs.CY) [pdf, html, other]
Title: Variational Inference of Parameters in Opinion Dynamics Models
Jacopo Lenti, Fabrizio Silvestri, Gianmarco De Francisci Morales
Subjects: Computers and Society (cs.CY); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[214] arXiv:2403.05446 (cross-list from cs.LG) [pdf, html, other]
Title: An Improved Algorithm for Learning Drifting Discrete Distributions
Alessio Mazzetto
Comments: To be published in AISTATS 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[215] arXiv:2403.05490 (cross-list from cs.LG) [pdf, html, other]
Title: Poly-View Contrastive Learning
Amitis Shidani, Devon Hjelm, Jason Ramapuram, Russ Webb, Eeshan Gunesh Dhekane, Dan Busbridge
Comments: Accepted to ICLR 2024. 42 pages, 7 figures, 3 tables, loss pseudo-code included in appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (stat.ML)
[216] arXiv:2403.05529 (cross-list from cs.LG) [pdf, html, other]
Title: Computational-Statistical Gaps in Gaussian Single-Index Models
Alex Damian, Loucas Pillaud-Vivien, Jason D. Lee, Joan Bruna
Comments: 61 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[217] arXiv:2403.05600 (cross-list from cs.LG) [pdf, html, other]
Title: Density-Regression: Efficient and Distance-Aware Deep Regressor for Uncertainty Estimation under Distribution Shifts
Ha Manh Bui, Anqi Liu
Comments: International Conference on Artificial Intelligence and Statistics, 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[218] arXiv:2403.05759 (cross-list from cs.LG) [pdf, html, other]
Title: Membership Testing in Markov Equivalence Classes via Independence Query Oracles
Jiaqi Zhang, Kirankumar Shiragur, Caroline Uhler
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[219] arXiv:2403.06100 (cross-list from cs.HC) [pdf, html, other]
Title: Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Yusuke Yasuda, Tomoki Toda
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[220] arXiv:2403.06183 (cross-list from cs.LG) [pdf, html, other]
Title: An Improved Analysis of Langevin Algorithms with Prior Diffusion for Non-Log-Concave Sampling
Xunpeng Huang, Hanze Dong, Difan Zou, Tong Zhang
Comments: 32 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
[221] arXiv:2403.06230 (cross-list from cs.LG) [pdf, html, other]
Title: LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem
Yun-Ang Wu, Yun-Da Tsai, Shou-De Lin
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[222] arXiv:2403.06235 (cross-list from cs.LG) [pdf, html, other]
Title: Probabilistic Neural Circuits
Pedro Zuidberg Dos Martires
Comments: Proceedings of the AAAI Conference on Artificial Intelligence
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[223] arXiv:2403.06311 (cross-list from cs.LG) [pdf, html, other]
Title: How much data do you need? Part 2: Predicting DL class specific training dataset sizes
Thomas Mühlenstädt, Jelena Frtunikj
Comments: 17 pages, 10 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[224] arXiv:2403.06560 (cross-list from cs.LG) [pdf, html, other]
Title: Sliced-Wasserstein Distances and Flows on Cartan-Hadamard Manifolds
Clément Bonet, Lucas Drumetz, Nicolas Courty
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[225] arXiv:2403.06571 (cross-list from cs.LG) [pdf, other]
Title: Scalable Online Exploration via Coverability
Philip Amortila, Dylan J. Foster, Akshay Krishnamurthy
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[226] arXiv:2403.06807 (cross-list from cs.LG) [pdf, html, other]
Title: Multistep Consistency Models
Jonathan Heek, Emiel Hoogeboom, Tim Salimans
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[227] arXiv:2403.06812 (cross-list from cs.LG) [pdf, html, other]
Title: Monotone Individual Fairness
Yahav Bechavod
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[228] arXiv:2403.06826 (cross-list from cs.LG) [pdf, html, other]
Title: In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai, Federico Tomasi, Sina Ghiassian
Comments: Published at ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[229] arXiv:2403.06871 (cross-list from cs.LG) [pdf, html, other]
Title: On the Generalization Ability of Unsupervised Pretraining
Yuyang Deng, Junyuan Hong, Jiayu Zhou, Mehrdad Mahdavi
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[230] arXiv:2403.06903 (cross-list from cs.LG) [pdf, html, other]
Title: Benign overfitting in leaky ReLU networks with moderate input dimension
Kedar Karhadkar, Erin George, Michael Murray, Guido Montúfar, Deanna Needell
Comments: 39 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[231] arXiv:2403.06925 (cross-list from cs.LG) [pdf, html, other]
Title: Transformers Learn Low Sensitivity Functions: Investigations and Implications
Bhavya Vasudeva, Deqing Fu, Tianyi Zhou, Elliott Kau, Youqi Huang, Vatsal Sharan
Comments: ICLR 2025. 24 pages, 19 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[232] arXiv:2403.06942 (cross-list from eess.SY) [pdf, html, other]
Title: Grid Monitoring with Synchro-Waveform and AI Foundation Model Technologies
Lang Tong, Xinyi Wang, Qing Zhao
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Machine Learning (stat.ML)
[233] arXiv:2403.07004 (cross-list from cs.AI) [pdf, html, other]
Title: Convergence of Some Convex Message Passing Algorithms to a Fixed Point
Vaclav Voracek, Tomas Werner
Comments: ICML 2024; comments are welcome
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[234] arXiv:2403.07031 (cross-list from cs.LG) [pdf, other]
Title: Cramming Contextual Bandits for On-policy Statistical Evaluation
Zeyang Jia, Kosuke Imai, Michael Lingzhi Li
Subjects: Machine Learning (cs.LG); Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
[235] arXiv:2403.07136 (cross-list from cs.LG) [pdf, other]
Title: On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency
David Cheikhi, Daniel Russo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[236] arXiv:2403.07148 (cross-list from math.OC) [pdf, other]
Title: Stochastic Extragradient with Random Reshuffling: Improved Convergence for Variational Inequalities
Konstantinos Emmanouilidis, René Vidal, Nicolas Loizou
Comments: AISTATS 2024. Changes in v2: Some minor typos were fixed; Statement and proof of Theorem 2.3 were updated and improved
Subjects: Optimization and Control (math.OC); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Machine Learning (stat.ML)
[237] arXiv:2403.07185 (cross-list from cs.LG) [pdf, html, other]
Title: Uncertainty in Graph Neural Networks: A Survey
Fangxin Wang, Yuqing Liu, Kay Liu, Yibo Wang, Sourav Medya, Philip S. Yu
Comments: 14 main pages, 4 figures, 1 table
Journal-ref: Transactions on Machine Learning Research (11/2024)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[238] arXiv:2403.07213 (cross-list from cs.LG) [pdf, html, other]
Title: Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits
Yu Xia, Fang Kong, Tong Yu, Liya Guo, Ryan A. Rossi, Sungchul Kim, Shuai Li
Comments: Accepted by WWW'24 (Oral)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[239] arXiv:2403.07263 (cross-list from cs.CV) [pdf, html, other]
Title: Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction
Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann, Eric Nalisnick
Comments: European Conference on Computer Vision (ECCV) 2024; 37 pages, 14 figures, 6 tables (incl. appendix)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[240] arXiv:2403.07379 (cross-list from cs.LG) [pdf, html, other]
Title: Hallmarks of Optimization Trajectories in Neural Networks: Directional Exploration and Redundancy
Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf
Comments: Preprint, 57 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[241] arXiv:2403.07442 (cross-list from cs.LG) [pdf, html, other]
Title: Proxy Methods for Domain Adaptation
Katherine Tsai, Stephen R. Pfohl, Olawale Salaudeen, Nicole Chiou, Matt J. Kusner, Alexander D'Amour, Sanmi Koyejo, Arthur Gretton
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[242] arXiv:2403.07456 (cross-list from cs.LG) [pdf, html, other]
Title: A tutorial on multi-view autoencoders using the multi-view-AE library
Ana Lawry Aguila, Andre Altmann
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[243] arXiv:2403.07464 (cross-list from math.ST) [pdf, html, other]
Title: On Ranking-based Tests of Independence
Myrto Limnios (UCPH), Stéphan Clémençon (LTCI, IDS, S2A, IP Paris)
Subjects: Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[244] arXiv:2403.07495 (cross-list from stat.CO) [pdf, html, other]
Title: Tuning diagonal scale matrices for HMC
Jimmy Huy Tran, Tore Selland Kleppe
Subjects: Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
[245] arXiv:2403.07723 (cross-list from cs.LG) [pdf, html, other]
Title: On the Last-Iterate Convergence of Shuffling Gradient Methods
Zijian Liu, Zhengyuan Zhou
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[246] arXiv:2403.07724 (cross-list from cs.LG) [pdf, html, other]
Title: Balancing Fairness and Accuracy in Data-Restricted Binary Classification
Zachary McBride Lazri, Danial Dervovic, Antigoni Polychroniadou, Ivan Brugere, Dana Dachman-Soled, Min Wu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (stat.ML)
[247] arXiv:2403.07735 (cross-list from math.ST) [pdf, html, other]
Title: The Minimax Rate of HSIC Estimation for Translation-Invariant Kernels
Florian Kalinke, Zoltan Szabo
Comments: Accepted for publication at NeurIPS 2024
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Machine Learning (stat.ML)
[248] arXiv:2403.07862 (cross-list from math.ST) [pdf, html, other]
Title: Low coordinate degree algorithms I: Universality of computational thresholds for hypothesis testing
Dmitriy Kunisky
Comments: 49 pages
Subjects: Statistics Theory (math.ST); Data Structures and Algorithms (cs.DS); Probability (math.PR); Machine Learning (stat.ML)
[249] arXiv:2403.07929 (cross-list from cs.LG) [pdf, html, other]
Title: Sketching the Heat Kernel: Using Gaussian Processes to Embed Data
Anna C. Gilbert, Kevin O'Neill
Comments: 28 pages
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[250] arXiv:2403.08118 (cross-list from stat.ME) [pdf, html, other]
Title: Characterising harmful data sources when constructing multi-fidelity surrogate models
Nicolau Andrés-Thió, Mario Andrés Muñoz, Kate Smith-Miles
Subjects: Methodology (stat.ME); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[251] arXiv:2403.08121 (cross-list from cs.LG) [pdf, html, other]
Title: Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations
Akshay Kumar, Jarvis Haupt
Comments: tmlr-final-version
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[252] arXiv:2403.08194 (cross-list from cs.LG) [pdf, html, other]
Title: Unsupervised Learning of Hybrid Latent Dynamics: A Learn-to-Identify Framework
Yubo Ye, Sumeet Vadhavkar, Xiajun Jiang, Ryan Missel, Huafeng Liu, Linwei Wang
Comments: Under Review
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[253] arXiv:2403.08220 (cross-list from math.NA) [pdf, html, other]
Title: Derivative-informed neural operator acceleration of geometric MCMC for infinite-dimensional Bayesian inverse problems
Lianghao Cao, Thomas O'Leary-Roseberry, Omar Ghattas
Comments: Updated manuscript: changed title, changed format, typo correction, and minor terminology changes
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[254] arXiv:2403.08331 (cross-list from cs.LG) [pdf, other]
Title: Bayesian Optimization that Limits Search Region to Lower Dimensions Utilizing Local GPR
Yasunori Taguchi, Hiro Gangi
Comments: 8 pages, 13 figures, 22nd International Conference on Machine Learning and Applications (ICMLA2023)
Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 202-209
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[255] arXiv:2403.08335 (cross-list from cs.LG) [pdf, html, other]
Title: A Sparsity Principle for Partially Observable Causal Representation Learning
Danru Xu, Dingling Yao, Sébastien Lachapelle, Perouz Taslakian, Julius von Kügelgen, Francesco Locatello, Sara Magliacane
Comments: 45 pages, 32 figures, 16 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[256] arXiv:2403.08609 (cross-list from cs.LG) [pdf, html, other]
Title: On the Convergence of Locally Adaptive and Scalable Diffusion-Based Sampling Methods for Deep Bayesian Neural Network Posteriors
Tim Rensmeyer, Oliver Niggemann
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[257] arXiv:2403.08618 (cross-list from cs.LG) [pdf, html, other]
Title: SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness
Sangamesh Kodge, Deepak Ravikumar, Gobinda Saha, Kaushik Roy
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[258] arXiv:2403.08635 (cross-list from cs.LG) [pdf, html, other]
Title: Human Alignment of Large Language Models through Online Preference Optimisation
Daniele Calandriello, Daniel Guo, Remi Munos, Mark Rowland, Yunhao Tang, Bernardo Avila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu, Rishabh Joshi, Zeyu Zheng, Bilal Piot
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[259] arXiv:2403.08652 (cross-list from cs.LG) [pdf, html, other]
Title: Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks
Paul Ardis, Arjuna Flenner
Comments: 8 pages, 5 figures, SPIE DCS 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[260] arXiv:2403.08673 (cross-list from cs.LG) [pdf, other]
Title: When can we Approximate Wide Contrastive Models with Neural Tangent Kernels and Principal Component Analysis?
Gautham Govind Anil, Pascal Esser, Debarghya Ghoshdastidar
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[261] arXiv:2403.08699 (cross-list from cs.LG) [pdf, html, other]
Title: Implicit Regularization of Gradient Flow on One-Layer Softmax Attention
Heejune Sheen, Siyu Chen, Tianhao Wang, Harrison H. Zhou
Comments: 34 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[262] arXiv:2403.08819 (cross-list from cs.LG) [pdf, html, other]
Title: Thermometer: Towards Universal Calibration for Large Language Models
Maohao Shen, Subhro Das, Kristjan Greenewald, Prasanna Sattigeri, Gregory Wornell, Soumya Ghosh
Comments: Camera ready version for ICML 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[263] arXiv:2403.08837 (cross-list from cs.LG) [pdf, html, other]
Title: Cyclic Data Parallelism for Efficient Parallelism of Deep Neural Networks
Louis Fournier (MLIA), Edouard Oyallon
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[264] arXiv:2403.08854 (cross-list from hep-ph) [pdf, html, other]
Title: Moments of Clarity: Streamlining Latent Spaces in Machine Learning using Moment Pooling
Rikab Gambhir, Athis Osathapan, Jesse Thaler
Comments: 15+7 pages, 14 figures, 7 tables. Code available at this https URL and this https URL v2: Updated to match journal version
Subjects: High Energy Physics - Phenomenology (hep-ph); Machine Learning (cs.LG); Machine Learning (stat.ML)
[265] arXiv:2403.09123 (cross-list from cs.LG) [pdf, other]
Title: Optimal Top-Two Method for Best Arm Identification and Fluid Analysis
Agniv Bandyopadhyay, Sandeep Juneja, Shubhada Agrawal
Comments: To appear in NeurIPS 2024
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[266] arXiv:2403.09130 (cross-list from cond-mat.stat-mech) [pdf, html, other]
Title: Viral Load Inference in Non-Adaptive Pooled Testing
Mansoor Sheikh, David Saad
Subjects: Statistical Mechanics (cond-mat.stat-mech); Applications (stat.AP); Machine Learning (stat.ML)
[267] arXiv:2403.09170 (cross-list from math.ST) [pdf, other]
Title: Analysis of singular subspaces under random perturbations
Ke Wang
Comments: Improved the results in the applications and updated the references
Subjects: Statistics Theory (math.ST); Numerical Analysis (math.NA); Probability (math.PR); Machine Learning (stat.ML)
[268] arXiv:2403.09215 (cross-list from cs.LG) [pdf, html, other]
Title: On the Laplace Approximation as Model Selection Criterion for Gaussian Processes
Andreas Besginow, Jan David Hüwel, Thomas Pawellek, Christian Beecks, Markus Lange-Hegermann
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[269] arXiv:2403.09300 (cross-list from cs.LG) [pdf, html, other]
Title: Recursive Causal Discovery
Ehsan Mokhtarian, Sepehr Elahi, Sina Akbari, Negar Kiyavash
Comments: 50 pages, 5 tables, 11 algorithms, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[270] arXiv:2403.09416 (cross-list from stat.CO) [pdf, html, other]
Title: Scalability of Metropolis-within-Gibbs schemes for high-dimensional Bayesian models
Filippo Ascolani, Gareth O. Roberts, Giacomo Zanella
Subjects: Computation (stat.CO); Statistics Theory (math.ST); Machine Learning (stat.ML)
[271] arXiv:2403.09465 (cross-list from cs.DS) [pdf, other]
Title: Outlier Robust Multivariate Polynomial Regression
Vipul Arora, Arnab Bhattacharyya, Mathews Boban, Venkatesan Guruswami, Esty Kelman
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[272] arXiv:2403.09604 (cross-list from stat.ME) [pdf, html, other]
Title: Extremal graphical modeling with latent variables via convex optimization
Sebastian Engelke, Armeen Taeb
Comments: Journal of Machine Learning Research, 2025
Subjects: Methodology (stat.ME); Statistics Theory (math.ST); Machine Learning (stat.ML)
[273] arXiv:2403.09621 (cross-list from cs.LG) [pdf, other]
Title: Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning
Zhishuai Liu, Pan Xu
Comments: 46 pages, 3 figures, 1 table. Published in Proc. of the 38th Conference on Advances in Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[274] arXiv:2403.09701 (cross-list from cs.LG) [pdf, html, other]
Title: A Natural Extension To Online Algorithms For Hybrid RL With Limited Coverage
Kevin Tan, Ziping Xu
Comments: Submitted to the reinforcement learning conference
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[275] arXiv:2403.09960 (cross-list from math.ST) [pdf, other]
Title: Multivariate Gaussian Approximation for Random Forest via Region-based Stabilization
Zhaoyang Shi, Chinmoy Bhattacharjee, Krishnakumar Balasubramanian, Wolfgang Polonik
Subjects: Statistics Theory (math.ST); Probability (math.PR); Machine Learning (stat.ML)
[276] arXiv:2403.10168 (cross-list from cs.LG) [pdf, html, other]
Title: Explainability through uncertainty: Trustworthy decision-making with neural networks
Arthur Thuy, Dries F. Benoit
Comments: Accepted Manuscript version of an article published in the European Journal of Operational Research
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[277] arXiv:2403.10175 (cross-list from cs.LG) [pdf, html, other]
Title: A Short Survey on Importance Weighting for Machine Learning
Masanari Kimura, Hideitsu Hino
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[278] arXiv:2403.10182 (cross-list from cs.LG) [pdf, html, other]
Title: Fast and reliable uncertainty quantification with neural network ensembles for industrial image classification
Arthur Thuy, Dries F. Benoit
Comments: Accepted Manuscript version of an article published in Annals of Operations Research
Journal-ref: Ann Oper Res (2024)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[279] arXiv:2403.10416 (cross-list from cs.LG) [pdf, html, other]
Title: Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination
Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Statistics Theory (math.ST); Machine Learning (stat.ML)
[280] arXiv:2403.10424 (cross-list from cs.LG) [pdf, html, other]
Title: Structured Evaluation of Synthetic Tabular Data
Scott Cheng-Hsin Yang, Baxter Eaves, Michael Schmidt, Ken Swanson, Patrick Shafto
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[281] arXiv:2403.10459 (cross-list from cs.LG) [pdf, other]
Title: Understanding the Double Descent Phenomenon in Deep Learning
Marc Lafon, Alexandre Thomas
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[282] arXiv:2403.10610 (cross-list from cs.LG) [pdf, html, other]
Title: Sequential Monte Carlo for Inclusive KL Minimization in Amortized Variational Inference
Declan McNamara, Jackson Loper, Jeffrey Regier
Comments: Accepted to the International Conference on Artificial Intelligence and Statistics (AISTATS 2024)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[283] arXiv:2403.10638 (cross-list from cs.LG) [pdf, html, other]
Title: A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food
Conor M. Artman, Aditya Mate, Ezinne Nwankwo, Aliza Heching, Tsuyoshi Idé, Jiří Navrátil, Karthikeyan Shanmugam, Wei Sun, Kush R. Varshney, Lauri Goldkind, Gidi Kroch, Jaclyn Sawyer, Ian Watson
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[284] arXiv:2403.10771 (cross-list from cs.LG) [pdf, html, other]
Title: A Probabilistic Approach for Model Alignment with Human Comparisons
Junyu Cao, Mohsen Bayati
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[285] arXiv:2403.10819 (cross-list from cs.LG) [pdf, html, other]
Title: Incentivized Exploration of Non-Stationary Stochastic Bandits
Sourav Chakraborty, Lijun Chen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[286] arXiv:2403.10889 (cross-list from cs.LG) [pdf, html, other]
Title: List Sample Compression and Uniform Convergence
Steve Hanneke, Shay Moran, Tom Waknine
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[287] arXiv:2403.10903 (cross-list from cs.LG) [pdf, html, other]
Title: DTOR: Decision Tree Outlier Regressor to explain anomalies
Riccardo Crupi, Daniele Regoli, Alessandro Damiano Sabatino, Immacolata Marano, Massimiliano Brinis, Luca Albertazzi, Andrea Cirillo, Andrea Claudio Cosentini
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[288] arXiv:2403.10923 (cross-list from cs.LG) [pdf, html, other]
Title: Interpretable Machine Learning for TabPFN
David Rundel, Julius Kobialka, Constantin von Crailsheim, Matthias Feurer, Thomas Nagler, David Rügamer
Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in Explainable Artificial Intelligence, and is available online at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation (stat.CO); Machine Learning (stat.ML)
[289] arXiv:2403.11343 (cross-list from cs.LG) [pdf, other]
Title: Federated Transfer Learning with Differential Privacy
Mengchu Li, Ye Tian, Yang Feng, Yi Yu
Comments: 89 pages, 4 figures
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[290] arXiv:2403.11348 (cross-list from cs.LG) [pdf, html, other]
Title: COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits
Mintong Kang, Nezihe Merve Gürel, Linyi Li, Bo Li
Comments: Accepted to ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[291] arXiv:2403.11351 (cross-list from math.OC) [pdf, html, other]
Title: A Semidefinite Programming-Based Branch-and-Cut Algorithm for Biclustering
Antonio M. Sudoso
Journal-ref: INFORMS Journal on Computing, 2024
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[292] arXiv:2403.11477 (cross-list from cs.LG) [pdf, other]
Title: Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs
Matthew Zurek, Yudong Chen
Comments: Revision adds Theorem 3 on the difficulty of estimating the span of the optimal bias. arXiv admin note: text overlap with arXiv:2311.13469
Journal-ref: Conference on Neural Information Processing Systems (NeurIPS), 2024
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
[293] arXiv:2403.11497 (cross-list from cs.CV) [pdf, html, other]
Title: A Sober Look at the Robustness of CLIPs to Spurious Features
Qizhou Wang, Yong Lin, Yongqiang Chen, Ludwig Schmidt, Bo Han, Tong Zhang
Comments: NeurIPS 2024; Qizhou Wang, Yong Lin, and Yongqiang Chen contributed equally; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[294] arXiv:2403.11520 (cross-list from cs.LG) [pdf, html, other]
Title: State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards
Yuto Tanimoto, Kenji Fukumizu
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[295] arXiv:2403.11637 (cross-list from cs.LG) [pdf, html, other]
Title: The Value of Reward Lookahead in Reinforcement Learning
Nadav Merlis, Dorian Baudry, Vianney Perchet
Comments: Accepted to NeurIPS 2024 as spotlight
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[296] arXiv:2403.11696 (cross-list from cs.LG) [pdf, other]
Title: Generalization error of spectral algorithms
Maksim Velikanov, Maxim Panov, Dmitry Yarotsky
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[297] arXiv:2403.11743 (cross-list from cs.LG) [pdf, html, other]
Title: PARMESAN: Parameter-Free Memory Search and Transduction for Dense Prediction Tasks
Philip Matthias Winter, Maria Wimmer, David Major, Dimitrios Lenis, Astrid Berg, Theresa Neubauer, Gaia Romana De Paolis, Johannes Novotny, Sophia Ulonska, Katja Bühler
Comments: This is the author's accepted manuscript of a paper published in Lecture Notes in Computer Science (LNCS), volume 15297, Proceedings of DAGM GCPR 2024. 25 pages, 7 figures
Journal-ref: LNCS, volume 15297, 2025
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[298] arXiv:2403.11782 (cross-list from cs.LG) [pdf, html, other]
Title: A tutorial on learning from preferences and choices with Gaussian Processes
Alessio Benavoli, Dario Azzimonti
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[299] arXiv:2403.11960 (cross-list from cs.LG) [pdf, html, other]
Title: Causality-Aware Spatiotemporal Graph Neural Networks for Spatiotemporal Time Series Imputation
Baoyu Jing, Dawei Zhou, Kan Ren, Carl Yang
Comments: Accepted by CIKM'2024. Fixed typos
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[300] arXiv:2403.11963 (cross-list from cs.LG) [pdf, html, other]
Title: Transfer Learning Beyond Bounded Density Ratios
Alkis Kalavasis, Ilias Zadik, Manolis Zampetakis
Comments: Abstract shortened to fit ArXiv requirements
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Statistics Theory (math.ST); Machine Learning (stat.ML)
Total of 407 entries : 1-100 101-200 201-300 301-400 401-407
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack