close this message
arXiv smileybones

arXiv Is Hiring a DevOps Engineer

Work on one of the world's most important websites and make an impact on open science.

View Jobs
Skip to main content
Cornell University

arXiv Is Hiring a DevOps Engineer

View Jobs
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > stat.ML

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for May 2023

Total of 594 entries : 1-100 101-200 201-300 301-400 401-500 ... 501-594
Showing up to 100 entries per page: fewer | more | all
[101] arXiv:2305.12287 [pdf, other]
Title: Contrastive inverse regression for dimension reduction
Sam Hawke, Hengrui Luo, Didong Li
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Applications (stat.AP); Methodology (stat.ME)
[102] arXiv:2305.12313 [pdf, other]
Title: When are ensembles really effective?
Ryan Theisen, Hyunsuk Kim, Yaoqing Yang, Liam Hodgkinson, Michael W. Mahoney
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[103] arXiv:2305.12470 [pdf, other]
Title: Quasi-Monte Carlo Graph Random Features
Isaac Reid, Krzysztof Choromanski, Adrian Weller
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[104] arXiv:2305.12569 [pdf, html, other]
Title: Conditional Generative Modeling for High-dimensional Marked Temporal Point Processes
Zheng Dong, Zekai Fan, Shixiang Zhu
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[105] arXiv:2305.12686 [pdf, other]
Title: Conformal Inference for Invariant Risk Minimization
Wenlu Tang, Zicheng Liu
Comments: arXiv admin note: text overlap with arXiv:2209.11355
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[106] arXiv:2305.13248 [pdf, other]
Title: Bayesian Numerical Integration with Neural Networks
Katharina Ott, Michael Tiemann, Philipp Hennig, François-Xavier Briol
Journal-ref: PMLR 216:1606-1617, 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[107] arXiv:2305.13271 [pdf, html, other]
Title: MAGDiff: Covariate Data Set Shift Detection via Activation Graphs of Deep Neural Networks
Charles Arnal, Felix Hensel, Mathieu Carrière, Théo Lacombe, Hiroaki Kurihara, Yuichi Ike, Frédéric Chazal
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[108] arXiv:2305.13498 [pdf, html, other]
Title: Parameter estimation from an Ornstein-Uhlenbeck process with measurement noise
Simon Carter, Lilianne Mujica-Parodi, Helmut H. Strey
Comments: 14 pages, 7 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[109] arXiv:2305.13517 [pdf, html, other]
Title: Statistical Guarantees of Group-Invariant GANs
Ziyu Chen, Markos A. Katsoulakis, Luc Rey-Bellet, Wei Zhu
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[110] arXiv:2305.13588 [pdf, other]
Title: Deep Learning with Kernels through RKHM and the Perron-Frobenius Operator
Yuka Hashimoto, Masahiro Ikeda, Hachem Kadri
Journal-ref: NeurIPS 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[111] arXiv:2305.13715 [pdf, other]
Title: Covariate balancing using the integral probability metric for causal inference
Insung Kong, Yuha Park, Joonhyuk Jung, Kwonsang Lee, Yongdai Kim
Comments: 32 pages, ICML 2023 proceedings
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[112] arXiv:2305.13882 [pdf, html, other]
Title: Subsampling Error in Stochastic Gradient Langevin Diffusions
Kexin Jin, Chenguang Liu, Jonas Latz
Comments: AISTATS 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[113] arXiv:2305.14077 [pdf, other]
Title: Mind the spikes: Benign overfitting of kernels and neural networks in fixed dimension
Moritz Haas, David Holzmüller, Ulrike von Luxburg, Ingo Steinwart
Comments: Compared to the NeurIPS version (v2), this version strengthens Assumption (K) from d/2<s<=3d/4 to d/2<s<3d/4 and corrects Lemma B.2 by posing additional assumptions. This does not affect any other statements. We provide Python code to reproduce all of our experimental results at this https URL
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[114] arXiv:2305.14442 [pdf, other]
Title: Optimal Preconditioning and Fisher Adaptive Langevin Sampling
Michalis K. Titsias
Comments: 21 pages, 15 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[115] arXiv:2305.14454 [pdf, other]
Title: An Improved Variational Approximate Posterior for the Deep Wishart Process
Sebastian Ober, Ben Anson, Edward Milsom, Laurence Aitchison
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[116] arXiv:2305.14496 [pdf, other]
Title: Optimal Learning via Moderate Deviations Theory
Arnab Ganguly, Tobias Sutter
Comments: 35 pages, 3 figures
Subjects: Machine Learning (stat.ML); Optimization and Control (math.OC); Probability (math.PR); Statistics Theory (math.ST)
[117] arXiv:2305.14543 [pdf, html, other]
Title: Deep Functional Factor Models: Forecasting High-Dimensional Functional Time Series via Bayesian Nonparametric Factorization
Yirui Liu, Xinghao Qiao, Yulong Pei, Liying Wang
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[118] arXiv:2305.14593 [pdf, other]
Title: Discriminative calibration: Check Bayesian computation from simulations and flexible classifier
Yuling Yao, Justin Domke
Comments: Published at Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[119] arXiv:2305.14606 [pdf, other]
Title: Taylor Learning
James Schmidt
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[120] arXiv:2305.14689 [pdf, html, other]
Title: Least Squares Regression Can Exhibit Under-Parameterized Double Descent
Xinyue Li, Rishi Sonthalia
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[121] arXiv:2305.14765 [pdf, other]
Title: Masked Bayesian Neural Networks : Theoretical Guarantee and its Posterior Inference
Insung Kong, Dongyoon Yang, Jongjin Lee, Ilsang Ohn, Gyuseung Baek, Yongdai Kim
Comments: 30 pages, ICML 2023 proceedings. arXiv admin note: substantial text overlap with arXiv:2206.00853
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[122] arXiv:2305.14916 [pdf, html, other]
Title: Tuning-Free Maximum Likelihood Training of Latent Variable Models via Coin Betting
Louis Sharrock, Daniel Dodd, Christopher Nemeth
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[123] arXiv:2305.14943 [pdf, html, other]
Title: Learning Rate Free Sampling in Constrained Domains
Louis Sharrock, Lester Mackey, Christopher Nemeth
Comments: Accepted at NeurIPS 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[124] arXiv:2305.14961 [pdf, other]
Title: Deep Learning for Survival Analysis: A Review
Simon Wiegrebe, Philipp Kopper, Raphael Sonabend, Bernd Bischl, Andreas Bender
Comments: 29 pages, 7 figures, 2 tables, 1 interactive table
Journal-ref: Artif Intell Rev 57, 65 (2024)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[125] arXiv:2305.15022 [pdf, html, other]
Title: Hierarchical clustering with dot products recovers hidden tree structure
Annie Gray, Alexander Modell, Patrick Rubin-Delanchy, Nick Whiteley
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[126] arXiv:2305.15027 [pdf, other]
Title: A Rigorous Link between Deep Ensembles and (Variational) Bayesian Methods
Veit David Wild, Sahra Ghalebikesabi, Dino Sejdinovic, Jeremias Knoblauch
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[127] arXiv:2305.15167 [pdf, other]
Title: Explaining the Uncertain: Stochastic Shapley Values for Gaussian Process Models
Siu Lun Chau, Krikamol Muandet, Dino Sejdinovic
Comments: 26 pages, 6 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[128] arXiv:2305.15208 [pdf, other]
Title: Generalized Bayesian Inference for Scientific Simulators via Amortized Cost Estimation
Richard Gao, Michael Deistler, Jakob H. Macke
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[129] arXiv:2305.15317 [pdf, other]
Title: On the robust learning mixtures of linear regressions
Ying Huang, Liang Chen
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[130] arXiv:2305.15574 [pdf, other]
Title: Deep Stochastic Processes via Functional Markov Transition Operators
Jin Xu, Emilien Dupont, Kaspar Märtens, Tom Rainforth, Yee Whye Teh
Comments: 18 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[131] arXiv:2305.15577 [pdf, html, other]
Title: Minimizing $f$-Divergences by Interpolating Velocity Fields
Song Liu, Jiahao Yu, Jack Simons, Mingxuan Yi, Mark Beaumont
Comments: This manuscript is an extended version of the ICML2024 version. The code for reproducing our results can be found at this https URL
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[132] arXiv:2305.15670 [pdf, other]
Title: Interpretable Machine Learning based on Functional ANOVA Framework: Algorithms and Comparisons
Linwei Hu, Vijayan N. Nair, Agus Sudjianto, Aijun Zhang, Jie Chen
Comments: 24 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2207.06950
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[133] arXiv:2305.15742 [pdf, html, other]
Title: Counterfactual Generative Models for Time-Varying Treatments
Shenghao Wu, Wenbin Zhou, Minshuo Chen, Shixiang Zhu
Comments: Published at KDD'24
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[134] arXiv:2305.15746 [pdf, other]
Title: Assessing the Spatial Structure of the Association between Attendance at Preschool and Childrens Developmental Vulnerabilities in Queensland Australia
wala Draidi Areed, Aiden Price, Kathryn Arnett, Helen Thompson, Reid Malseed, Kerrie Mengersen
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[135] arXiv:2305.15759 [pdf, html, other]
Title: DP-LDMs: Differentially Private Latent Diffusion Models
Michael F. Liu, Saiyue Lyu, Margarita Vinaroz, Mijung Park
Subjects: Machine Learning (stat.ML); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[136] arXiv:2305.15807 [pdf, other]
Title: Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness
Evgenii Chzhen (LMO, CELESTE), Christophe Giraud (LMO, CELESTE), Zhen Li, Gilles Stoltz (LMO, CELESTE, HEC Paris)
Journal-ref: Advances in Neural Information Processing Systems, Dec 2023, New Orleans, United States
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[137] arXiv:2305.15839 [pdf, html, other]
Title: Embeddings between Barron spaces with higher order activation functions
Tjeerd Jan Heeringa, Len Spek, Felix Schwenninger, Christoph Brune
Comments: 21 pages, 1 figure; revision adds extension to fractional RePU and fractional Taylor expansion
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Functional Analysis (math.FA)
[138] arXiv:2305.15871 [pdf, other]
Title: Learning Robust Statistics for Simulation-based Inference under Model Misspecification
Daolang Huang, Ayush Bharti, Amauri Souza, Luigi Acerbi, Samuel Kaski
Comments: 22 pages, 13 figures, Published at NeurIPS 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[139] arXiv:2305.15925 [pdf, html, other]
Title: On the Identifiability of Switching Dynamical Systems
Carles Balsells-Rodas, Yixin Wang, Yingzhen Li
Comments: ICML 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[140] arXiv:2305.15988 [pdf, other]
Title: Non-Log-Concave and Nonsmooth Sampling via Langevin Monte Carlo Algorithms
Tim Tsz-Kit Lau, Han Liu, Thomas Pock
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO); Methodology (stat.ME)
[141] arXiv:2305.16014 [pdf, other]
Title: How many samples are needed to leverage smoothness?
Vivien Cabannes, Stefano Vigogna
Comments: 34 pages, 13 figures
Journal-ref: NeurIPS 2023
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Statistics Theory (math.ST)
[142] arXiv:2305.16041 [pdf, other]
Title: An $\varepsilon$-Best-Arm Identification Algorithm for Fixed-Confidence and Beyond
Marc Jourdan, Rémy Degenne, Emilie Kaufmann
Comments: 68 pages, 14 figures, 4 tables. To be published in the Thirty-seventh Conference on Neural Information Processing Systems
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[143] arXiv:2305.16261 [pdf, other]
Title: Trans-Dimensional Generative Modeling via Jump Diffusion Models
Andrew Campbell, William Harvey, Christian Weilbach, Valentin De Bortoli, Tom Rainforth, Arnaud Doucet
Comments: 41 pages, 11 figures, 8 tables; NeurIPS 2023
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[144] arXiv:2305.16274 [pdf, other]
Title: Non-adversarial training of Neural SDEs with signature kernel scores
Zacharia Issa, Blanka Horvath, Maud Lemercier, Cristopher Salvi
Comments: Code available at this https URL
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computational Finance (q-fin.CP)
[145] arXiv:2305.16530 [pdf, other]
Title: Bi-fidelity Variational Auto-encoder for Uncertainty Quantification
Nuojin Cheng, Osman Asif Malik, Subhayan De, Stephen Becker, Alireza Doostan
Journal-ref: Computer Methods in Applied Mechanics and Engineering (CMAME), Volume 421, 1 March 2024, 116793
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[146] arXiv:2305.16534 [pdf, html, other]
Title: Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network Compression
Joseph Shenouda, Rahul Parhi, Kangwook Lee, Robert D. Nowak
Comments: Updated to version published in JMLR
Journal-ref: Journal of Machine Learning Research, vol. 25, no. 231, pp. 1-40, 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[147] arXiv:2305.16543 [pdf, other]
Title: Revisiting Structured Variational Autoencoders
Yixiu Zhao, Scott W. Linderman
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[148] arXiv:2305.16557 [pdf, other]
Title: Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters
Maxence Noble, Valentin De Bortoli, Arnaud Doucet, Alain Durmus
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
[149] arXiv:2305.16583 [pdf, html, other]
Title: Detecting Errors in a Numerical Response via any Regression Model
Hang Zhou, Jonas Mueller, Mayank Kumar, Jane-Ling Wang, Jing Lei
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[150] arXiv:2305.16703 [pdf, html, other]
Title: Sources of Uncertainty in Supervised Machine Learning -- A Statisticians' View
Cornelia Gruber, Patrick Oliver Schenk, Malte Schierholz, Frauke Kreuter, Göran Kauermann
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[151] arXiv:2305.16791 [pdf, html, other]
Title: On the Generalization and Approximation Capacities of Neural Controlled Differential Equations
Linus Bleistein, Agathe Guilloux
Comments: ICLR 2024. First presented at the F4CLD Workshop at ICML 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[152] arXiv:2305.16836 [pdf, other]
Title: A Robust Probabilistic Approach to Stochastic Subspace Identification
Brandon J. O'Connell, Timothy J. Rogers
Comments: 42 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP)
[153] arXiv:2305.16860 [pdf, other]
Title: Error Bounds for Flow Matching Methods
Joe Benton, George Deligiannidis, Arnaud Doucet
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[154] arXiv:2305.16905 [pdf, html, other]
Title: Improving Neural Additive Models with Bayesian Principles
Kouroche Bouchiat, Alexander Immer, Hugo Yèche, Gunnar Rätsch, Vincent Fortuin
Comments: 41st International Conference on Machine Learning (ICML 2024)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[155] arXiv:2305.17028 [pdf, html, other]
Title: Better Batch for Deep Probabilistic Time Series Forecasting
Vincent Zhihao Zheng, Seongjin Choi, Lijun Sun
Comments: The 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024); We corrected a misleading notation in the published version and added a link to the code
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[156] arXiv:2305.17063 [pdf, html, other]
Title: Vecchia Gaussian Process Ensembles on Internal Representations of Deep Neural Networks
Felix Jimenez, Matthias Katzfuss
Comments: 22 pages, 9 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[157] arXiv:2305.17083 [pdf, other]
Title: A Policy Gradient Method for Confounded POMDPs
Mao Hong, Zhengling Qi, Yanxun Xu
Comments: 95 pages, 3 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM); Statistics Theory (math.ST); Methodology (stat.ME)
[158] arXiv:2305.17170 [pdf, other]
Title: Error Bounds for Learning with Vector-Valued Random Features
Samuel Lanthaler, Nicholas H. Nelsen
Comments: 28 pages, 1 table, 3 figures. NeurIPS 2023 spotlight
Journal-ref: Advances in Neural Information Processing Systems Vol. 36 (2023) pp. 71834-71861
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[159] arXiv:2305.17225 [pdf, html, other]
Title: Causal Component Analysis
Liang Wendong, Armin Kekić, Julius von Kügelgen, Simon Buchholz, Michel Besserve, Luigi Gresele, Bernhard Schölkopf
Comments: NeurIPS 2023 final camera-ready version
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[160] arXiv:2305.17255 [pdf, other]
Title: FineMorphs: Affine-diffeomorphic sequences for regression
Michele Lohr, Laurent Younes
Comments: 39 pages, 7 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[161] arXiv:2305.17277 [pdf, other]
Title: Optimizing NOTEARS Objectives via Topological Swaps
Chang Deng, Kevin Bello, Bryon Aragam, Pradeep Ravikumar
Comments: 39 pages, 12 figures, ICML 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[162] arXiv:2305.17299 [pdf, other]
Title: Improving Stability in Decision Tree Models
Dimitris Bertsimas, Vassilis Digalakis Jr
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Optimization and Control (math.OC)
[163] arXiv:2305.17467 [pdf, other]
Title: Structured model selection via $\ell_1-\ell_2$ optimization
Xiaofan Lu, Linan Zhang, Hongjin He
Comments: Wanted to revise
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
[164] arXiv:2305.17490 [pdf, other]
Title: The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent
Lei Wu, Weijie J. Su
Comments: ICML 2023 camera ready
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[165] arXiv:2305.17557 [pdf, other]
Title: Fair Clustering via Hierarchical Fair-Dirichlet Process
Abhisek Chakraborty, Anirban Bhattacharya, Debdeep Pati
Subjects: Machine Learning (stat.ML); Computers and Society (cs.CY); Machine Learning (cs.LG)
[166] arXiv:2305.17558 [pdf, other]
Title: Provably Fast Finite Particle Variants of SVGD via Virtual Particle Stochastic Approximation
Aniket Das, Dheeraj Nagaraj
Comments: To appear as a Spotlight Paper in The 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[167] arXiv:2305.17570 [pdf, html, other]
Title: Auditing Fairness by Betting
Ben Chugg, Santiago Cortes-Gomez, Bryan Wilder, Aaditya Ramdas
Comments: Accepted to NeurIPS 2023. 28 pages, 5 figures
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Applications (stat.AP); Methodology (stat.ME)
[168] arXiv:2305.17583 [pdf, html, other]
Title: On Neural Networks as Infinite Tree-Structured Probabilistic Graphical Models
Boyao Li, Alexander J. Thomson, Houssam Nassif, Matthew M. Engelhard, David Page
Comments: Accepted to NeurIPS 2024
Journal-ref: Conference on Neural Information Processing Systems (NeurIPS'24), Vancouver, BC, pp. 4598-4628, 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[169] arXiv:2305.18270 [pdf, other]
Title: How Two-Layer Neural Networks Learn, One (Giant) Step at a Time
Yatin Dandi, Florent Krzakala, Bruno Loureiro, Luca Pesce, Ludovic Stephan
Journal-ref: Journal of Machine Learning Research 25 (2004) 1-65
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[170] arXiv:2305.18383 [pdf, other]
Title: A Three-regime Model of Network Pruning
Yefan Zhou, Yaoqing Yang, Arin Chang, Michael W. Mahoney
Comments: ICML 2023
Journal-ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:42790-42809, 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[171] arXiv:2305.18423 [pdf, other]
Title: On the Role of Noise in the Sample Complexity of Learning Recurrent Neural Networks: Exponential Gaps for Long Sequences
Alireza Fathollah Pour, Hassan Ashtiani
Comments: arXiv admin note: text overlap with arXiv:2206.07199
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[172] arXiv:2305.18436 [pdf, html, other]
Title: Statistically Optimal K-means Clustering via Nonnegative Low-rank Semidefinite Programming
Yubo Zhuang, Xiaohui Chen, Yun Yang, Richard Y. Zhang
Comments: Accepted to ICLR 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[173] arXiv:2305.18484 [pdf, html, other]
Title: Neural Fourier Transform: A General Approach to Equivariant Representation Learning
Masanori Koyama, Kenji Fukumizu, Kohei Hayashi, Takeru Miyato
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[174] arXiv:2305.18488 [pdf, other]
Title: A Bayesian sparse factor model with adaptive posterior concentration
Ilsang Ohn, Lizhen Lin, Yongdai Kim
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[175] arXiv:2305.18502 [pdf, html, other]
Title: Escaping mediocrity: how two-layer networks learn hard generalized linear models with SGD
Luca Arnaboldi, Florent Krzakala, Bruno Loureiro, Ludovic Stephan
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[176] arXiv:2305.18506 [pdf, other]
Title: Generalization Ability of Wide Residual Networks
Jianfa Lai, Zixiong Yu, Songtao Tian, Qian Lin
Comments: 28 pages, 3 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[177] arXiv:2305.18671 [pdf, html, other]
Title: Perturbation-Assisted Sample Synthesis: A Novel Approach for Uncertainty Quantification
Yifei Liu, Rex Shen, Xiaotong Shen
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[178] arXiv:2305.18702 [pdf, html, other]
Title: Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs
Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang
Comments: ICLR, 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[179] arXiv:2305.18974 [pdf, other]
Title: Asymptotic Characterisation of Robust Empirical Risk Minimisation Performance in the Presence of Outliers
Matteo Vilucchio, Emanuele Troiani, Vittorio Erba, Florent Krzakala
Journal-ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:811-819, 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[180] arXiv:2305.19001 [pdf, html, other]
Title: High-probability sample complexities for policy evaluation with linear function approximation
Gen Li, Weichen Wu, Yuejie Chi, Cong Ma, Alessandro Rinaldo, Yuting Wei
Comments: The first two authors contributed equally; paper accepted to IEEE Transactions on Information Theory
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG); Optimization and Control (math.OC); Statistics Theory (math.ST)
[181] arXiv:2305.19082 [pdf, html, other]
Title: Embedding Inequalities for Barron-type Spaces
Lei Wu
Comments: 11 pages
Journal-ref: Journal of Machine Learning, 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[182] arXiv:2305.19123 [pdf, other]
Title: ELSA: Efficient Label Shift Adaptation through the Lens of Semiparametric Models
Qinglong Tian, Xin Zhang, Jiwei Zhao
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[183] arXiv:2305.19147 [pdf, other]
Title: Conditional score-based diffusion models for Bayesian inference in infinite dimensions
Lorenzo Baldassari, Ali Siahkoohi, Josselin Garnier, Knut Solna, Maarten V. de Hoop
Comments: NeurIPS 2023 (Spotlight)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Analysis of PDEs (math.AP); Probability (math.PR)
[184] arXiv:2305.19215 [pdf, html, other]
Title: dotears: Scalable, consistent DAG estimation using observational and interventional data
Albert Xue, Jingyou Rao, Sriram Sankararaman, Harold Pimentel
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[185] arXiv:2305.19243 [pdf, html, other]
Title: Improving Generalization of Complex Models under Unbounded Loss Using PAC-Bayes Bounds
Xitong Zhang, Avrajit Ghosh, Guangliang Liu, Rongrong Wang
Comments: 37 pages, 10 figures, 12 tables
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[186] arXiv:2305.19244 [pdf, other]
Title: Testing for the Markov Property in Time Series via Deep Conditional Generative Learning
Yunzhe Zhou, Chengchun Shi, Lexin Li, Qiwei Yao
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[187] arXiv:2305.19267 [pdf, other]
Title: Parallelized Acquisition for Active Learning using Monte Carlo Sampling
Jesús Torrado, Nils Schöneberg, Jonas El Gammal
Comments: 21 pages, 10 figures
Subjects: Machine Learning (stat.ML); Cosmology and Nongalactic Astrophysics (astro-ph.CO); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG)
[188] arXiv:2305.19416 [pdf, other]
Title: KrADagrad: Kronecker Approximation-Domination Gradient Preconditioned Stochastic Optimization
Jonathan Mei, Alexander Moreno, Luke Walters
Comments: Accepted in "Uncertainty in Artificial Intelligence" (2023)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[189] arXiv:2305.19420 [pdf, other]
Title: What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
Yufeng Zhang, Fengzhuo Zhang, Zhuoran Yang, Zhaoran Wang
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[190] arXiv:2305.19473 [pdf, other]
Title: Chain of Log-Concave Markov Chains
Saeed Saremi, Ji Won Park, Francis Bach
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[191] arXiv:2305.19482 [pdf, other]
Title: Adaptive False Discovery Rate Control with Privacy Guarantee
Xintao Xia, Zhanrui Cai
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[192] arXiv:2305.19535 [pdf, other]
Title: Low-rank extended Kalman filtering for online learning of neural networks from streaming data
Peter G. Chang, Gerardo Durán-Martín, Alexander Y Shestopaloff, Matt Jones, Kevin Murphy
Journal-ref: COLLAS conference 2023
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[193] arXiv:2305.19570 [pdf, other]
Title: Online Label Shift: Optimal Dynamic Regret meets Practical Algorithms
Dheeraj Baby, Saurabh Garg, Tzu-Ching Yen, Sivaraman Balakrishnan, Zachary Chase Lipton, Yu-Xiang Wang
Comments: First three authors contributed equally
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[194] arXiv:2305.19605 [pdf, other]
Title: Parameter-free projected gradient descent
Evgenii Chzhen (LMO, CELESTE), Christophe Giraud (LMO, CELESTE), Gilles Stoltz (LMO, CELESTE)
Subjects: Machine Learning (stat.ML)
[195] arXiv:2305.19638 [pdf, html, other]
Title: A Unified Framework for U-Net Design and Analysis
Christopher Williams, Fabian Falck, George Deligiannidis, Chris Holmes, Arnaud Doucet, Saifuddin Syed
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[196] arXiv:2305.19640 [pdf, html, other]
Title: Fine-grained analysis of non-parametric estimation for pairwise learning
Junyu Zhou, Shuo Huang, Han Feng, Puyu Wang, Ding-Xuan Zhou
Comments: 30 pages, 1 figure
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[197] arXiv:2305.19674 [pdf, html, other]
Title: Online-to-PAC Conversions: Generalization Bounds via Regret Analysis
Gábor Lugosi, Gergely Neu
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[198] arXiv:2305.19694 [pdf, other]
Title: Hypothesis Transfer Learning with Surrogate Classification Losses: Generalization Bounds through Algorithmic Stability
Anass Aghbalou, Guillaume Staerman
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[199] arXiv:2305.19738 [pdf, html, other]
Title: Bures-Wasserstein Means of Graphs
Isabel Haasler, Pascal Frossard
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Signal Processing (eess.SP)
[200] arXiv:2305.19802 [pdf, other]
Title: Neuro-Causal Factor Analysis
Alex Markham, Mingyu Liu, Bryon Aragam, Liam Solus
Comments: 23 pages, 13 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Total of 594 entries : 1-100 101-200 201-300 301-400 401-500 ... 501-594
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack