Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > stat

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Statistics

Authors and titles for May 2023

Total of 1141 entries
Showing up to 2000 entries per page: fewer | more | all
[901] arXiv:2305.12220 (cross-list from cs.LG) [pdf, other]
Title: A Novel Framework for Improving the Breakdown Point of Robust Regression Algorithms
Zheyi Fan, Szu Hui Ng, Qingpei Hu
Comments: conference
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[902] arXiv:2305.12224 (cross-list from cs.LG) [pdf, html, other]
Title: On the Trade-off of Intra-/Inter-class Diversity for Supervised Pre-training
Jieyu Zhang, Bohan Wang, Zhengyu Hu, Pang Wei Koh, Alexander Ratner
Comments: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[903] arXiv:2305.12274 (cross-list from math.OC) [pdf, other]
Title: Rebalance your portfolio without selling
Jay Bartroff
Comments: To appear in The College Mathematics Journal
Subjects: Optimization and Control (math.OC); Methodology (stat.ME)
[904] arXiv:2305.12283 (cross-list from cs.LG) [pdf, other]
Title: Distribution-Free Model-Agnostic Regression Calibration via Nonparametric Methods
Shang Liu, Zhongze Cai, Xiaocheng Li
Comments: Accepted at NeurIPS 2023 and update a camera-ready version; Add some experiments and literature reviews
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[905] arXiv:2305.12292 (cross-list from cs.LG) [pdf, html, other]
Title: Disjunctive Branch-And-Bound for Certifiably Optimal Low-Rank Matrix Completion
Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet
Comments: Updated version with new numerics showcasing scalability up to n=2500
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[906] arXiv:2305.12310 (cross-list from eess.IV) [pdf, html, other]
Title: Alignment of Density Maps in Wasserstein Distance
Amit Singer, Ruiyi Yang
Subjects: Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[907] arXiv:2305.12340 (cross-list from math.NA) [pdf, html, other]
Title: On the Identifiablility of Nonlocal Interaction Kernels in First-Order Systems of Interacting Particles on Riemannian Manifolds
Sui Tang, Malik Tuerkoen, Hanming Zhou
Comments: 21 pages, 2 figures
Journal-ref: Siam Journal on Applied Math 2024
Subjects: Numerical Analysis (math.NA); Classical Analysis and ODEs (math.CA); Statistics Theory (math.ST)
[908] arXiv:2305.12407 (cross-list from cs.LG) [pdf, html, other]
Title: Federated Offline Policy Learning
Aldo Gael Carranza, Susan Athey
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Econometrics (econ.EM); Machine Learning (stat.ML)
[909] arXiv:2305.12475 (cross-list from math.OC) [pdf, other]
Title: Two Sides of One Coin: the Limits of Untuned SGD and the Power of Adaptive Methods
Junchi Yang, Xiang Li, Ilyas Fatkhullin, Niao He
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[910] arXiv:2305.12638 (cross-list from cs.CY) [pdf, other]
Title: Risk Scores, Label Bias, and Everything but the Kitchen Sink
Michael Zanger-Tishler, Julian Nyarko, Sharad Goel
Comments: 19 pages, 4 figures
Subjects: Computers and Society (cs.CY); Applications (stat.AP)
[911] arXiv:2305.12640 (cross-list from cs.AI) [pdf, other]
Title: Limited Resource Allocation in a Non-Markovian World: The Case of Maternal and Child Healthcare
Panayiotis Danassis, Shresth Verma, Jackson A. Killian, Aparna Taneja, Milind Tambe
Comments: Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[912] arXiv:2305.12679 (cross-list from cs.LG) [pdf, other]
Title: Offline Reinforcement Learning with Additional Covering Distributions
Chenjie Mao
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[913] arXiv:2305.12809 (cross-list from cs.LG) [pdf, other]
Title: Relabeling Minimal Training Subset to Flip a Prediction
Jinghan Yang, Linjie Xu, Lequan Yu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[914] arXiv:2305.13012 (cross-list from physics.soc-ph) [pdf, other]
Title: A network community detection method with integration of data from multiple layers and node attributes
Hannu Reittu, Lasse Leskelä, Tomi Räty
Journal-ref: Published version: Network Science 2023
Subjects: Physics and Society (physics.soc-ph); Machine Learning (stat.ML)
[915] arXiv:2305.13064 (cross-list from cs.LG) [pdf, other]
Title: Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond
Itai Kreisler, Mor Shpigel Nacson, Daniel Soudry, Yair Carmon
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[916] arXiv:2305.13123 (cross-list from q-fin.ST) [pdf, other]
Title: Complexity measure, kernel density estimation, bandwidth selection, and the efficient market hypothesis
Matthieu Garcin
Subjects: Statistical Finance (q-fin.ST); Methodology (stat.ME)
[917] arXiv:2305.13165 (cross-list from cs.LG) [pdf, other]
Title: Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model
Peter Súkeník, Marco Mondelli, Christoph Lampert
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[918] arXiv:2305.13187 (cross-list from math.OC) [pdf, other]
Title: SignSVRG: fixing SignSGD via variance reduction
Evgenii Chzhen, Sholom Schechtman
Subjects: Optimization and Control (math.OC); Machine Learning (stat.ML)
[919] arXiv:2305.13209 (cross-list from cs.LG) [pdf, other]
Title: Faster Differentially Private Convex Optimization via Second-Order Methods
Arun Ganesh, Mahdi Haghifam, Thomas Steinke, Abhradeep Thakurta
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Optimization and Control (math.OC); Machine Learning (stat.ML)
[920] arXiv:2305.13233 (cross-list from physics.comp-ph) [pdf, other]
Title: Estimating Gibbs free energies via isobaric-isothermal flows
Peter Wirnsberger, Borja Ibarz, George Papamakarios
Comments: 19 pages, 7 figures
Subjects: Computational Physics (physics.comp-ph); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Machine Learning (stat.ML)
[921] arXiv:2305.13341 (cross-list from physics.data-an) [pdf, other]
Title: Discovering Causal Relations and Equations from Data
Gustau Camps-Valls, Andreas Gerhardus, Urmi Ninad, Gherardo Varando, Georg Martius, Emili Balaguer-Ballester, Ricardo Vinuesa, Emiliano Diaz, Laure Zanna, Jakob Runge
Comments: 137 pages
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Methodology (stat.ME)
[922] arXiv:2305.13349 (cross-list from cs.LG) [pdf, other]
Title: Multiclass classification for multidimensional functional data through deep neural networks
Shuoyang Wang, Guanqun Cao
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[923] arXiv:2305.13402 (cross-list from cs.DS) [pdf, other]
Title: Error-Tolerant Exact Query Learning of Finite Set Partitions with Same-Cluster Oracle
Adela Frances DePavia, Olga Medrano Martín del Campo, Erasmo Tani
Comments: 28 pages, 2 figures
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[924] arXiv:2305.13472 (cross-list from cs.LG) [pdf, other]
Title: A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metrics
Francesco Marchetti, Sabrina Guastavino, Cristina Campi, Federico Benvenuto, Michele Piana
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[925] arXiv:2305.13552 (cross-list from cs.LG) [pdf, other]
Title: Squared Neural Families: A New Class of Tractable Density Models
Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic
Comments: Spotlight award at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[926] arXiv:2305.13687 (cross-list from econ.EM) [pdf, other]
Title: Flexible Bayesian Quantile Analysis of Residential Rental Rates
Ivan Jeliazkov, Shubham Karnawat, Mohammad Arshad Rahman, Angela Vossmeyer
Comments: 38 Pages, 3 Figures, 8 Tables
Subjects: Econometrics (econ.EM); Methodology (stat.ME)
[927] arXiv:2305.13856 (cross-list from cs.LG) [pdf, other]
Title: On the Optimal Batch Size for Byzantine-Robust Distributed Learning
Yi-Rui Yang, Chang-Wei Shi, Wu-Jun Li
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[928] arXiv:2305.13870 (cross-list from physics.data-an) [pdf, other]
Title: Towards effective information content assessment: analytical derivation of information loss in the reconstruction of random fields with model uncertainty
Aleksei Cherkasov, Kirill M. Gerke, Aleksey Khlyupin
Comments: Keywords: correlation functions, structure characterization, structural descriptors, image analysis, information content
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Disordered Systems and Neural Networks (cond-mat.dis-nn); Materials Science (cond-mat.mtrl-sci); Information Theory (cs.IT); Applications (stat.AP)
[929] arXiv:2305.13879 (cross-list from math.NA) [pdf, other]
Title: Stochastic PDE representation of random fields for large-scale Gaussian process regression and statistical finite element analysis
Kim Jie Koh, Fehmi Cirak
Subjects: Numerical Analysis (math.NA); Computational Physics (physics.comp-ph); Machine Learning (stat.ML)
[930] arXiv:2305.13904 (cross-list from cs.LG) [pdf, other]
Title: Deep GEM-Based Network for Weakly Supervised UWB Ranging Error Mitigation
Yuxiao Li, Santiago Mazuelas, Yuan Shen
Comments: 6 pages, 4 figures, Published in: MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM)
Journal-ref: MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM), San Diego, CA, USA, 2021, pp. 528-532
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Applications (stat.AP)
[931] arXiv:2305.13946 (cross-list from cs.LG) [pdf, other]
Title: Data-Dependent Bounds for Online Portfolio Selection Without Lipschitzness and Smoothness
Chung-En Tsai, Ying-Ting Lin, Yen-Huan Li
Comments: 37 pages, typos fixed, NeurIPS 2023
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[932] arXiv:2305.13991 (cross-list from cs.LG) [pdf, html, other]
Title: Expressive Losses for Verified Robustness via Convex Combinations
Alessandro De Palma, Rudy Bunel, Krishnamurthy Dvijotham, M. Pawan Kumar, Robert Stanforth, Alessio Lomuscio
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[933] arXiv:2305.13998 (cross-list from cs.LG) [pdf, html, other]
Title: SMT 2.0: A Surrogate Modeling Toolbox with a focus on Hierarchical and Mixed Variables Gaussian Processes
Paul Saves, Remi Lafage, Nathalie Bartoli, Youssef Diouane, Jasper Bussemaker, Thierry Lefebvre, John T. Hwang, Joseph Morlier, Joaquim R. R. A. Martins
Comments: https://doi.org/10.1016/j.advengsoft.2023.103571
Journal-ref: Advances in Engineering Software Volume 188, February 2024, 103571
Subjects: Machine Learning (cs.LG); Mathematical Software (cs.MS); Optimization and Control (math.OC); Computation (stat.CO)
[934] arXiv:2305.14067 (cross-list from cs.LG) [pdf, other]
Title: DIVA: A Dirichlet Process Mixtures Based Incremental Deep Clustering Algorithm via Variational Auto-Encoder
Zhenshan Bing, Yuan Meng, Yuqi Yun, Hang Su, Xiaojie Su, Kai Huang, Alois Knoll
Comments: static datasets comparision updated
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[935] arXiv:2305.14094 (cross-list from eess.SY) [pdf, other]
Title: Sustainable Edge Intelligence Through Energy-Aware Early Exiting
Marcello Bullo, Seifallah Jardak, Pietro Carnelli, Deniz Gündüz
Comments: 6 pages, accepted at IEEE MLSP 2023
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[936] arXiv:2305.14120 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Relevant Contextual Variables Within Bayesian Optimization
Julien Martinelli, Ayush Bharti, Armi Tiihonen, S.T. John, Louis Filstroff, Sabina J. Sloman, Patrick Rinke, Samuel Kaski
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[937] arXiv:2305.14122 (cross-list from cs.LG) [pdf, other]
Title: Transferring Learning Trajectories of Neural Networks
Daiki Chijiwa
Comments: v2: updates include theoretical analysis and additional experiments
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[938] arXiv:2305.14137 (cross-list from hep-ph) [pdf, html, other]
Title: Goodness of fit by Neyman-Pearson testing
Gaia Grosso, Marco Letizia, Maurizio Pierini, Andrea Wulzer
Comments: 38 pages; improved presentation and writing throughout the paper
Journal-ref: SciPost Phys. 16, 123 (2024)
Subjects: High Energy Physics - Phenomenology (hep-ph); Machine Learning (stat.ML)
[939] arXiv:2305.14164 (cross-list from cs.LG) [pdf, html, other]
Title: Improved Convergence of Score-Based Diffusion Models via Prediction-Correction
Francesco Pedrotti, Jan Maas, Marco Mondelli
Comments: 34 pages; accepted to TMLR
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[940] arXiv:2305.14196 (cross-list from cs.CL) [pdf, html, other]
Title: ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding
Uri Shaham, Maor Ivgi, Avia Efrat, Jonathan Berant, Omer Levy
Comments: Findings of EMNLP 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[941] arXiv:2305.14247 (cross-list from physics.chem-ph) [pdf, other]
Title: Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials Science
David Peter Kovacs, Ilyes Batatia, Eszter Sara Arany, Gabor Csanyi
Subjects: Chemical Physics (physics.chem-ph); Machine Learning (stat.ML)
[942] arXiv:2305.14265 (cross-list from econ.EM) [pdf, html, other]
Title: Adapting to Misspecification
Timothy B. Armstrong, Patrick Kline, Liyang Sun
Comments: 56 pages, 7 figures
Subjects: Econometrics (econ.EM); Methodology (stat.ME)
[943] arXiv:2305.14311 (cross-list from cs.LG) [pdf, other]
Title: Statistical Indistinguishability of Learning Algorithms
Alkis Kalavasis, Amin Karbasi, Shay Moran, Grigoris Velegkas
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[944] arXiv:2305.14451 (cross-list from cs.LG) [pdf, other]
Title: Kernel Interpolation with Sparse Grids
Mohit Yadav, Daniel Sheldon, Cameron Musco
Comments: Accepted at Neural Information Processing Systems (NeurIPS) 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[945] arXiv:2305.14478 (cross-list from econ.GN) [pdf, other]
Title: Reproducibility and Transparency versus Privacy and Confidentiality: Reflections from a Data Editor
Lars Vilhuber
Subjects: General Economics (econ.GN); Other Statistics (stat.OT)
[946] arXiv:2305.14528 (cross-list from cs.LG) [pdf, html, other]
Title: Function Basis Encoding of Numerical Features in Factorization Machines
Alex Shtoff, Elie Abboud, Rotem Stram, Oren Somekh
Comments: Published in TMLR, '2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[947] arXiv:2305.14535 (cross-list from cs.LG) [pdf, other]
Title: Uncertainty Quantification over Graph with Conformalized Graph Neural Networks
Kexin Huang, Ying Jin, Emmanuel Candès, Jure Leskovec
Comments: Published at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[948] arXiv:2305.14612 (cross-list from cs.CV) [pdf, other]
Title: Assessment of Anterior Cruciate Ligament Injury Risk Based on Human Key Points Detection Algorithm
Ziyu Gong, Xiong Zhao, Chen Yang
Comments: 17 pages,and 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[949] arXiv:2305.14683 (cross-list from cs.LG) [pdf, other]
Title: On progressive sharpening, flat minima and generalisation
Lachlan Ewen MacDonald, Jack Valmadre, Simon Lucey
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[950] arXiv:2305.14704 (cross-list from cs.LG) [pdf, other]
Title: Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation
Zezhong Zhang, Ted Yuan
Subjects: Machine Learning (cs.LG); Applications (stat.AP); Methodology (stat.ME)
[951] arXiv:2305.14814 (cross-list from cs.LG) [pdf, other]
Title: What functions can Graph Neural Networks compute on random graphs? The role of Positional Encoding
Nicolas Keriven (CNRS, IRISA), Samuel Vaiter (CNRS, LJAD)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[952] arXiv:2305.14816 (cross-list from cs.LG) [pdf, other]
Title: Provable Offline Preference-Based Reinforcement Learning
Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun
Comments: The first two authors contribute equally
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[953] arXiv:2305.14978 (cross-list from math.NA) [pdf, html, other]
Title: Probabilistic Exponential Integrators
Nathanael Bosch, Philipp Hennig, Filip Tronarp
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Machine Learning (stat.ML)
[954] arXiv:2305.14979 (cross-list from cs.CV) [pdf, other]
Title: Assessment of the Reliablity of a Model's Decision by Generalizing Attribution to the Wavelet Domain
Gabriel Kasmi, Laurent Dubus, Yves-Marie Saint Drenan, Philippe Blanc
Comments: 18 pages, 10 figures, 3 tables. Camera-ready version accepted at the XAI in action workshop at NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[955] arXiv:2305.14984 (cross-list from cs.LG) [pdf, other]
Title: Adversarial robustness of amortized Bayesian inference
Manuel Glöckler, Michael Deistler, Jakob H. Macke
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[956] arXiv:2305.15042 (cross-list from cs.LG) [pdf, other]
Title: Test like you Train in Implicit Deep Learning
Zaccharie Ramzi, Pierre Ablin, Gabriel Peyré, Thomas Moreau
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[957] arXiv:2305.15086 (cross-list from cs.CV) [pdf, html, other]
Title: Unpaired Image-to-Image Translation via Neural Schrödinger Bridge
Beomsu Kim, Gihyun Kwon, Kwanyoung Kim, Jong Chul Ye
Comments: ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[958] arXiv:2305.15141 (cross-list from cs.LG) [pdf, html, other]
Title: From Tempered to Benign Overfitting in ReLU Neural Networks
Guy Kornowski, Gilad Yehudai, Ohad Shamir
Comments: NeurIPS 2023; fixed bug
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[959] arXiv:2305.15203 (cross-list from cs.LG) [pdf, html, other]
Title: Frequency maps reveal the correlation between Adversarial Attacks and Implicit Bias
Lorenzo Basile, Nikos Karantzas, Alberto d'Onofrio, Luca Manzoni, Luca Bortolussi, Alex Rodriguez, Fabio Anselmi
Comments: Accepted at IJCNN 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[960] arXiv:2305.15264 (cross-list from math.OC) [pdf, other]
Title: Error Feedback Shines when Features are Rare
Peter Richtárik, Elnur Gasanov, Konstantin Burlachenko
Subjects: Optimization and Control (math.OC); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[961] arXiv:2305.15267 (cross-list from cs.LG) [pdf, other]
Title: Training Energy-Based Normalizing Flow with Score-Matching Objectives
Chen-Hao Chao, Wei-Fang Sun, Yen-Chang Hsu, Zsolt Kira, Chun-Yi Lee
Comments: Published at NeurIPS 2023. Code: this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[962] arXiv:2305.15276 (cross-list from cs.LG) [pdf, html, other]
Title: Sparse Mean Estimation in Adversarial Settings via Incremental Learning
Jianhao Ma, Rui Ray Chen, Yinghui He, Salar Fattahi, Wei Hu
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[963] arXiv:2305.15287 (cross-list from cs.LG) [pdf, other]
Title: The Crucial Role of Normalization in Sharpness-Aware Minimization
Yan Dai, Kwangjun Ahn, Suvrit Sra
Comments: 30 pages, Published in 37th Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[964] arXiv:2305.15340 (cross-list from cs.MA) [pdf, other]
Title: Bayesian calibration of differentiable agent-based models
Arnau Quera-Bofarull, Ayush Chopra, Anisoara Calinescu, Michael Wooldridge, Joel Dyer
Comments: Accepted for Oral Presentation at the AI4ABM Workshop at ICLR 2023
Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[965] arXiv:2305.15342 (cross-list from cs.LG) [pdf, other]
Title: Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models
Mélina Verger, Sébastien Lallé, François Bouchet, Vanda Luengo
Comments: 12 pages, conference
Journal-ref: Proceedings of the 16th International Conference on Educational Data Mining (EDM 2023)
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[966] arXiv:2305.15349 (cross-list from cs.LG) [pdf, other]
Title: On the Convergence of Black-Box Variational Inference
Kyurae Kim, Jisu Oh, Kaiwen Wu, Yi-An Ma, Jacob R. Gardner
Comments: Accepted to NeurIPS'23; previous title: "Black-Box Variational Inference Converges"
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Optimization and Control (math.OC); Computation (stat.CO); Machine Learning (stat.ML)
[967] arXiv:2305.15359 (cross-list from cs.CR) [pdf, html, other]
Title: Private and Collaborative Kaplan-Meier Estimators
Shadi Rahimian, Raouf Kerkouche, Ina Kurth, Mario Fritz
Subjects: Cryptography and Security (cs.CR); Applications (stat.AP)
[968] arXiv:2305.15408 (cross-list from cs.LG) [pdf, html, other]
Title: Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective
Guhao Feng, Bohang Zhang, Yuntian Gu, Haotian Ye, Di He, Liwei Wang
Comments: 42 pages; Camera-ready version for NeurIPS 2023 (Oral Presentation)
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Computation and Language (cs.CL); Machine Learning (stat.ML)
[969] arXiv:2305.15445 (cross-list from cs.LG) [pdf, other]
Title: Deep Learning-enabled MCMC for Probabilistic State Estimation in District Heating Grids
Andreas Bott, Tim Janke, Florian Steinke
Comments: The code for this paper is available under this https URL
Journal-ref: Applied Energy 336 (2023): 120837
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Numerical Analysis (math.NA); Methodology (stat.ME)
[970] arXiv:2305.15546 (cross-list from cs.LG) [pdf, other]
Title: Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time
Xiang Ji, Gen Li
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[971] arXiv:2305.15558 (cross-list from math.OC) [pdf, html, other]
Title: Online Optimization for Randomized Network Resource Allocation with Long-Term Constraints
Ahmed Sid-Ali, Ioannis Lambadaris, Yiqiang Q. Zhao, Gennady Shaikhet, Shima Kheradmand
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[972] arXiv:2305.15572 (cross-list from cs.LG) [pdf, other]
Title: The Behavior and Convergence of Local Bayesian Optimization
Kaiwen Wu, Kyurae Kim, Roman Garnett, Jacob R. Gardner
Comments: 27 pages; NeurIPS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[973] arXiv:2305.15592 (cross-list from math.PR) [pdf, html, other]
Title: Large Sample Theory for Bures-Wasserstein Barycentres
Leonardo V. Santoro, Victor M. Panaretos
Subjects: Probability (math.PR); Statistics Theory (math.ST)
[974] arXiv:2305.15598 (cross-list from cs.LG) [pdf, html, other]
Title: ReLU Neural Networks with Linear Layers are Biased Towards Single- and Multi-Index Models
Suzanna Parkinson, Greg Ongie, Rebecca Willett
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[975] arXiv:2305.15612 (cross-list from cs.LG) [pdf, html, other]
Title: Density Ratio Estimation-based Bayesian Optimization with Semi-Supervised Learning
Jungtaek Kim
Comments: Accepted at the 42nd International Conference on Machine Learning (ICML 2025)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[976] arXiv:2305.15643 (cross-list from cs.LG) [pdf, other]
Title: Federated Composite Saddle Point Optimization
Site Bai, Brian Bullins
Journal-ref: ICLR 2024: https://openreview.net/forum?id=kklwv4c4dI
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[977] arXiv:2305.15703 (cross-list from cs.LG) [pdf, other]
Title: The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun
Comments: Accepted at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
[978] arXiv:2305.15786 (cross-list from cs.LG) [pdf, other]
Title: Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting
Hilaf Hasson, Danielle C. Maddix, Yuyang Wang, Gaurav Gupta, Youngsuk Park
Comments: ICML 2023
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[979] arXiv:2305.15793 (cross-list from cs.LG) [pdf, other]
Title: Feature space reduction method for ultrahigh-dimensional, multiclass data: Random forest-based multiround screening (RFMS)
Gergely Hanczár, Marcell Stippinger, Dávid Hanák, Marcell T. Kurbucz, Olivér M. Törteli, Ágnes Chripkó, Zoltán Somogyvári
Comments: 9 pages, 2 figures, 2 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation (stat.CO)
[980] arXiv:2305.15877 (cross-list from cs.LG) [pdf, other]
Title: Exponential Smoothing for Off-Policy Learning
Imad Aouali, Victor-Emmanuel Brunel, David Rohde, Anna Korba
Comments: ICML 2023 (Oral and Poster)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[981] arXiv:2305.15912 (cross-list from cs.LG) [pdf, html, other]
Title: Neural Characteristic Activation Analysis and Geometric Parameterization for ReLU Networks
Wenlin Chen, Hong Ge
Comments: Accepted for publication at NeurIPS 2024. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[982] arXiv:2305.15920 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Accurate generation of stochastic dynamics based on multi-model Generative Adversarial Networks
Daniele Lanzoni, Olivier Pierre-Louis, Francesco Montalenti
Comments: Main text and appendices, 10 pages and 10 figures Updated version: citations to previous work which was not known to the authors have been added, text has been re-organized and modified accordingly; supplemental material has been moved into appendices
Journal-ref: J. Chem. Phys. 159, 144109 (2023)
Subjects: Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Computational Physics (physics.comp-ph); Machine Learning (stat.ML)
[983] arXiv:2305.15936 (cross-list from cs.LG) [pdf, html, other]
Title: Learning DAGs from Data with Few Root Causes
Panagiotis Misiakos, Chris Wendler, Markus Püschel
Comments: to be published in 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Journal-ref: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[984] arXiv:2305.15938 (cross-list from math.OC) [pdf, html, other]
Title: First Order Methods with Markovian Noise: from Acceleration to Variational Inequalities
Aleksandr Beznosikov, Sergey Samsonov, Marina Sheshukova, Alexander Gasnikov, Alexey Naumov, Eric Moulines
Comments: Appears in: Advances in Neural Information Processing Systems 36 (NeurIPS 2023). 41 pages, 3 algorithms, 2 tables
Journal-ref: https://proceedings.neurips.cc/paper_files/paper/2023/hash/8c3e38ce55a0fa44bc325bc6fdb7f4e5-Abstract-Conference.html
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[985] arXiv:2305.15984 (cross-list from cs.LG) [pdf, html, other]
Title: Dynamic Inter-treatment Information Sharing for Individualized Treatment Effects Estimation
Vinod Kumar Chauhan, Jiandong Zhou, Ghadeer Ghosheh, Soheila Molaei, David A. Clifton
Comments: accepted to The 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[986] arXiv:2305.16038 (cross-list from cs.LG) [pdf, other]
Title: Implicit bias of SGD in $L_{2}$-regularized linear DNNs: One-way jumps from high to low rank
Zihan Wang, Arthur Jacot
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[987] arXiv:2305.16074 (cross-list from cs.LG) [pdf, other]
Title: Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback
Yiliu Wang, Wei Chen, Milan Vojnović
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[988] arXiv:2305.16094 (cross-list from cs.LG) [pdf, other]
Title: On Influence Functions, Classification Influence, Relative Influence, Memorization and Generalization
Michael Kounavis, Ousmane Dia, Ilqar Ramazanli
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[989] arXiv:2305.16099 (cross-list from cs.LG) [pdf, other]
Title: FAVANO: Federated AVeraging with Asynchronous NOdes
Louis Leconte, Van Minh Nguyen, Eric Moulines
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[990] arXiv:2305.16102 (cross-list from cs.LG) [pdf, html, other]
Title: Demystifying Oversmoothing in Attention-Based Graph Neural Networks
Xinyi Wu, Amir Ajorlou, Zihui Wu, Ali Jadbabaie
Comments: NeurIPS 2023 spotlight. Fixed an error in the previous version; new results and remarks added
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[991] arXiv:2305.16147 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Safety Constraints from Demonstrations with Unknown Rewards
David Lindner, Xin Chen, Sebastian Tschiatschek, Katja Hofmann, Andreas Krause
Comments: Presented at the International Conference on Artificial Intelligence and Statistics (AISTATS) 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[992] arXiv:2305.16150 (cross-list from cs.LG) [pdf, html, other]
Title: Unifying GANs and Score-Based Diffusion as Generative Particle Models
Jean-Yves Franceschi, Mike Gartrell, Ludovic Dos Santos, Thibaut Issenhuth, Emmanuel de Bézenac, Mickaël Chen, Alain Rakotomamonjy
Journal-ref: Thirty-seventh Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, Dec. 2023, New Orleans, LA, USA
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[993] arXiv:2305.16179 (cross-list from cs.LG) [pdf, other]
Title: Dropout Drops Double Descent
Tian-Le Yang, Joe Suzuki
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
[994] arXiv:2305.16189 (cross-list from cs.LG) [pdf, html, other]
Title: Martian time-series unraveled: A multi-scale nested approach with factorial variational autoencoders
Ali Siahkoohi, Rudy Morel, Randall Balestriero, Erwan Allys, Grégory Sainton, Taichi Kawamura, Maarten V. de Hoop
Subjects: Machine Learning (cs.LG); Earth and Planetary Astrophysics (astro-ph.EP); Machine Learning (stat.ML)
[995] arXiv:2305.16205 (cross-list from cs.CY) [pdf, other]
Title: Packaging code for reproducible research in the public sector
Federico Botta, Robin Lovelace, Laura Gilbert, Arthur Turrell
Comments: 6 pages, 3 figures
Subjects: Computers and Society (cs.CY); Computation (stat.CO)
[996] arXiv:2305.16215 (cross-list from cs.LG) [pdf, other]
Title: Koopman Kernel Regression
Petar Bevanda, Max Beier, Armin Lederer, Stefan Sosnowski, Eyke Hüllermeier, Sandra Hirche
Comments: Accepted to the thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Dynamical Systems (math.DS); Machine Learning (stat.ML)
[997] arXiv:2305.16227 (cross-list from physics.data-an) [pdf, other]
Title: Transporting Densities Across Dimensions
Michael Plainer, Felix Dietrich, Ioannis G. Kevrekidis
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Statistics Theory (math.ST)
[998] arXiv:2305.16272 (cross-list from cs.LG) [pdf, html, other]
Title: Incentivizing Honesty among Competitors in Collaborative Learning and Optimization
Florian E. Dorner, Nikola Konstantinov, Georgi Pashaliev, Martin Vechev
Comments: Updated experimental results after fixing a mistake in the code. Previous version published in NeurIPS 2023; 37 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Machine Learning (stat.ML)
[999] arXiv:2305.16284 (cross-list from cs.LG) [pdf, html, other]
Title: DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method
Ahmed Khaled, Konstantin Mishchenko, Chi Jin
Comments: 22 pages, 1 table, 4 figures
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1000] arXiv:2305.16358 (cross-list from cs.LG) [pdf, other]
Title: Differentiable Clustering with Perturbed Spanning Forests
Lawrence Stewart (DI-ENS), Francis S Bach (DI-ENS), Felipe Llinares López, Quentin Berthet
Journal-ref: 37th Conference on Neural Information Processing Systems, Dec 2023, New Orleans, United States
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1001] arXiv:2305.16360 (cross-list from cs.LG) [pdf, other]
Title: Modeling Task Relationships in Multi-variate Soft Sensor with Balanced Mixture-of-Experts
Yuxin Huang, Hao Wang, Zhaoran Liu, Licheng Pan, Haozhe Li, Xinggao Liu
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Applications (stat.AP)
[1002] arXiv:2305.16368 (cross-list from math.OC) [pdf, html, other]
Title: Neural incomplete factorization: learning preconditioners for the conjugate gradient method
Paul Häusner, Ozan Öktem, Jens Sjölund
Comments: 26 pages, 8 figures, accepted in Transactions on Machine Learning Research (TMLR)
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[1003] arXiv:2305.16375 (cross-list from cs.LG) [pdf, other]
Title: Data Topology-Dependent Upper Bounds of Neural Network Widths
Sangmin Lee, Jong Chul Ye
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1004] arXiv:2305.16424 (cross-list from cs.LG) [pdf, html, other]
Title: SketchOGD: Memory-Efficient Continual Learning
Youngjae Min, Benjamin Wright, Jeremy Bernstein, Navid Azizan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1005] arXiv:2305.16433 (cross-list from cs.CL) [pdf, other]
Title: Neural Machine Translation for Mathematical Formulae
Felix Petersen, Moritz Schubotz, Andre Greiner-Petter, Bela Gipp
Comments: Published at ACL 2023
Subjects: Computation and Language (cs.CL); Symbolic Computation (cs.SC); Applications (stat.AP)
[1006] arXiv:2305.16440 (cross-list from cs.LG) [pdf, other]
Title: Representation Transfer Learning via Multiple Pre-trained models for Linear Regression
Navjot Singh, Suhas Diggavi
Comments: 20 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1007] arXiv:2305.16446 (cross-list from cs.LG) [pdf, html, other]
Title: The Representation Jensen-Shannon Divergence
Jhoan K. Hoyos-Osorio, Luis G. Sanchez-Giraldo
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[1008] arXiv:2305.16475 (cross-list from cs.LG) [pdf, other]
Title: Initialization-Dependent Sample Complexity of Linear Predictors and Neural Networks
Roey Magen, Ohad Shamir
Comments: 30 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1009] arXiv:2305.16480 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Stochastic metrology and the empirical distribution
Joseph A. Smiga, Marco Radaelli, Felix C. Binder, Gabriel T. Landi
Comments: 16 pages, 8 figures, 1 table
Subjects: Statistical Mechanics (cond-mat.stat-mech); Statistics Theory (math.ST); Quantum Physics (quant-ph)
[1010] arXiv:2305.16491 (cross-list from cs.LG) [pdf, other]
Title: SAMoSSA: Multivariate Singular Spectrum Analysis with Stochastic Autoregressive Noise
Abdullah Alomar, Munther Dahleh, Sean Mann, Devavrat Shah
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Machine Learning (stat.ML)
[1011] arXiv:2305.16508 (cross-list from cs.LG) [pdf, other]
Title: Most Neural Networks Are Almost Learnable
Amit Daniely, Nathan Srebro, Gal Vardi
Comments: Small fixes after review
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1012] arXiv:2305.16536 (cross-list from cs.LG) [pdf, other]
Title: Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression
Yihao Xue, Siddharth Joshi, Eric Gan, Pin-Yu Chen, Baharan Mirzasoleiman
Comments: to appear at ICML 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1013] arXiv:2305.16562 (cross-list from cs.LG) [pdf, other]
Title: Unsupervised Embedding Quality Evaluation
Anton Tsitsulin, Marina Munkhoeva, Bryan Perozzi
Comments: As appeared at the 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML) at the 40th International Conference on Machine Learning (ICML), Honolulu, Hawaii, USA. 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1014] arXiv:2305.16578 (cross-list from cs.IT) [pdf, other]
Title: Computation of Reliability Statistics for Finite Samples of Success-Failure Experiments
Sanjay M. Joshi
Comments: 6 pages, 4 figures, 1 table
Subjects: Information Theory (cs.IT); Methodology (stat.ME)
[1015] arXiv:2305.16589 (cross-list from cs.LG) [pdf, html, other]
Title: The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi, Gen Li, Yuting Wei, Yuxin Chen, Matthieu Geist, Yuejie Chi
Comments: A short version was published in Neural Information Processing Systems (2023); Under Submission
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Statistics Theory (math.ST)
[1016] arXiv:2305.16590 (cross-list from cs.SI) [pdf, html, other]
Title: Seeding with Differentially Private Network Information
M. Amin Rahimian, Fang-Yi Yu, Yuxin Liu, Carlos Hurtado
Comments: Preliminary version in AAMAS 2023: this https URL -- Code and data: this https URL
Subjects: Social and Information Networks (cs.SI); Computational Complexity (cs.CC); Multiagent Systems (cs.MA); Probability (math.PR); Applications (stat.AP)
[1017] arXiv:2305.16704 (cross-list from cs.LG) [pdf, other]
Title: A Closer Look at In-Context Learning under Distribution Shifts
Kartik Ahuja, David Lopez-Paz
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1018] arXiv:2305.16827 (cross-list from econ.EM) [pdf, other]
Title: Fast and Order-invariant Inference in Bayesian VARs with Non-Parametric Shocks
Florian Huber, Gary Koop
Subjects: Econometrics (econ.EM); Methodology (stat.ME)
[1019] arXiv:2305.16842 (cross-list from q-fin.ST) [pdf, other]
Title: Accounting statement analysis at industry level. A gentle introduction to the compositional approach
Germà Coenders (1), Núria Arimany Serrat (2) ((1) University of Girona, (2) University of Vic - Central University of Catalonia)
Subjects: Statistical Finance (q-fin.ST); Applications (stat.AP); Methodology (stat.ME)
[1020] arXiv:2305.16843 (cross-list from cs.LG) [pdf, other]
Title: Randomized Positional Encodings Boost Length Generalization of Transformers
Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[1021] arXiv:2305.16846 (cross-list from cs.LG) [pdf, html, other]
Title: Lagrangian Flow Networks for Conservation Laws
F. Arend Torres, Marcello Massimo Negri, Marco Inversi, Jonathan Aellen, Volker Roth
Subjects: Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Fluid Dynamics (physics.flu-dyn); Machine Learning (stat.ML)
[1022] arXiv:2305.16891 (cross-list from cs.LG) [pdf, other]
Title: Generalization Guarantees of Gradient Descent for Multi-Layer Neural Networks
Puyu Wang, Yunwen Lei, Di Wang, Yiming Ying, Ding-Xuan Zhou
Comments: 38 pages, 2 figures
Journal-ref: Neural Computation 37(2025): 344-402
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1023] arXiv:2305.16892 (cross-list from cs.DS) [pdf, other]
Title: Feature Adaptation for Sparse Linear Regression
Jonathan Kelner, Frederic Koehler, Raghu Meka, Dhruv Rohatgi
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1024] arXiv:2305.16907 (cross-list from cs.CR) [pdf, other]
Title: CyPhERS: A Cyber-Physical Event Reasoning System providing real-time situational awareness for attack and fault response
Nils Müller, Kaibin Bao, Jörg Matthes, Kai Heussen
Comments: Article submitted to Computers in Industry
Subjects: Cryptography and Security (cs.CR); Signal Processing (eess.SP); Systems and Control (eess.SY); Machine Learning (stat.ML)
[1025] arXiv:2305.16910 (cross-list from math.FA) [pdf, html, other]
Title: Universal approximation with complex-valued deep narrow neural networks
Paul Geuchen, Thomas Jahn, Hannes Matt
Comments: v2: correct typo in arxiv abstract v3: add quantitative result, restructure the entire paper
Subjects: Functional Analysis (math.FA); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1026] arXiv:2305.17010 (cross-list from cs.LG) [pdf, other]
Title: Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets
Dinghuai Zhang, Hanjun Dai, Nikolay Malkin, Aaron Courville, Yoshua Bengio, Ling Pan
Comments: Accepted by NeurIPS 2023 as spotlight
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM); Machine Learning (stat.ML)
[1027] arXiv:2305.17021 (cross-list from cs.LG) [pdf, html, other]
Title: GLOBE-CE: A Translation-Based Approach for Global Counterfactual Explanations
Dan Ley, Saumitra Mishra, Daniele Magazzeni
Comments: Published as a conference paper at ICML 2023 (9 page main text, 3 page references, 16 page appendix)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (stat.ML)
[1028] arXiv:2305.17043 (cross-list from eess.SP) [pdf, html, other]
Title: Explaining Deep Learning for ECG Analysis: Building Blocks for Auditing and Knowledge Discovery
Patrick Wagner, Temesgen Mehari, Wilhelm Haverkamp, Nils Strodthoff
Journal-ref: Computers in Biology and Medicine, Vol. 176, June 2024, 108525
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1029] arXiv:2305.17058 (cross-list from cs.PL) [pdf, other]
Title: Exact Bayesian Inference on Discrete Models via Probability Generating Functions: A Probabilistic Programming Approach
Fabian Zaiser, Andrzej S. Murawski, Luke Ong
Comments: NeurIPS 2023 version
Journal-ref: Advances in Neural Information Processing Systems 36 (NeurIPS 2023)
Subjects: Programming Languages (cs.PL); Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[1030] arXiv:2305.17076 (cross-list from cs.LG) [pdf, other]
Title: Exact Generalization Guarantees for (Regularized) Wasserstein Distributionally Robust Models
Waïss Azizian (DAO), Franck Iutzeler (DAO), Jérôme Malick (DAO)
Comments: 49 pages, 2 figures; to be presented at the 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
Journal-ref: 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023), Dec 2023, New Orleans, United States
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1031] arXiv:2305.17119 (cross-list from cs.LG) [pdf, other]
Title: Manifold Regularization for Memory-Efficient Training of Deep Neural Networks
Shadi Sartipi, Edgar A. Bernal
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1032] arXiv:2305.17126 (cross-list from cs.LG) [pdf, html, other]
Title: Large Language Models as Tool Makers
Tianle Cai, Xuezhi Wang, Tengyu Ma, Xinyun Chen, Denny Zhou
Comments: Code available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[1033] arXiv:2305.17139 (cross-list from cs.AI) [pdf, html, other]
Title: A Measure-Theoretic Axiomatisation of Causality
Junhyung Park, Simon Buchholz, Bernhard Schölkopf, Krikamol Muandet
Subjects: Artificial Intelligence (cs.AI); Statistics Theory (math.ST)
[1034] arXiv:2305.17148 (cross-list from cs.LG) [pdf, html, other]
Title: Differentially Private Low-dimensional Synthetic Data from High-dimensional Datasets
Yiyun He, Thomas Strohmer, Roman Vershynin, Yizhe Zhu
Comments: 23 pages
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Probability (math.PR); Statistics Theory (math.ST)
[1035] arXiv:2305.17209 (cross-list from cs.LG) [pdf, html, other]
Title: Functional Flow Matching
Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1036] arXiv:2305.17224 (cross-list from math.OC) [pdf, html, other]
Title: Fast and Accurate Estimation of Low-Rank Matrices from Noisy Measurements via Preconditioned Non-Convex Gradient Descent
Gavin Zhang, Hong-Ming Chiu, Richard Y. Zhang
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1037] arXiv:2305.17284 (cross-list from cs.LG) [pdf, other]
Title: GC-Flow: A Graph-Based Flow Network for Effective Clustering
Tianchun Wang, Farzaneh Mirzazadeh, Xiang Zhang, Jie Chen
Comments: ICML 2023. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1038] arXiv:2305.17297 (cross-list from cs.LG) [pdf, html, other]
Title: Double Descent and Overfitting under Noisy Inputs and Distribution Shift for Linear Denoisers
Chinmaya Kausik, Kashvi Srivastava, Rishi Sonthalia
Comments: Complete overhaul of presentation, many new results
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1039] arXiv:2305.17301 (cross-list from cs.LG) [pdf, other]
Title: Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds
Taira Tsuchiya, Shinji Ito, Junya Honda
Comments: Published version in Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 32 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1040] arXiv:2305.17332 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Capacity: A Measure of the Effective Dimensionality of a Model
Daiwei Chen, Wei-Kai Chang, Pratik Chaudhari
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[1041] arXiv:2305.17339 (cross-list from cs.IR) [pdf, other]
Title: Counterfactual Evaluation of Peer-Review Assignment Policies
Martin Saveski, Steven Jecmen, Nihar B. Shah, Johan Ugander
Subjects: Information Retrieval (cs.IR); Digital Libraries (cs.DL); Applications (stat.AP)
[1042] arXiv:2305.17365 (cross-list from math.PR) [pdf, other]
Title: High-dimensional Central Limit Theorems by Stein's Method in the Degenerate Case
Xiao Fang, Yuta Koike, Song-Hao Liu, Yi-Kun Zhao
Comments: 32 pages
Subjects: Probability (math.PR); Statistics Theory (math.ST)
[1043] arXiv:2305.17380 (cross-list from cs.LG) [pdf, other]
Title: No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions
Tiancheng Jin, Junyan Liu, Chloé Rouyer, William Chang, Chen-Yu Wei, Haipeng Luo
Comments: Update the camera-ready version for NeurIPS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1044] arXiv:2305.17397 (cross-list from q-bio.PE) [pdf, other]
Title: Mathematical model of mating probability and fertilized egg production in helminth parasites
Gonzalo Maximiliano Lopez, Juan Pablo Aparicio
Subjects: Populations and Evolution (q-bio.PE); Applications (stat.AP)
[1045] arXiv:2305.17435 (cross-list from cs.IT) [pdf, other]
Title: On the Noise Sensitivity of the Randomized SVD
Elad Romanov
Subjects: Information Theory (cs.IT); Numerical Analysis (math.NA); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1046] arXiv:2305.17476 (cross-list from cs.LG) [pdf, other]
Title: Toward Understanding Generative Data Augmentation
Chenyu Zheng, Guoqiang Wu, Chongxuan Li
Comments: 39 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1047] arXiv:2305.17478 (cross-list from cs.LG) [pdf, other]
Title: Deep Variational Lesion-Deficit Mapping
Guilherme Pombo, Robert Gray, Amy P.K. Nelson, Chris Foulon, John Ashburner, Parashkev Nachev
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP); Machine Learning (stat.ML)
[1048] arXiv:2305.17535 (cross-list from cs.LG) [pdf, other]
Title: PFNs4BO: In-Context Learning for Bayesian Optimization
Samuel Müller, Matthias Feurer, Noah Hollmann, Frank Hutter
Comments: In: Proceedings of the 40th International Conference on Machine Learning (ICML'23), PMLR 202:25444-25470, 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1049] arXiv:2305.17574 (cross-list from cs.AI) [pdf, other]
Title: Counterfactual Formulation of Patient-Specific Root Causes of Disease
Eric V. Strobl
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM); Applications (stat.AP); Machine Learning (stat.ML)
[1050] arXiv:2305.17592 (cross-list from cs.LG) [pdf, html, other]
Title: Approximation-Generalization Trade-offs under (Approximate) Group Equivariance
Mircea Petrache, Shubhendu Trivedi
Comments: 23 Pages. Updated to the published version. Advances in Neural Information Processing Systems 36, 61936-61959
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1051] arXiv:2305.17608 (cross-list from cs.LG) [pdf, other]
Title: Reward Collapse in Aligning Large Language Models
Ziang Song, Tianle Cai, Jason D. Lee, Weijie J. Su
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1052] arXiv:2305.17665 (cross-list from cs.LG) [pdf, html, other]
Title: Acceleration of stochastic gradient descent with momentum by averaging: finite-sample rates and asymptotic normality
Kejie Tang, Weidong Liu, Yichen Zhang, Xi Chen
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1053] arXiv:2305.17817 (cross-list from cs.CL) [pdf, other]
Title: Transfer Learning for Power Outage Detection Task with Limited Training Data
Olukunle Owolabi
Subjects: Computation and Language (cs.CL); Applications (stat.AP)
[1054] arXiv:2305.17884 (cross-list from math.NA) [pdf, other]
Title: Combining Monte Carlo and Tensor-network Methods for Partial Differential Equations via Sketching
Yian Chen, Yuehaw Khoo
Subjects: Numerical Analysis (math.NA); Machine Learning (stat.ML)
[1055] arXiv:2305.18046 (cross-list from physics.chem-ph) [pdf, other]
Title: Implicit Transfer Operator Learning: Multiple Time-Resolution Surrogates for Molecular Dynamics
Mathias Schreiner, Ole Winther, Simon Olsson
Comments: 23 pages, 12 figures, 4 tables, NeurIPS 2023
Subjects: Chemical Physics (physics.chem-ph); Machine Learning (stat.ML)
[1056] arXiv:2305.18061 (cross-list from cs.SE) [pdf, other]
Title: Quantifying Process Quality: The Role of Effective Organizational Learning in Software Evolution
Sebastian Hönel
Comments: Ph.D. Thesis without appended papers, 201 pages, 6 figures, 2 tables
Subjects: Software Engineering (cs.SE); Optimization and Control (math.OC); Applications (stat.AP); Machine Learning (stat.ML)
[1057] arXiv:2305.18183 (cross-list from cs.LG) [pdf, other]
Title: On Counterfactual Data Augmentation Under Confounding
Abbavaram Gowtham Reddy, Saketh Bachu, Saloni Dash, Charchit Sharma, Amit Sharma, Vineeth N Balasubramanian
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1058] arXiv:2305.18204 (cross-list from cs.LG) [pdf, html, other]
Title: Kernel Density Matrices for Probabilistic Deep Learning
Fabio A. González, Raúl Ramos-Pollán, Joseph A. Gallego-Mejia
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph); Machine Learning (stat.ML)
[1059] arXiv:2305.18206 (cross-list from eess.SP) [pdf, other]
Title: Deep Generative Model for Simultaneous Range Error Mitigation and Environment Identification
Yuxiao Li, Santiago Mazuelas, Yuan Shen
Comments: 6 pages, 5 figures, Published in: 2021 IEEE Global Communications Conference (GLOBECOM)
Journal-ref: 2021 IEEE Global Communications Conference (GLOBECOM), Madrid, Spain, 2021, pp. 1-6
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
[1060] arXiv:2305.18208 (cross-list from eess.SP) [pdf, other]
Title: A Semi-Supervised Learning Approach for Ranging Error Mitigation Based on UWB Waveform
Yuxiao Li, Santiago Mazuelas, Yuan Shen
Comments: 5 pages, 3 figures, Published in: MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM)
Journal-ref: MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM), San Diego, CA, USA, 2021, pp. 533-537
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
[1061] arXiv:2305.18231 (cross-list from eess.IV) [pdf, other]
Title: High-Fidelity Image Compression with Score-based Generative Models
Emiel Hoogeboom, Eirikur Agustsson, Fabian Mentzer, Luca Versari, George Toderici, Lucas Theis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1062] arXiv:2305.18258 (cross-list from cs.LG) [pdf, other]
Title: Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1063] arXiv:2305.18285 (cross-list from cs.LG) [pdf, other]
Title: Partially Personalized Federated Learning: Breaking the Curse of Data Heterogeneity
Konstantin Mishchenko, Rustem Islamov, Eduard Gorbunov, Samuel Horváth
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1064] arXiv:2305.18370 (cross-list from q-bio.QM) [pdf, other]
Title: Explainable Brain Age Prediction using coVariance Neural Networks
Saurabh Sihag, Gonzalo Mateos, Corey McMillan, Alejandro Ribeiro
Comments: Camera ready version for NeurIPS 2023. arXiv admin note: substantial text overlap with arXiv:2305.01807
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Applications (stat.AP)
[1065] arXiv:2305.18375 (cross-list from cs.LG) [pdf, other]
Title: Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling
Tianqi Chen, Mingyuan Zhou
Comments: ICML 2023
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[1066] arXiv:2305.18378 (cross-list from cs.LG) [pdf, other]
Title: Disentanglement via Latent Quantization
Kyle Hsu, Will Dorrell, James C. R. Whittington, Jiajun Wu, Chelsea Finn
Comments: NeurIPS 2023 camera-ready. 26 pages, 15 figures. Code available at this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1067] arXiv:2305.18379 (cross-list from math.OC) [pdf, other]
Title: Constrained Optimization via Exact Augmented Lagrangian and Randomized Iterative Sketching
Ilgee Hong, Sen Na, Michael W. Mahoney, Mladen Kolar
Comments: 25 pages, 4 figures
Journal-ref: ICML 2023
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[1068] arXiv:2305.18388 (cross-list from cs.LG) [pdf, other]
Title: The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Mark Rowland, Yunhao Tang, Clare Lyle, Rémi Munos, Marc G. Bellemare, Will Dabney
Comments: ICML 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1069] arXiv:2305.18399 (cross-list from cs.LG) [pdf, other]
Title: On the impact of activation and normalization in obtaining isometric embeddings at initialization
Amir Joudaki, Hadi Daneshmand, Francis Bach
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1070] arXiv:2305.18404 (cross-list from cs.CL) [pdf, other]
Title: Conformal Prediction with Large Language Models for Multi-Choice Question Answering
Bhawesh Kumar, Charlie Lu, Gauri Gupta, Anil Palepu, David Bellamy, Ramesh Raskar, Andrew Beam
Comments: Updated sections on prompt engineering. Expanded sections 4.1 and 4.2 and appendix. Included additional references. Work published at the ICML 2023 (Neural Conversational AI TEACH) workshop
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1071] arXiv:2305.18409 (cross-list from cs.LG) [pdf, other]
Title: Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms
Peiyao Xiao, Hao Ban, Kaiyi Ji
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1072] arXiv:2305.18410 (cross-list from cs.LG) [pdf, other]
Title: Understanding Breast Cancer Survival: Using Causality and Language Models on Multi-omics Data
Mugariya Farooq, Shahad Hardan, Aigerim Zhumbhayeva, Yujia Zheng, Preslav Nakov, Kun Zhang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Genomics (q-bio.GN); Methodology (stat.ME)
[1073] arXiv:2305.18415 (cross-list from cs.LG) [pdf, other]
Title: Geometric Algebra Transformer
Johann Brehmer, Pim de Haan, Sönke Behrends, Taco Cohen
Comments: Published at NeurIPS 2023, implementation available at this https URL . v3: matches camera-ready version
Subjects: Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
[1074] arXiv:2305.18420 (cross-list from cs.LG) [pdf, other]
Title: Sample Complexity of Variance-reduced Distributionally Robust Q-learning
Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1075] arXiv:2305.18435 (cross-list from cs.LG) [pdf, html, other]
Title: Statistically Efficient Bayesian Sequential Experiment Design via Reinforcement Learning with Cross-Entropy Estimators
Tom Blau, Iadine Chades, Amir Dezfouli, Daniel Steinberg, Edwin V. Bonilla
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[1076] arXiv:2305.18438 (cross-list from cs.LG) [pdf, other]
Title: Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li, Zhuoran Yang, Mengdi Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1077] arXiv:2305.18447 (cross-list from cs.LG) [pdf, other]
Title: Unleashing the Power of Randomization in Auditing Differentially Private ML
Krishna Pillutla, Galen Andrew, Peter Kairouz, H. Brendan McMahan, Alina Oprea, Sewoong Oh
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Information Theory (cs.IT); Statistics Theory (math.ST)
[1078] arXiv:2305.18505 (cross-list from cs.LG) [pdf, html, other]
Title: Provable Reward-Agnostic Preference-Based Reinforcement Learning
Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee
Comments: ICLR 2024 Spotlight
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1079] arXiv:2305.18543 (cross-list from cs.LG) [pdf, other]
Title: Robust Lipschitz Bandits to Adversarial Corruptions
Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee
Comments: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1080] arXiv:2305.18550 (cross-list from cs.LG) [pdf, other]
Title: Meta-Regression Analysis of Errors in Short-Term Electricity Load Forecasting
Konstantin Hopf, Hannah Hartstang, Thorsten Staake
Comments: 8 pages, 3 figures, 7 tables
Journal-ref: The 14th ACM International Conference on Future Energy Systems (e-Energy '23), June 20--23, 2023, Orlando, FL, USA
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[1081] arXiv:2305.18577 (cross-list from cs.LG) [pdf, other]
Title: Towards Constituting Mathematical Structures for Learning to Optimize
Jialin Liu, Xiaohan Chen, Zhangyang Wang, Wotao Yin, HanQin Cai
Comments: ICML 2023
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1082] arXiv:2305.18627 (cross-list from cs.LG) [pdf, other]
Title: Quantize Once, Train Fast: Allreduce-Compatible Compression with Provable Guarantees
Jihao Xin, Marco Canini, Peter Richtárik, Samuel Horváth
Comments: ECAI'25
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[1083] arXiv:2305.18655 (cross-list from cs.LG) [pdf, other]
Title: Parity Calibration
Youngseog Chung, Aaron Rumack, Chirag Gupta
Comments: To appear at UAI 2023 (Oral); 19 pages and 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1084] arXiv:2305.18699 (cross-list from cs.LG) [pdf, other]
Title: Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input
Shokichi Takakura, Taiji Suzuki
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1085] arXiv:2305.18728 (cross-list from cs.LG) [pdf, html, other]
Title: Plug-in Performative Optimization
Licong Lin, Tijana Zrnic
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1086] arXiv:2305.18730 (cross-list from math.OC) [pdf, other]
Title: Blockwise Stochastic Variance-Reduced Methods with Parallel Speedup for Multi-Block Bilevel Optimization
Quanqi Hu, Zi-Hao Qiu, Zhishuai Guo, Lijun Zhang, Tianbao Yang
Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1087] arXiv:2305.18764 (cross-list from cs.LG) [pdf, other]
Title: When Does Optimizing a Proper Loss Yield Calibration?
Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Preetum Nakkiran
Comments: In NeurIPS 2023. Selected for spotlight presentation
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1088] arXiv:2305.18771 (cross-list from eess.IV) [pdf, other]
Title: SFCNeXt: a simple fully convolutional network for effective brain age estimation with small sample size
Yu Fu, Yanyan Huang, Shunjie Dong, Yalin Wang, Tianbai Yu, Meng Niu, Cheng Zhuo
Comments: This paper has been accepted by IEEE ISBI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1089] arXiv:2305.18777 (cross-list from cs.LG) [pdf, other]
Title: Adaptive Conditional Quantile Neural Processes
Peiman Mohseni, Nick Duffield, Bani Mallick, Arman Hasanzadeh
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1090] arXiv:2305.18779 (cross-list from cs.LG) [pdf, html, other]
Title: It begins with a boundary: A geometric view on probabilistically robust learning
Leon Bungert, Nicolás García Trillos, Matt Jacobs, Daniel McKenzie, Đorđe Nikolić, Qingsong Wang
Comments: Added more general convergence proofs, new results on interpolation behavior, corrected title
Subjects: Machine Learning (cs.LG); Analysis of PDEs (math.AP); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1091] arXiv:2305.18784 (cross-list from cs.LG) [pdf, html, other]
Title: Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits
Ronshee Chawla, Daniel Vial, Sanjay Shakkottai, R. Srikant
Comments: To appear in the proceedings of ICML 2023
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[1092] arXiv:2305.18806 (cross-list from cs.LG) [pdf, html, other]
Title: Prediction Error-based Classification for Class-Incremental Learning
Michał Zając, Tinne Tuytelaars, Gido M. van de Ven
Comments: ICLR 2024 camera ready
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1093] arXiv:2305.18811 (cross-list from cs.LG) [pdf, html, other]
Title: PyPOTS: A Python Toolkit for Machine Learning on Partially-Observed Time Series
Wenjie Du, Yiyuan Yang, Linglong Qian, Jun Wang, Qingsong Wen
Comments: PyPOTS website is at this https URL, and PyPOTS is open source at this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1094] arXiv:2305.18840 (cross-list from cs.LG) [pdf, other]
Title: Learning Perturbations to Explain Time Series Predictions
Joseph Enguehard
Comments: Accepted at ICML 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1095] arXiv:2305.18929 (cross-list from cs.LG) [pdf, other]
Title: Clip21: Error Feedback for Gradient Clipping
Sarit Khirirat, Eduard Gorbunov, Samuel Horváth, Rustem Islamov, Fakhri Karray, Peter Richtárik
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1096] arXiv:2305.18961 (cross-list from quant-ph) [pdf, other]
Title: Quantum Convolutional Neural Networks for Multi-Channel Supervised Learning
Anthony M. Smaldone, Gregory W. Kyro, Victor S. Batista
Subjects: Quantum Physics (quant-ph); Machine Learning (stat.ML)
[1097] arXiv:2305.18991 (cross-list from econ.EM) [pdf, other]
Title: Generalized Autoregressive Score Trees and Forests
Andrew J. Patton, Yasin Simsek
Subjects: Econometrics (econ.EM); Risk Management (q-fin.RM); Applications (stat.AP); Machine Learning (stat.ML)
[1098] arXiv:2305.19008 (cross-list from cs.LG) [pdf, html, other]
Title: Bottleneck Structure in Learned Features: Low-Dimension vs Regularity Tradeoff
Arthur Jacot
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1099] arXiv:2305.19010 (cross-list from q-bio.PE) [pdf, other]
Title: Wind turbine power and land cover effects on cumulative bat deaths
Aristides Moustakas, Panagiotis Georgiakakis, Elzbieta Kret, Eleftherios Kapsalis
Subjects: Populations and Evolution (q-bio.PE); Applications (stat.AP)
[1100] arXiv:2305.19043 (cross-list from cs.LG) [pdf, other]
Title: A Heat Diffusion Perspective on Geodesic Preserving Dimensionality Reduction
Guillaume Huguet, Alexander Tong, Edward De Brouwer, Yanlei Zhang, Guy Wolf, Ian Adelstein, Smita Krishnaswamy
Comments: 31 pages, 13 figures, 10 tables
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[1101] arXiv:2305.19059 (cross-list from cs.LG) [pdf, html, other]
Title: Geometry-aware training of factorized layers in tensor Tucker format
Emanuele Zangrando, Steffen Schotthöfer, Gianluca Ceruti, Jonas Kusch, Francesco Tudisco
Journal-ref: Proceedings NeurIPS 2024
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[1102] arXiv:2305.19076 (cross-list from cs.LG) [pdf, html, other]
Title: Approximate Bayesian Class-Conditional Models under Continuous Representation Shift
Thomas L. Lee, Amos Storkey
Comments: Published at AISTATS 2024, 9 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1103] arXiv:2305.19154 (cross-list from q-bio.PE) [pdf, html, other]
Title: Sparse species interactions reproduce abundance correlation patterns in microbial communities
José Camacho-Mateu, Aniello Lampo, Matteo Sireci, Miguel Ángel Muñoz, José A. Cuesta
Journal-ref: PNAS Vol. 121 (5) e2309575121 (2024)
Subjects: Populations and Evolution (q-bio.PE); Statistics Theory (math.ST); Quantitative Methods (q-bio.QM)
[1104] arXiv:2305.19161 (cross-list from cs.LG) [pdf, other]
Title: Cooperative Thresholded Lasso for Sparse Linear Bandit
Haniyeh Barghi, Xiaotong Cheng, Setareh Maghsudi
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1105] arXiv:2305.19185 (cross-list from cs.LG) [pdf, other]
Title: Compression with Bayesian Implicit Neural Representations
Zongyu Guo, Gergely Flamich, Jiajun He, Zhibo Chen, José Miguel Hernández-Lobato
Comments: Accepted as a Spotlight paper in NeurIPS 2023. Updated camera-ready version
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[1106] arXiv:2305.19187 (cross-list from cs.CL) [pdf, html, other]
Title: Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models
Zhen Lin, Shubhendu Trivedi, Jimeng Sun
Comments: Published in Transactions on Machine Learning Research (05/2024)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1107] arXiv:2305.19206 (cross-list from math.OC) [pdf, html, other]
Title: Gradient descent in matrix factorization: Understanding large initialization
Hengchao Chen, Xin Chen, Mohamad Elmasri, Qiang Sun
Comments: Published in the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)
Subjects: Optimization and Control (math.OC); Machine Learning (stat.ML)
[1108] arXiv:2305.19210 (cross-list from math.RA) [pdf, other]
Title: Rectifiable paths with polynomial log-signature are straight lines
Peter K. Friz, Terry Lyons, Anna Seigal
Comments: 11 pages
Subjects: Rings and Algebras (math.RA); Commutative Algebra (math.AC); Classical Analysis and ODEs (math.CA); Probability (math.PR); Statistics Theory (math.ST)
[1109] arXiv:2305.19259 (cross-list from cs.LG) [pdf, other]
Title: On Convergence of Incremental Gradient for Non-Convex Smooth Functions
Anastasia Koloskova, Nikita Doikov, Sebastian U. Stich, Martin Jaggi
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1110] arXiv:2305.19265 (cross-list from cs.LG) [pdf, html, other]
Title: Probabilistic computation and uncertainty quantification with emerging covariance
Hengyuan Ma, Yang Qi, Li Zhang, Wenlian Lu, Jianfeng Feng
Comments: Code is available in this https URL
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Statistics Theory (math.ST)
[1111] arXiv:2305.19346 (cross-list from nlin.CD) [pdf, other]
Title: Dynamics and Statistics of Weak Chaos in a 4--D Symplectic Map
Tassos Bountis, Konstantinos Kaloudis, Helen Christodoulidi
Subjects: Chaotic Dynamics (nlin.CD); Applications (stat.AP)
[1112] arXiv:2305.19349 (cross-list from cs.LG) [pdf, html, other]
Title: Riemannian Projection-free Online Learning
Zihao Hu, Guanghui Wang, Jacob Abernethy
Comments: Published in Proceedings of The Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1113] arXiv:2305.19366 (cross-list from cs.LG) [pdf, other]
Title: Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network
Tristan Deleu, Mizu Nishikawa-Toomey, Jithendaraa Subramanian, Nikolay Malkin, Laurent Charlin, Yoshua Bengio
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1114] arXiv:2305.19429 (cross-list from cs.LG) [pdf, other]
Title: Adapting Fairness Interventions to Missing Values
Raymond Feng, Flavio P. Calmon, Hao Wang
Comments: Accepted to NeurIPS 2023
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Information Theory (cs.IT); Machine Learning (stat.ML)
[1115] arXiv:2305.19440 (cross-list from cs.LG) [pdf, html, other]
Title: Machine learning with tree tensor networks, CP rank constraints, and tensor dropout
Hao Chen, Thomas Barthel
Comments: 7 pages, 8 figures; published version
Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 46, 7825 (2024)
Subjects: Machine Learning (cs.LG); Strongly Correlated Electrons (cond-mat.str-el); Machine Learning (stat.ML)
[1116] arXiv:2305.19442 (cross-list from cs.LG) [pdf, other]
Title: SimFBO: Towards Simple, Flexible and Communication-efficient Federated Bilevel Learning
Yifan Yang, Peiyao Xiao, Kaiyi Ji
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1117] arXiv:2305.19470 (cross-list from cs.LG) [pdf, html, other]
Title: Label Embedding via Low-Coherence Matrices
Jianxin Zhang, Clayton Scott
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1118] arXiv:2305.19510 (cross-list from cs.LG) [pdf, other]
Title: Mildly Overparameterized ReLU Networks Have a Favorable Loss Landscape
Kedar Karhadkar, Michael Murray, Hanna Tseran, Guido Montúfar
Comments: 40 pages
Subjects: Machine Learning (cs.LG); Combinatorics (math.CO); Machine Learning (stat.ML)
[1119] arXiv:2305.19534 (cross-list from cs.LG) [pdf, other]
Title: Recasting Self-Attention with Holographic Reduced Representations
Mohammad Mahmudul Alam, Edward Raff, Stella Biderman, Tim Oates, James Holt
Comments: To appear in Proceedings of the 40th International Conference on Machine Learning (ICML)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1120] arXiv:2305.19557 (cross-list from math.OC) [pdf, other]
Title: Dictionary Learning under Symmetries via Group Representations
Subhroshekhar Ghosh, Aaron Y. R. Low, Yong Sheng Soh, Zhuohang Feng, Brendan K. Y. Tan
Comments: 29 pages, 2 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[1121] arXiv:2305.19562 (cross-list from cs.LG) [pdf, other]
Title: Replicability in Reinforcement Learning
Amin Karbasi, Grigoris Velegkas, Lin F. Yang, Felix Zhou
Comments: to be published in neurips 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1122] arXiv:2305.19575 (cross-list from math.OC) [pdf, other]
Title: On the Linear Convergence of Policy Gradient under Hadamard Parameterization
Jiacai Liu, Jinchi Chen, Ke Wei
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1123] arXiv:2305.19582 (cross-list from cs.LG) [pdf, other]
Title: Causal Discovery with Latent Confounders Based on Higher-Order Cumulants
Ruichu Cai, Zhiyi Huang, Wei Chen, Zhifeng Hao, Kun Zhang
Comments: Accepted by ICML 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[1124] arXiv:2305.19588 (cross-list from cs.LG) [pdf, other]
Title: Active causal structure learning with advice
Davin Choo, Themis Gouleakis, Arnab Bhattacharyya
Comments: Accepted into ICML 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[1125] arXiv:2305.19666 (cross-list from cs.DS) [pdf, other]
Title: Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation
Joonhyuk Yang, Dongpil Shin, Hye Won Chung
Comments: ICML 2023
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[1126] arXiv:2305.19684 (cross-list from cs.LG) [pdf, other]
Title: End-to-end Training of Deep Boltzmann Machines by Unbiased Contrastive Divergence with Local Mode Initialization
Shohei Taniguchi, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo
Comments: Accepted at ICML 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[1127] arXiv:2305.19685 (cross-list from cs.LG) [pdf, html, other]
Title: Deep Stochastic Mechanics
Elena Orlova, Aleksei Ustimenko, Ruoxi Jiang, Peter Y. Lu, Rebecca Willett
Comments: ICML 2024
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, 235, 2024, 38779-38814; https://proceedings.mlr.press/v235/orlova24a.html
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph); Machine Learning (stat.ML)
[1128] arXiv:2305.19691 (cross-list from cs.LG) [pdf, other]
Title: Constant or logarithmic regret in asynchronous multiplayer bandits
Hugo Richard, Etienne Boursier, Vianney Perchet
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1129] arXiv:2305.19721 (cross-list from econ.EM) [pdf, html, other]
Title: Quasi-Score Matching Estimation for Spatial Autoregressive Model with Random Weights Matrix and Regressors
Xuan Liang, Tao Zou
Comments: 35 pages
Subjects: Econometrics (econ.EM); Statistics Theory (math.ST); Methodology (stat.ME)
[1130] arXiv:2305.19722 (cross-list from quant-ph) [pdf, other]
Title: Monte-Carlo simulation for the frequency comb spectrum of an atom laser
A. Schelle
Comments: 7 pages, 11 figures
Journal-ref: Quanta 2023; 12: 171-179
Subjects: Quantum Physics (quant-ph); Applications (stat.AP)
[1131] arXiv:2305.19744 (cross-list from cs.LG) [pdf, other]
Title: Neural Markov Jump Processes
Patrick Seifner, Ramses J. Sanchez
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1132] arXiv:2305.19779 (cross-list from cs.LG) [pdf, other]
Title: Deep learning and MCMC with aggVAE for shifting administrative boundaries: mapping malaria prevalence in Kenya
Elizaveta Semenova, Swapnil Mishra, Samir Bhatt, Seth Flaxman, H Juliette T Unwin
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1133] arXiv:2305.19809 (cross-list from cs.CV) [pdf, other]
Title: Direct Diffusion Bridge using Data Consistency for Inverse Problems
Hyungjin Chung, Jeongsol Kim, Jong Chul Ye
Comments: NeurIPS 2023 camera-ready. 16 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1134] arXiv:2305.19893 (cross-list from cs.IR) [pdf, other]
Title: Web scraping: a promising tool for geographic data acquisition
Alexander Brenning, Sebastian Henn
Comments: 18 pages, 7 figures, 2 tables
Subjects: Information Retrieval (cs.IR); Applications (stat.AP)
[1135] arXiv:2305.19901 (cross-list from cs.LG) [pdf, other]
Title: Adaptive Conformal Regression with Jackknife+ Rescaled Scores
Nicolas Deutschmann, Mattia Rigotti, Maria Rodriguez Martinez
Comments: 24 pages, 7 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1136] arXiv:2305.19918 (cross-list from cs.DS) [pdf, other]
Title: Fully Dynamic Submodular Maximization over Matroids
Paul Dütting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zadimoghaddam
Comments: Accepted at ICML 2023
Journal-ref: ACM Transactions on Algorithms, Volume 21, Issue 1 (2025), Article No.: 11
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1137] arXiv:2305.19940 (cross-list from math.NA) [pdf, other]
Title: Bayesian design of measurements for magnetorelaxometry imaging
Tapio Helin, Nuutti Hyvönen, Jarno Maaninen, Juha-Pekka Puska
Comments: 23 pages, 9 figures
Subjects: Numerical Analysis (math.NA); Statistics Theory (math.ST)
[1138] arXiv:2305.19947 (cross-list from cs.CV) [pdf, html, other]
Title: A Geometric Perspective on Diffusion Models
Defang Chen, Zhenyu Zhou, Jian-Ping Mei, Chunhua Shen, Chun Chen, Can Wang
Comments: 38 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1139] arXiv:2305.19951 (cross-list from cs.LG) [pdf, html, other]
Title: Not All Neuro-Symbolic Concepts Are Created Equal: Analysis and Mitigation of Reasoning Shortcuts
Emanuele Marconato, Stefano Teso, Antonio Vergari, Andrea Passerini
Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1140] arXiv:2305.20028 (cross-list from cs.LG) [pdf, html, other]
Title: A Study of Bayesian Neural Network Surrogates for Bayesian Optimization
Yucen Lily Li, Tim G. J. Rudner, Andrew Gordon Wilson
Comments: ICLR 2024. Code available at this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[1141] arXiv:2305.20043 (cross-list from cs.LG) [pdf, other]
Title: Deception by Omission: Using Adversarial Missingness to Poison Causal Structure Learning
Deniz Koyuncu, Alex Gittens, Bülent Yener, Moti Yung
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Total of 1141 entries
Showing up to 2000 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack