Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > stat.ML

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for March 2023

Total of 422 entries : 1-50 51-100 101-150 151-200 201-250 251-300 301-350 351-400 ... 401-422
Showing up to 50 entries per page: fewer | more | all
[201] arXiv:2303.04245 (cross-list from cs.LG) [pdf, other]
Title: How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding
Yuchen Li, Yuanzhi Li, Andrej Risteski
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[202] arXiv:2303.04286 (cross-list from stat.ME) [pdf, other]
Title: Sufficient dimension reduction for feature matrices
Chanwoo Lee
Comments: 30 pages, 3 figures
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
[203] arXiv:2303.04338 (cross-list from cs.LG) [pdf, other]
Title: Provable Pathways: Learning Multiple Tasks over Multiple Paths
Yingcong Li, Samet Oymak
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[204] arXiv:2303.04379 (cross-list from cs.LG) [pdf, other]
Title: HappyMap: A Generalized Multi-calibration Method
Zhun Deng, Cynthia Dwork, Linjun Zhang
Comments: Appeared at ITCS 2023 (submitted on Sept. 8th, 2022)
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Methodology (stat.ME); Machine Learning (stat.ML)
[205] arXiv:2303.04397 (cross-list from cs.LG) [pdf, other]
Title: The Lie-Group Bayesian Learning Rule
Eren Mehmet Kıral, Thomas Möllenhoff, Mohammad Emtiyaz Khan
Comments: AISTATS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[206] arXiv:2303.04435 (cross-list from cs.LG) [pdf, other]
Title: A Message Passing Perspective on Learning Dynamics of Contrastive Learning
Yifei Wang, Qi Zhang, Tianqi Du, Jiansheng Yang, Zhouchen Lin, Yisen Wang
Comments: ICLR 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[207] arXiv:2303.04437 (cross-list from cs.LG) [pdf, other]
Title: Learning Hybrid Interpretable Models: Theory, Taxonomy, and Methods
Julien Ferry (LAAS-ROC), Gabriel Laberge (EPM), Ulrich Aïvodji (ETS)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[208] arXiv:2303.04444 (cross-list from math.ST) [pdf, other]
Title: A note on $L^1$-Convergence of the Empiric Minimizer for unbounded functions with fast growth
Pierre Bras
Comments: 10 pages
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[209] arXiv:2303.04614 (cross-list from cs.LG) [pdf, other]
Title: Densely Connected $G$-invariant Deep Neural Networks with Signed Permutation Representations
Devanshu Agrawal, James Ostrowski
Comments: 40 pages, 2 figures, 4 tables. For associated code repository see this https URL
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[210] arXiv:2303.04743 (cross-list from cs.LG) [pdf, other]
Title: Vector Quantized Time Series Generation with a Bidirectional Prior Model
Daesoo Lee, Sara Malacarne, Erlend Aune
Comments: accepted at AISTATS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[211] arXiv:2303.04745 (cross-list from cs.LG) [pdf, other]
Title: A General Theory of Correct, Incorrect, and Extrinsic Equivariance
Dian Wang, Xupeng Zhu, Jung Yeon Park, Mingxi Jia, Guanang Su, Robert Platt, Robin Walters
Comments: Published at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[212] arXiv:2303.04756 (cross-list from stat.ME) [pdf, other]
Title: Meta-learning Control Variates: Variance Reduction with Limited Data
Zhuo Sun, Chris J. Oates, François-Xavier Briol
Comments: Accepted for publication (with an oral presentation) at UAI 2023
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
[213] arXiv:2303.04772 (cross-list from cs.LG) [pdf, html, other]
Title: Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation
Paul Hagemann, Sophie Mildenberger, Lars Ruthotto, Gabriele Steidl, Nicole Tianjiao Yang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Probability (math.PR); Machine Learning (stat.ML)
[214] arXiv:2303.04797 (cross-list from cs.LG) [pdf, other]
Title: Automatic Debiased Learning from Positive, Unlabeled, and Exposure Data
Masahiro Kato, Shuting Wu, Kodai Kureishi, Shota Yasui
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[215] arXiv:2303.04845 (cross-list from cs.LG) [pdf, other]
Title: Smoothed Analysis of Sequential Probability Assignment
Alankrita Bhatt, Nika Haghtalab, Abhishek Shetty
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Information Theory (cs.IT); Machine Learning (stat.ML)
[216] arXiv:2303.04859 (cross-list from cs.LG) [pdf, other]
Title: Agnostic PAC Learning of k-juntas Using L2-Polynomial Regression
Mohsen Heidari, Wojciech Szpankowski
Comments: AISTATS 2023
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Information Theory (cs.IT); Machine Learning (stat.ML)
[217] arXiv:2303.05024 (cross-list from math.ST) [pdf, other]
Title: Phase transition for detecting a small community in a large network
Jiashun Jin, Zheng Tracy Ke, Paxton Turner, Anru R. Zhang
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[218] arXiv:2303.05148 (cross-list from cs.CV) [pdf, other]
Title: Weakly Supervised Knowledge Transfer with Probabilistic Logical Reasoning for Object Detection
Martijn Oldenhof, Adam Arany, Yves Moreau, Edward De Brouwer
Comments: Accepted to ICLR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[219] arXiv:2303.05445 (cross-list from cs.LG) [pdf, html, other]
Title: Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks
Junghyun Lee, Laura Schmid, Se-Young Yun
Comments: 25 pages, 6 figures. Accepted to the 27th International Conference on Principles of Distributed Systems (OPODIS 2023) - Best Student Paper
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI); Machine Learning (stat.ML)
[220] arXiv:2303.05485 (cross-list from cs.LG) [pdf, other]
Title: Efficient Testable Learning of Halfspaces with Adversarial Label Noise
Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Sihan Liu, Nikos Zarifis
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[221] arXiv:2303.05487 (cross-list from cs.AI) [pdf, other]
Title: Learning Rational Subgoals from Demonstrations and Instructions
Zhezheng Luo, Jiayuan Mao, Jiajun Wu, Tomás Lozano-Pérez, Joshua B. Tenenbaum, Leslie Pack Kaelbling
Comments: AAAI 2023. First two authors contributed equally. Project page: this https URL
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[222] arXiv:2303.05490 (cross-list from cs.LG) [pdf, other]
Title: On the Expressiveness and Generalization of Hypergraph Neural Networks
Zhezheng Luo, Jiayuan Mao, Joshua B. Tenenbaum, Leslie Pack Kaelbling
Comments: Learning on Graphs Conference (LoG) 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[223] arXiv:2303.05496 (cross-list from cs.LG) [pdf, other]
Title: Sparse and Local Networks for Hypergraph Reasoning
Guangxuan Xiao, Leslie Pack Kaelbling, Jiajun Wu, Jiayuan Mao
Comments: Learning on Graphs Conference (LoG) 2022. Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[224] arXiv:2303.05501 (cross-list from cs.AI) [pdf, other]
Title: PDSketch: Integrated Planning Domain Programming and Learning
Jiayuan Mao, Tomás Lozano-Pérez, Joshua B. Tenenbaum, Leslie Pack Kaelbling
Comments: Minor typo fixes. NeurIPS 2022. Project page: this https URL
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
[225] arXiv:2303.05504 (cross-list from q-bio.QM) [pdf, other]
Title: Computable Phenotypes to Characterize Changing Patient Brain Dysfunction in the Intensive Care Unit
Yuanfang Ren (1 and 2), Tyler J. Loftus (1 and 3), Ziyuan Guan (1 and 2), Rayon Uddin (1), Benjamin Shickel (1 and 2), Carolina B. Maciel (4), Katharina Busl (4), Parisa Rashidi (1 and 5), Azra Bihorac (1 and 2), Tezcan Ozrazgat-Baslanti (1 and 2) ((1) Intelligent Critical Care Center, University of Florida, Gainesville, FL, (2) Department of Medicine, College of Medicine, University of Florida, Gainesville, FL, (3) Department of Surgery, College of Medicine, University of Florida, Gainesville, FL, (4) Department of Neurology, Neurocritical Care Division, College of Medicine, University of Florida, Gainesville, FL, (5) Crayton Pruitt Family Department of Biomedical Engineering, University of Florida, Gainesville, FL)
Comments: 21 pages, 5 figures, 3 tables, 1 eTable
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Machine Learning (stat.ML)
[226] arXiv:2303.05506 (cross-list from cs.LG) [pdf, other]
Title: TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization
Alan Jeffares, Tennison Liu, Jonathan Crabbé, Fergus Imrie, Mihaela van der Schaar
Comments: Published at International Conference on Learning Representations (ICLR) 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[227] arXiv:2303.05606 (cross-list from cs.LG) [pdf, other]
Title: Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards
Xiang Li, Qiang Sun
Comments: 23 page main text, 42 page appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)
[228] arXiv:2303.05731 (cross-list from cs.LG) [pdf, other]
Title: Upper Bound of Real Log Canonical Threshold of Tensor Decomposition and its Application to Bayesian Inference
Naoki Yoshida, Sumio Watanabe
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[229] arXiv:2303.05754 (cross-list from cs.LG) [pdf, other]
Title: Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems
Hyungjin Chung, Suhyeon Lee, Jong Chul Ye
Comments: ICLR 2024; 28 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[230] arXiv:2303.05798 (cross-list from cs.LG) [pdf, other]
Title: Sliced-Wasserstein on Symmetric Positive Definite Matrices for M/EEG Signals
Clément Bonet, Benoît Malézieux, Alain Rakotomamonjy, Lucas Drumetz, Thomas Moreau, Matthieu Kowalski, Nicolas Courty
Comments: Published as a conference paper at ICML2023
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[231] arXiv:2303.05838 (cross-list from math.PR) [pdf, other]
Title: Rosenthal-type inequalities for linear statistics of Markov chains
Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Marina Sheshukova
Subjects: Probability (math.PR); Statistics Theory (math.ST); Machine Learning (stat.ML)
[232] arXiv:2303.05909 (cross-list from stat.ME) [pdf, other]
Title: A pseudo-likelihood approach to community detection in weighted networks
Andressa Cerqueira, Elizaveta Levina
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[233] arXiv:2303.05958 (cross-list from cs.CL) [pdf, other]
Title: Robust Knowledge Distillation from RNN-T Models With Noisy Training Labels Using Full-Sum Loss
Mohammad Zeineldeen, Kartik Audhkhasi, Murali Karthick Baskar, Bhuvana Ramabhadran
Comments: Accepted at ICASSP 2023
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[234] arXiv:2303.05981 (cross-list from stat.ME) [pdf, other]
Title: Feature Importance: A Closer Look at Shapley Values and LOCO
Isabella Verdinelli, Larry Wasserman
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[235] arXiv:2303.06058 (cross-list from cs.LG) [pdf, html, other]
Title: A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms
Dorian Baudry, Kazuya Suzuki, Junya Honda
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[236] arXiv:2303.06067 (cross-list from cs.LG) [pdf, other]
Title: Modeling Events and Interactions through Temporal Processes -- A Survey
Angelica Liguori, Luciano Caroprese, Marco Minici, Bruno Veloso, Francesco Spinnato, Mirco Nanni, Giuseppe Manco, Joao Gama
Comments: Image replacements
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[237] arXiv:2303.06075 (cross-list from cs.LG) [pdf, other]
Title: Long-tailed Classification from a Bayesian-decision-theory Perspective
Bolian Li, Ruqi Zhang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[238] arXiv:2303.06171 (cross-list from cs.LG) [pdf, other]
Title: DP-Fast MH: Private, Fast, and Accurate Metropolis-Hastings for Large-Scale Bayesian Inference
Wanrong Zhang, Ruqi Zhang
Journal-ref: published at ICML 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[239] arXiv:2303.06198 (cross-list from math.ST) [pdf, other]
Title: Deflated HeteroPCA: Overcoming the curse of ill-conditioning in heteroskedastic PCA
Yuchen Zhou, Yuxin Chen
Comments: accepted to Annals of Statistics
Journal-ref: The Annals of Statistics, vol. 53, no.1, pp. 91-116, 2025
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[240] arXiv:2303.06208 (cross-list from cs.LG) [pdf, other]
Title: Fast computation of permutation equivariant layers with the partition algebra
Charles Godfrey, Michael G. Rawson, Davis Brown, Henry Kvinge
Comments: Comments welcome!
Subjects: Machine Learning (cs.LG); Combinatorics (math.CO); Representation Theory (math.RT); Machine Learning (stat.ML)
[241] arXiv:2303.06296 (cross-list from cs.LG) [pdf, other]
Title: Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Shuangfei Zhai, Tatiana Likhomanenko, Etai Littwin, Dan Busbridge, Jason Ramapuram, Yizhe Zhang, Jiatao Gu, Josh Susskind
Journal-ref: In International Conference on Machine Learning (pp. 40770-40803). PMLR. 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[242] arXiv:2303.06396 (cross-list from cs.LG) [pdf, other]
Title: No-regret Algorithms for Fair Resource Allocation
Abhishek Sinha, Ativ Joshi, Rajarshi Bhattacharjee, Cameron Musco, Mohammad Hajiesmaili
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[243] arXiv:2303.06398 (cross-list from stat.CO) [pdf, other]
Title: Variational Gaussian filtering via Wasserstein gradient flows
Adrien Corenflos, Hany Abdulsamad
Comments: 5 pages, 2 figures, double column, minor modifications compared to version 1 (more experiments + typos). Accepted as a conference paper to EUSIPCO 2023
Subjects: Computation (stat.CO); Computational Engineering, Finance, and Science (cs.CE); Systems and Control (eess.SY); Machine Learning (stat.ML)
[244] arXiv:2303.06484 (cross-list from cs.LG) [pdf, other]
Title: Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity Gap
Weiyang Liu, Longhui Yu, Adrian Weller, Bernhard Schölkopf
Comments: ICLR 2023 (v2: fixed typos)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[245] arXiv:2303.06526 (cross-list from cs.LG) [pdf, other]
Title: Data Dependent Regret Guarantees Against General Comparators for Full or Bandit Feedback
Kaan Gokcesu, Hakan Gokcesu
Comments: this article draws from arXiv:2009.04372,arXiv:2109.09212,arXiv:2204.06660
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Signal Processing (eess.SP); Optimization and Control (math.OC); Machine Learning (stat.ML)
[246] arXiv:2303.06561 (cross-list from cs.LG) [pdf, other]
Title: Phase Diagram of Initial Condensation for Two-layer Neural Networks
Zhengan Chen, Yuqing Li, Tao Luo, Zhangchen Zhou, Zhi-Qin John Xu
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Optimization and Control (math.OC); Machine Learning (stat.ML)
[247] arXiv:2303.06562 (cross-list from cs.LG) [pdf, other]
Title: ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond
Xiaojun Guo, Yifei Wang, Tianqi Du, Yisen Wang
Comments: ICLR 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[248] arXiv:2303.06614 (cross-list from cs.LG) [pdf, other]
Title: Synthetic Experience Replay
Cong Lu, Philip J. Ball, Yee Whye Teh, Jack Parker-Holder
Comments: Published at NeurIPS, 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[249] arXiv:2303.06815 (cross-list from cs.LG) [pdf, other]
Title: On Model Compression for Neural Networks: Framework, Algorithm, and Convergence Guarantee
Chenyang Li, Jihoon Chung, Mengnan Du, Haimin Wang, Xianlian Zhou, Bo Shen
Comments: 44 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[250] arXiv:2303.06825 (cross-list from cs.LG) [pdf, other]
Title: Best-of-three-worlds Analysis for Linear Bandits with Follow-the-regularized-leader Algorithm
Fang Kong, Canzhe Zhao, Shuai Li
Comments: Accepted in COLT 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Total of 422 entries : 1-50 51-100 101-150 151-200 201-250 251-300 301-350 351-400 ... 401-422
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack