Machine Learning

Authors and titles for February 2024

Total of 673 entries; entries 201-250 shown.
[201] arXiv:2402.17943 [pdf, html, other]
Title: Sequential transport maps using SoS density estimation and $α$-divergences
Benjamin Zanger, Olivier Zahm, Tiangang Cui, Martin Schreiber
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[202] arXiv:2402.18242 [pdf, html, other]
Title: A network-constrain Weibull AFT model for biomarkers discovery
Claudia Angelini, Daniela De Canditiis, Italia De Feis, Antonella Iuliano
Subjects: Machine Learning (stat.ML); Statistics Theory (math.ST); Methodology (stat.ME)
[203] arXiv:2402.18697 [pdf, html, other]
Title: Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting
Serina Chang, Frederic Koehler, Zhaonan Qu, Jure Leskovec, Johan Ugander
Comments: Conference version available from this https URL
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:6202-6252, 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Optimization and Control (math.OC); Statistics Theory (math.ST)
[204] arXiv:2402.19455 [pdf, html, other]
Title: Listening to the Noise: Blind Denoising with Gibbs Diffusion
David Heurtel-Depeiges, Charles C. Margossian, Ruben Ohana, Bruno Régaldo-Saint Blancard
Comments: 12+9 pages, 7+5 figures, 1+1 tables; accepted to 2024 International Conference on Machine Learning; code: this https URL
Subjects: Machine Learning (stat.ML); Cosmology and Nongalactic Astrophysics (astro-ph.CO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[205] arXiv:2402.00072 (cross-list from cs.LG) [pdf, html, other]
Title: Explainable AI for survival analysis: a median-SHAP approach
Lucile Ter-Minassian, Sahra Ghalebikesabi, Karla Diaz-Ordaz, Chris Holmes
Comments: Accepted to the Interpretable Machine Learning for Healthcare (IMLH) workshop of the ICML 2022 Conference
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[206] arXiv:2402.00152 (cross-list from cs.LG) [pdf, html, other]
Title: Deeper or Wider: A Perspective from Optimal Generalization Error with Sobolev Loss
Yahong Yang, Juncai He
Comments: arXiv admin note: text overlap with arXiv:2310.10766, arXiv:2305.08466
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[207] arXiv:2402.00162 (cross-list from cs.LG) [pdf, html, other]
Title: Behind the Myth of Exploration in Policy Gradients
Adrien Bolland, Gaspard Lambrechts, Damien Ernst
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[208] arXiv:2402.00267 (cross-list from cs.DS) [pdf, html, other]
Title: Not All Learnable Distribution Classes are Privately Learnable
Mark Bun, Gautam Kamath, Argyris Mouzakis, Vikrant Singhal
Comments: Appeared in ALT 2024. Added clarification about result, and updated affiliation and funding for VS
Subjects: Data Structures and Algorithms (cs.DS); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[209] arXiv:2402.00305 (cross-list from math.ST) [pdf, html, other]
Title: Information-Theoretic Thresholds for Planted Dense Cycles
Cheng Mao, Alexander S. Wein, Shenduo Zhang
Comments: 31 pages, 1 figure
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[210] arXiv:2402.00332 (cross-list from cs.LG) [pdf, other]
Title: Comparing Spectral Bias and Robustness For Two-Layer Neural Networks: SGD vs Adaptive Random Fourier Features
Aku Kammonen, Lisi Liang, Anamika Pandey, Raúl Tempone
Comments: 6 Pages, 4 Figures; Accepted in the International Conference on Scientific Computing and Machine Learning
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[211] arXiv:2402.00382 (cross-list from math.ST) [pdf, html, other]
Title: On the design-dependent suboptimality of the Lasso
Reese Pathak, Cong Ma
Comments: 19 pages, 1 figure
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[212] arXiv:2402.00388 (cross-list from cs.LG) [pdf, other]
Title: Cumulative Distribution Function based General Temporal Point Processes
Maolin Wang, Yu Pan, Zenglin Xu, Ruocheng Guo, Xiangyu Zhao, Wanyu Wang, Yiqi Wang, Zitao Liu, Langming Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[213] arXiv:2402.00396 (cross-list from cs.LG) [pdf, html, other]
Title: Efficient Exploration for LLMs
Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Methodology (stat.ME); Machine Learning (stat.ML)
[214] arXiv:2402.00522 (cross-list from cs.LG) [pdf, other]
Title: Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling
Mingze Wang, Weinan E
Comments: 76 pages, accepted by NeurIPS 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[215] arXiv:2402.00592 (cross-list from cs.LG) [pdf, html, other]
Title: Partial-Label Learning with a Reject Option
Tobias Fuchs, Florian Kalinke, Klemens Böhm
Comments: Accepted for publication at TMLR
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[216] arXiv:2402.00728 (cross-list from cs.LG) [pdf, other]
Title: Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation
Hsiang Hsu, Guihong Li, Shaohan Hu, Chun-Fu (Richard) Chen
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[217] arXiv:2402.00743 (cross-list from cs.LG) [pdf, html, other]
Title: Theoretical Understanding of In-Context Learning in Shallow Transformers with Unstructured Data
Yue Xing, Xiaofeng Lin, Chenheng Xu, Namjoon Suh, Qifan Song, Guang Cheng
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[218] arXiv:2402.00776 (cross-list from quant-ph) [pdf, html, other]
Title: Hybrid Quantum Vision Transformers for Event Classification in High Energy Physics
Eyup B. Unlu, Marçal Comajoan Cara, Gopal Ramesh Dahale, Zhongtian Dong, Roy T. Forestano, Sergei Gleyzer, Daniel Justice, Kyoungchul Kong, Tom Magorsch, Konstantin T. Matchev, Katia Matcheva
Comments: 13 pages, 9 figures. Published version in a special issue "Computational Aspects of Machine Learning and Quantum Computing"
Journal-ref: Axioms v. 13, no 3, (2024) 187
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG); High Energy Physics - Phenomenology (hep-ph); Machine Learning (stat.ML)
[219] arXiv:2402.00809 (cross-list from cs.LG) [pdf, html, other]
Title: Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI
Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang
Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[220] arXiv:2402.00847 (cross-list from cs.CV) [pdf, html, other]
Title: BootsTAP: Bootstrapped Training for Tracking-Any-Point
Carl Doersch, Pauline Luc, Yi Yang, Dilara Gokay, Skanda Koppula, Ankush Gupta, Joseph Heyward, Ignacio Rocco, Ross Goroshin, João Carreira, Andrew Zisserman
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[221] arXiv:2402.00849 (cross-list from cs.LG) [pdf, html, other]
Title: Score-based Causal Representation Learning: Linear and General Transformations
Burak Varıcı, Emre Acartürk, Karthikeyan Shanmugam, Abhishek Kumar, Ali Tajer
Comments: Main changes: additional identifiability results from single-node interventions, simplified linear algorithm, and additional experiments. General transform results also appear in our paper General Identifiability and Achievability for Causal Representation Learning (arXiv:2310.15450) appeared at AISTATS 2024 (oral)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[222] arXiv:2402.00857 (cross-list from cs.LG) [pdf, html, other]
Title: Early Time Classification with Accumulated Accuracy Gap Control
Liran Ringel, Regev Cohen, Daniel Freedman, Michael Elad, Yaniv Romano
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[223] arXiv:2402.00899 (cross-list from cs.LG) [pdf, other]
Title: Weakly Supervised Learners for Correction of AI Errors with Provable Performance Guarantees
Ivan Y. Tyukin, Tatiana Tyukina, Daniel van Helden, Zedong Zheng, Evgeny M. Mirkes, Oliver J. Sutton, Qinghua Zhou, Alexander N. Gorban, Penelope Allison
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[224] arXiv:2402.00949 (cross-list from math.AG) [pdf, html, other]
Title: Geometry of Polynomial Neural Networks
Kaie Kubjas, Jiayi Li, Maximilian Wiesmann
Comments: 34 pages, 3 figures. Comments are welcome!
Journal-ref: Alg. Stat. 15 (2024) 295-328
Subjects: Algebraic Geometry (math.AG); Machine Learning (cs.LG); Machine Learning (stat.ML)
[225] arXiv:2402.00957 (cross-list from cs.LG) [pdf, html, other]
Title: Credal Learning Theory
Michele Caprio, Maryam Sultana, Eleni Elia, Fabio Cuzzolin
Comments: 30 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[226] arXiv:2402.01036 (cross-list from math.PR) [pdf, html, other]
Title: Fisher information dissipation for time inhomogeneous stochastic differential equations
Qi Feng, Xinzhe Zuo, Wuchen Li
Comments: 9 figures, 36 pages
Subjects: Probability (math.PR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[227] arXiv:2402.01052 (cross-list from math.OC) [pdf, html, other]
Title: Weakly Convex Regularisers for Inverse Problems: Convergence of Critical Points and Primal-Dual Optimisation
Zakhar Shumaylov, Jeremy Budd, Subhadip Mukherjee, Carola-Bibiane Schönlieb
Comments: 26 pages, 4 figures; this https URL
Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[228] arXiv:2402.01055 (cross-list from cs.LG) [pdf, html, other]
Title: Multiclass Learning from Noisy Labels for Non-decomposable Performance Measures
Mingyuan Zhang, Shivani Agarwal
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[229] arXiv:2402.01095 (cross-list from cs.LG) [pdf, html, other]
Title: How many views does your deep neural network use for prediction?
Keisuke Kawano, Takuro Kutsuna, Keisuke Sano
Comments: 20 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[230] arXiv:2402.01098 (cross-list from cs.LG) [pdf, html, other]
Title: Bayesian Deep Learning for Remaining Useful Life Estimation via Stein Variational Gradient Descent
Luca Della Libera, Jacopo Andreoli, Davide Dalle Pezze, Mirco Ravanelli, Gian Antonio Susto
Comments: 26 pages, 3 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[231] arXiv:2402.01111 (cross-list from cs.LG) [pdf, html, other]
Title: Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao, Yu-Xiang Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[232] arXiv:2402.01143 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Network Representations with Disentangled Graph Auto-Encoder
Di Fan, Chuanhou Gao
Comments: 15 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[233] arXiv:2402.01148 (cross-list from math.ST) [pdf, html, other]
Title: The Optimality of Kernel Classifiers in Sobolev Space
Jianfa Lai, Zhifan Li, Dongming Huang, Qian Lin
Comments: 21 pages, 2 figures
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
[234] arXiv:2402.01199 (cross-list from math.OC) [pdf, other]
Title: MIQCQP reformulation of the ReLU neural networks Lipschitz constant estimation problem
Mohammed Sbihi (ENAC), Sophie Jan (IMT), Nicolas Couellan (IMT, ENAC)
Subjects: Optimization and Control (math.OC); Machine Learning (stat.ML)
[235] arXiv:2402.01297 (cross-list from cs.LG) [pdf, html, other]
Title: Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum
Tin Sum Cheng, Aurelien Lucchi, Anastasis Kratsios, David Belius
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[236] arXiv:2402.01341 (cross-list from cs.LG) [pdf, html, other]
Title: Fundamental Properties of Causal Entropy and Information Gain
Francisco N. F. Q. Simoes, Mehdi Dastani, Thijs van Ommen
Comments: In Proceedings of the conference CLeaR (Causal Learning and Reasoning) 2024
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[237] arXiv:2402.01342 (cross-list from cs.LG) [pdf, html, other]
Title: Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion
Zexi Li, Zhiqi Li, Jie Lin, Tao Shen, Tao Lin, Chao Wu
Comments: preprint
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[238] arXiv:2402.01399 (cross-list from cs.LG) [pdf, other]
Title: A Probabilistic Model Behind Self-Supervised Learning
Alice Bizeul, Bernhard Schölkopf, Carl Allen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[239] arXiv:2402.01401 (cross-list from cs.LG) [pdf, html, other]
Title: An Information Theoretic Approach to Machine Unlearning
Jack Foster, Kyle Fogarty, Stefan Schoepf, Zack Dugue, Cengiz Öztireli, Alexandra Brintrup
Comments: Updated, new low-dimensional experiments and updated perspective on unlearning from an information theoretic view
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[240] arXiv:2402.01450 (cross-list from cs.LG) [pdf, other]
Title: Improving importance estimation in covariate shift for providing accurate prediction error
Laura Fdez-Díaz, Sara González Tomillo, Elena Montañés, José Ramón Quevedo
Journal-ref: Expert Systems With Applications 2022 Volume 193 116376
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[241] arXiv:2402.01454 (cross-list from cs.LG) [pdf, html, other]
Title: Integrating Large Language Models in Causal Discovery: A Statistical Causal Approach
Masayuki Takayama, Tadahisa Okuda, Thong Pham, Tatsuyoshi Ikenoue, Shingo Fukuma, Shohei Shimizu, Akiyoshi Sannai
Journal-ref: Published in Transactions in Machine Learning Research (05/2025) https://openreview.net/forum?id=Reh1S8rxfh
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[242] arXiv:2402.01476 (cross-list from cs.LG) [pdf, html, other]
Title: Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes
Yingyi Chen, Qinghua Tao, Francesco Tonin, Johan A.K. Suykens
Comments: We propose Kernel-Eigen Pair Sparse Variational Gaussian Processes (KEP-SVGP) for building uncertainty-aware self-attention where the asymmetry of attention kernel is tackled by KSVD and a reduced time complexity is acquired
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[243] arXiv:2402.01484 (cross-list from cs.LG) [pdf, html, other]
Title: Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?
Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer
Subjects: Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[244] arXiv:2402.01514 (cross-list from cs.LG) [pdf, html, other]
Title: Mapping the Multiverse of Latent Representations
Jeremy Wayland, Corinna Coupette, Bastian Rieck
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT); Machine Learning (stat.ML)
[245] arXiv:2402.01543 (cross-list from cs.LG) [pdf, html, other]
Title: Adaptive Optimization for Prediction with Missing Data
Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[246] arXiv:2402.01577 (cross-list from cs.CY) [pdf, other]
Title: Deep Active Learning for Data Mining from Conflict Text Corpora
Mihai Croicu
Comments: 40 pages, 6 figures. Paper presented at the Using LLMs and Text-as-Data in Political Science Research Workshop at the University of Barcelona, 29 January 2024
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Machine Learning (stat.ML)
[247] arXiv:2402.01599 (cross-list from math.OC) [pdf, other]
Title: Hyperparameter tuning via trajectory predictions: Stochastic prox-linear methods in matrix sensing
Mengqi Lou, Kabir Aladin Verchand, Ashwin Pananjady
Comments: 68 pages, 6 figures
Subjects: Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
[248] arXiv:2402.01614 (cross-list from cs.LG) [pdf, other]
Title: L2G2G: a Scalable Local-to-Global Network Embedding with Graph Autoencoders
Ruikang Ouyang, Andrew Elliott, Stratis Limnios, Mihai Cucuringu, Gesine Reinert
Comments: 13 pages, 4 figures, Complex Networks 2023, Volume I, SCI 1141
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[249] arXiv:2402.01629 (cross-list from cs.CL) [pdf, html, other]
Title: Position Paper: Generalized grammar rules and structure-based generalization beyond classical equivariance for lexical tasks and transduction
Mircea Petrache, Shubhendu Trivedi
Comments: 12 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[250] arXiv:2402.01632 (cross-list from cs.LG) [pdf, html, other]
Title: Time-Varying Gaussian Process Bandits with Unknown Prior
Juliusz Ziomek, Masaki Adachi, Michael A. Osborne
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)