Machine Learning

Authors and titles for February 2024

Total of 3960 entries

[151] arXiv:2402.01399 [pdf, other]: Title: A Probabilistic Model Behind Self-Supervised Learning

Alice Bizeul, Bernhard Schölkopf, Carl Allen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[152] arXiv:2402.01401 [pdf, html, other]: Title: An Information Theoretic Approach to Machine Unlearning

Jack Foster, Kyle Fogarty, Stefan Schoepf, Zack Dugue, Cengiz Öztireli, Alexandra Brintrup

Comments: Updated, new low-dimensional experiments and updated perspective on unlearning from an information theoretic view

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[153] arXiv:2402.01408 [pdf, html, other]: Title: Counterfactual Concept Bottleneck Models

Gabriele Dominici, Pietro Barbiero, Francesco Giannini, Martin Gjoreski, Giuseppe Marra, Marc Langheinrich

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[154] arXiv:2402.01415 [pdf, other]: Title: SMLP: Symbolic Machine Learning Prover

Franz Brauße, Zurab Khasidashvili, Konstantin Korovin

Comments: 12 pages, 4 figures. (submitted)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO); Symbolic Computation (cs.SC); Optimization and Control (math.OC)
[155] arXiv:2402.01431 [pdf, html, other]: Title: Approximate Control for Continuous-Time POMDPs

Yannick Eich, Bastian Alt, Heinz Koeppl

Comments: To be published in AISTATS 2024

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Quantitative Methods (q-bio.QM)
[156] arXiv:2402.01439 [pdf, other]: Title: From Words to Molecules: A Survey of Large Language Models in Chemistry

Chang Liao, Yemin Yu, Yu Mei, Ying Wei

Comments: Submitted to IJCAI 2024 survey track

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM); Quantitative Methods (q-bio.QM)
[157] arXiv:2402.01440 [pdf, html, other]: Title: A Survey of Few-Shot Learning on Graphs: from Meta-Learning to Pre-Training and Prompt Learning

Xingtong Yu, Yuan Fang, Zemin Liu, Yuxia Wu, Zhihao Wen, Jianyuan Bo, Xinming Zhang, Steven C.H. Hoi

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[158] arXiv:2402.01444 [pdf, other]: Title: Mission Critical -- Satellite Data is a Distinct Modality in Machine Learning

Esther Rolf, Konstantin Klemmer, Caleb Robinson, Hannah Kerner

Comments: 15 pages, 5 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2402.01450 [pdf, other]: Title: Improving importance estimation in covariate shift for providing accurate prediction error

Laura Fdez-Díaz, Sara González Tomillo, Elena Montañés, José Ramón Quevedo

Journal-ref: Expert Systems With Applications 2022 Volume 193 116376

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[160] arXiv:2402.01454 [pdf, html, other]: Title: Integrating Large Language Models in Causal Discovery: A Statistical Causal Approach

Masayuki Takayama, Tadahisa Okuda, Thong Pham, Tatsuyoshi Ikenoue, Shingo Fukuma, Shohei Shimizu, Akiyoshi Sannai

Journal-ref: Published in Transactions in Machine Learning Research (05/2025) https://openreview.net/forum?id=Reh1S8rxfh

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[161] arXiv:2402.01476 [pdf, html, other]: Title: Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes

Yingyi Chen, Qinghua Tao, Francesco Tonin, Johan A.K. Suykens

Comments: We propose Kernel-Eigen Pair Sparse Variational Gaussian Processes (KEP-SVGP) for building uncertainty-aware self-attention where the asymmetry of attention kernel is tackled by KSVD and a reduced time complexity is acquired

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[162] arXiv:2402.01481 [pdf, html, other]: Title: Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains

Jiale Zhao, Wanru Zhuang, Jia Song, Yaqi Li, Shuqi Lu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM)
[163] arXiv:2402.01484 [pdf, html, other]: Title: Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?

Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer

Subjects: Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[164] arXiv:2402.01514 [pdf, html, other]: Title: Mapping the Multiverse of Latent Representations

Jeremy Wayland, Corinna Coupette, Bastian Rieck

Comments: Accepted at ICML 2024

Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT); Machine Learning (stat.ML)
[165] arXiv:2402.01515 [pdf, html, other]: Title: Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence

Yichuan Deng, Zhao Song, Chiwun Yang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[166] arXiv:2402.01528 [pdf, html, other]: Title: Decoding Speculative Decoding

Minghao Yan, Saurabh Agarwal, Shivaram Venkataraman

Comments: Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2025)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[167] arXiv:2402.01543 [pdf, html, other]: Title: Adaptive Optimization for Prediction with Missing Data

Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[168] arXiv:2402.01546 [pdf, other]: Title: Privacy-Preserving Distributed Learning for Residential Short-Term Load Forecasting

Yi Dong, Yingjie Wang, Mariana Gama, Mustafa A. Mustafa, Geert Deconinck, Xiaowei Huang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
[169] arXiv:2402.01567 [pdf, html, other]: Title: Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook, Yan Dai

Comments: Accepted at ICML 2024

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[170] arXiv:2402.01608 [pdf, other]: Title: Contingency Analysis of a Grid of Connected EVs for Primary Frequency Control of an Industrial Microgrid Using Efficient Control Scheme

J.N. Sabhahit, S.S. Solanke, V.K. Jadoun, H. Malik, F.P. García Márquez, J.M. Pinar-Pérez

Comments: Published in Energies (MDPI) 2022

Journal-ref: Energies 2022, 15, 3102

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[171] arXiv:2402.01614 [pdf, other]: Title: L2G2G: a Scalable Local-to-Global Network Embedding with Graph Autoencoders

Ruikang Ouyang, Andrew Elliott, Stratis Limnios, Mihai Cucuringu, Gesine Reinert

Comments: 13 pages, 4 figures, Complex Networks 2023, Volume I, SCI 1141

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[172] arXiv:2402.01621 [pdf, html, other]: Title: Stochastic Two Points Method for Deep Model Zeroth-order Optimization

Yijiang Pang, Jiayu Zhou

Subjects: Machine Learning (cs.LG)
[173] arXiv:2402.01632 [pdf, html, other]: Title: Time-Varying Gaussian Process Bandits with Unknown Prior

Juliusz Ziomek, Masaki Adachi, Michael A. Osborne

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[174] arXiv:2402.01768 [pdf, other]: Title: Enriched Physics-informed Neural Networks for Dynamic Poisson-Nernst-Planck Systems

Xujia Huang, Fajie Wang, Benrong Zhang, Hanqing Liu

Comments: 24 pages, 16 figures, 6 tables

Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[175] arXiv:2402.01785 [pdf, html, other]: Title: DoubleMLDeep: Estimation of Causal Effects with Multimodal Data

Sven Klaassen, Jan Teichert-Kluge, Philipp Bach, Victor Chernozhukov, Martin Spindler, Suhas Vijaykumar

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Econometrics (econ.EM); Methodology (stat.ME); Machine Learning (stat.ML)
[176] arXiv:2402.01790 [pdf, html, other]: Title: An introduction to graphical tensor notation for mechanistic interpretability

Jordan K. Taylor

Comments: 30 pages, 75 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[177] arXiv:2402.01797 [pdf, html, other]: Title: Robust support vector machines via conic optimization

Valentina Cepeda, Andrés Gómez, Shaoning Han

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Computation (stat.CO)
[178] arXiv:2402.01798 [pdf, html, other]: Title: Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning

Guangfeng Yan, Tan Li, Yuanzhang Xiao, Hanxu Hou, Linqi Song

Comments: arXiv admin note: substantial text overlap with arXiv:2402.01160

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[179] arXiv:2402.01799 [pdf, html, other]: Title: Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward

Arnav Chavan, Raghav Magazine, Shubham Kushwaha, Mérouane Debbah, Deepak Gupta

Comments: Accepted at IJCAI '24 (Survey Track), Updated TGI results

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180] arXiv:2402.01801 [pdf, html, other]: Title: Large Language Models for Time Series: A Survey

Xiyuan Zhang, Ranak Roy Chowdhury, Rajesh K. Gupta, Jingbo Shang

Comments: GitHub repository: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[181] arXiv:2402.01802 [pdf, html, other]: Title: An Auction-based Marketplace for Model Trading in Federated Learning

Yue Cui, Liuyi Yao, Yaliang Li, Ziqian Chen, Bolin Ding, Xiaofang Zhou

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[182] arXiv:2402.01811 [pdf, html, other]: Title: A Distributionally Robust Optimisation Approach to Fair Credit Scoring

Pablo Casas, Christophe Mues, Huan Yu

Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[183] arXiv:2402.01821 [pdf, html, other]: Title: Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks

Akshay K. Jagadish, Julian Coda-Forno, Mirko Thalmann, Eric Schulz, Marcel Binz

Comments: 27 pages (9 pages of main text, 4 pages of references, and 14 pages of appendix), 13 figures, and 7 Tables

Journal-ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[184] arXiv:2402.01845 [pdf, html, other]: Title: Multi-Armed Bandits with Interference

Su Jia, Peter Frazier, Nathan Kallus

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[185] arXiv:2402.01849 [pdf, html, other]: Title: Capturing waste collection planning expert knowledge in a fitness function through preference learning

Laura Fernández Díaz, Miriam Fernández Díaz, José Ramón Quevedo, Elena Montañés

Journal-ref: Engineering Applications of Artificial Intelligence 2021 Volume 99 104113

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[186] arXiv:2402.01857 [pdf, html, other]: Title: Position Paper: Assessing Robustness, Privacy, and Fairness in Federated Learning Integrated with Foundation Models

Xi Li, Jiaqi Wang

Comments: Under review

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[187] arXiv:2402.01858 [pdf, html, other]: Title: Explaining latent representations of generative models with large multimodal models

Mengdan Zhu, Zhenke Liu, Bo Pan, Abhinav Angirekula, Liang Zhao

Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2402.01862 [pdf, html, other]: Title: Parametric Feature Transfer: One-shot Federated Learning with Foundation Models

Mahdi Beitollahi, Alex Bie, Sobhan Hemati, Leo Maxime Brunswic, Xu Li, Xi Chen, Guojun Zhang

Comments: 20 pages, 12 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[189] arXiv:2402.01863 [pdf, html, other]: Title: DFML: Decentralized Federated Mutual Learning

Yasser H. Khalil, Amir H. Estiri, Mahdi Beitollahi, Nader Asadi, Sobhan Hemati, Xu Li, Guojun Zhang, Xi Chen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[190] arXiv:2402.01865 [pdf, html, other]: Title: What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement

Xisen Jin, Xiang Ren

Comments: ICML 2024 (Spotlight)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[191] arXiv:2402.01867 [pdf, html, other]: Title: Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision

Jinyan Su, Peilin Yu, Jieyu Zhang, Stephen H. Bach

Comments: Accepted to IEEE International Conference on Big Data 2023

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[192] arXiv:2402.01868 [pdf, html, other]: Title: Challenges in Training PINNs: A Loss Landscape Perspective

Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell

Comments: ICML 2024 Oral; 33 pages (including appendices), 10 figures, 3 tables

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[193] arXiv:2402.01869 [pdf, html, other]: Title: InferCept: Efficient Intercept Support for Augmented Large Language Model Inference

Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[194] arXiv:2402.01879 [pdf, html, other]: Title: $σ$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples

Antonio Emanuele Cinà, Francesco Villani, Maura Pintor, Lea Schönherr, Battista Biggio, Marcello Pelillo

Comments: Paper accepted at International Conference on Learning Representations (ICLR 2025). Code available at this https URL

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2402.01881 [pdf, other]: Title: Large Language Model Agent for Hyper-Parameter Optimization

Siyi Liu, Chen Gao, Yong Li

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[196] arXiv:2402.01886 [pdf, html, other]: Title: Inverse Reinforcement Learning by Estimating Expertise of Demonstrators

Mark Beliaev, Ramtin Pedarsani

Comments: 11 pages, 4 figures, extended version of AAAI publication

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[197] arXiv:2402.01902 [pdf, html, other]: Title: EBV: Electronic Bee-Veterinarian for Principled Mining and Forecasting of Honeybee Time Series

Mst. Shamima Hossain, Christos Faloutsos, Boris Baer, Hyoseung Kim, Vassilis J. Tsotras

Comments: 9 pages, 7 figures, Accepted at 2024 SIAM International Conference on Data Mining (SDM'24)

Subjects: Machine Learning (cs.LG)
[198] arXiv:2402.01909 [pdf, html, other]: Title: On Catastrophic Inheritance of Large Foundation Models

Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang

Comments: Accepted by DMLR

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[199] arXiv:2402.01911 [pdf, html, other]: Title: From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers

Bharat Runwal, Tejaswini Pedapati, Pin-Yu Chen

Comments: Preprint

Subjects: Machine Learning (cs.LG)
[200] arXiv:2402.01920 [pdf, html, other]: Title: Preference Poisoning Attacks on Reward Model Learning

Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
