Machine Learning

Authors and titles for February 2024

Total of 3960 entries : 1-100 101-200 201-300 301-400 401-500 ... 3901-3960

[101] arXiv:2402.01093 [pdf, html, other]: Title: Need a Small Specialized Language Model? Plan Early!

David Grangier, Angelos Katharopoulos, Pierre Ablin, Awni Hannun

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[102] arXiv:2402.01095 [pdf, html, other]: Title: Minimal Sufficient Views: A DNN model making predictions with more evidence has higher accuracy

Keisuke Kawano, Takuro Kutsuna, Keisuke Sano

Comments: 24 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[103] arXiv:2402.01096 [pdf, html, other]: Title: Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance

Wenqi Wei, Ling Liu

Comments: Manuscript accepted to ACM Computing Surveys

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[104] arXiv:2402.01098 [pdf, html, other]: Title: Bayesian Deep Learning for Remaining Useful Life Estimation via Stein Variational Gradient Descent

Luca Della Libera, Jacopo Andreoli, Davide Dalle Pezze, Mirco Ravanelli, Gian Antonio Susto

Comments: 26 pages, 3 figures

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[105] arXiv:2402.01103 [pdf, html, other]: Title: Compositional Generative Modeling: A Single Model is Not All You Need

Yilun Du, Leslie Kaelbling

Comments: ICML 2024 (Position Track)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[106] arXiv:2402.01105 [pdf, html, other]: Title: A Survey for Foundation Models in Autonomous Driving

Haoxiang Gao, Zhongruo Wang, Yaqian Li, Kaiwen Long, Ming Yang, Yiqing Shen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[107] arXiv:2402.01107 [pdf, html, other]: Title: Simulation of Graph Algorithms with Looped Transformers

Artur Back de Luca, Kimon Fountoulakis

Comments: 55 pages, 3 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS)
[108] arXiv:2402.01109 [pdf, html, other]: Title: Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attack

Tiansheng Huang, Sihao Hu, Ling Liu

Comments: Rejected by ICML2024. Accepted by NeurIPS2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[109] arXiv:2402.01111 [pdf, html, other]: Title: Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints

Dan Qiao, Yu-Xiang Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[110] arXiv:2402.01114 [pdf, html, other]: Title: Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization

Arezoo Rajabi, Reeya Pimple, Aiswarya Janardhanan, Surudhi Asokraj, Bhaskar Ramasubramanian, Radha Poovendran

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[111] arXiv:2402.01140 [pdf, other]: Title: Root Cause Analysis In Microservice Using Neural Granger Causal Discovery

Cheng-Ming Lin, Ching Chang, Wei-Yao Wang, Kuang-Da Wang, Wen-Chih Peng

Comments: AAAI 2024 Main Track

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[112] arXiv:2402.01143 [pdf, html, other]: Title: Learning Network Representations with Disentangled Graph Auto-Encoder

Di Fan, Chuanhou Gao

Comments: 15 pages, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[113] arXiv:2402.01146 [pdf, html, other]: Title: Limited Memory Online Gradient Descent for Kernelized Pairwise Learning with Dynamic Averaging

Hilal AlQuabeh, William de Vazelhes, Bin Gu

Comments: Accepted in AAAI 2024

Subjects: Machine Learning (cs.LG)
[114] arXiv:2402.01147 [pdf, html, other]: Title: Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems

Neharika Jali, Guannan Qu, Weina Wang, Gauri Joshi

Comments: AISTATS 2024; Corrected typos

Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[115] arXiv:2402.01160 [pdf, other]: Title: Truncated Non-Uniform Quantization for Distributed SGD

Guangfeng Yan, Tan Li, Yuanzhang Xiao, Congduan Li, Linqi Song

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[116] arXiv:2402.01195 [pdf, html, other]: Title: Conditional Normalizing Flows for Active Learning of Coarse-Grained Molecular Representations

Henrik Schopmans, Pascal Friederich

Journal-ref: Proceedings of the 41st International Conference on Machine Learning (ICML 2024), PMLR 235:43804-43827, 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Chemical Physics (physics.chem-ph)
[117] arXiv:2402.01201 [pdf, other]: Title: Few-Shot Class-Incremental Learning with Prior Knowledge

Wenhao Jiang, Duo Li, Menghan Hu, Guangtao Zhai, Xiaokang Yang, Xiao-Ping Zhang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[118] arXiv:2402.01203 [pdf, html, other]: Title: Neural Language of Thought Models

Yi-Fu Wu, Minseung Lee, Sungjin Ahn

Comments: Accepted in ICLR 2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2402.01204 [pdf, html, other]: Title: A Survey on Self-Supervised Learning for Non-Sequential Tabular Data

Wei-Yao Wang, Wei-Wei Du, Derek Xu, Wei Wang, Wen-Chih Peng

Comments: ACML-24 Journal Track. The paper list can be found at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[120] arXiv:2402.01206 [pdf, other]: Title: Comparative Evaluation of Weather Forecasting using Machine Learning Models

Md Saydur Rahman, Farhana Akter Tumpa, Md Shazid Islam, Abul Al Arabi, Md Sanzid Bin Hossain, Md Saad Ul Haque

Subjects: Machine Learning (cs.LG)
[121] arXiv:2402.01207 [pdf, html, other]: Title: Efficient Causal Graph Discovery Using Large Language Models

Thomas Jiralerspong, Xiaoyin Chen, Yash More, Vedant Shah, Yoshua Bengio

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[122] arXiv:2402.01208 [pdf, html, other]: Title: Location Agnostic Adaptive Rain Precipitation Prediction using Deep Learning

Md Shazid Islam, Md Saydur Rahman, Md Saad Ul Haque, Farhana Akter Tumpa, Md Sanzid Bin Hossain, Abul Al Arabi

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[123] arXiv:2402.01226 [pdf, html, other]: Title: HW-SW Optimization of DNNs for Privacy-preserving People Counting on Low-resolution Infrared Arrays

Matteo Risso, Chen Xie, Francesco Daghero, Alessio Burrello, Seyedmorteza Mollaei, Marco Castellano, Enrico Macii, Massimo Poncino, Daniele Jahier Pagliari

Comments: This paper has been accepted for publication in the DATE 2024 conference IEEE

Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[124] arXiv:2402.01231 [pdf, html, other]: Title: Unveiling Delay Effects in Traffic Forecasting: A Perspective from Spatial-Temporal Delay Differential Equations

Qingqing Long, Zheng Fang, Chen Fang, Chong Chen, Pengfei Wang, Yuanchun Zhou

Comments: 11 pages, 7 figures

Subjects: Machine Learning (cs.LG)
[125] arXiv:2402.01238 [pdf, other]: Title: Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training

Sota Kudo, Naoaki Ono, Shigehiko Kanaya, Ming Huang

Journal-ref: Neurocomputing (2025): 130198

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[126] arXiv:2402.01242 [pdf, html, other]: Title: Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness

Guibin Zhang, Yanwei Yue, Kun Wang, Junfeng Fang, Yongduo Sui, Kai Wang, Yuxuan Liang, Dawei Cheng, Shirui Pan, Tianlong Chen

Subjects: Machine Learning (cs.LG)
[127] arXiv:2402.01252 [pdf, other]: Title: Target inductive methods for zero-shot regression

Miriam Fdez-Díaz, José Ramón Quevedo, Elena Montañés

Journal-ref: Information Sciences, ISSN 0020-0255, 2022, Volume 599, Pages 44-63

Subjects: Machine Learning (cs.LG)
[128] arXiv:2402.01261 [pdf, other]: Title: TEDDY: Trimming Edges with Degree-based Discrimination strategY

Hyunjin Seo, Jihun Yun, Eunho Yang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[129] arXiv:2402.01262 [pdf, html, other]: Title: Class incremental learning with probability dampening and cascaded gated classifier

Jary Pomponi, Alessio Devoto, Simone Scardapane

Comments: Previously called "Cascaded Scaling Classifier: class incremental learning with probability scaling". The official code is available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[130] arXiv:2402.01263 [pdf, other]: Title: A Differentiable Partially Observable Generalized Linear Model with Forward-Backward Message Passing

Chengrui Li, Weihan Li, Yule Wang, Anqi Wu

Subjects: Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[131] arXiv:2402.01264 [pdf, other]: Title: Direct side information learning for zero-shot regression

Miriam Fdez-Díaz, Elena Montañés, José Ramón Quevedo

Journal-ref: Neurocomputing 2023 Volume 561 126873

Subjects: Machine Learning (cs.LG)
[132] arXiv:2402.01293 [pdf, other]: Title: Can MLLMs Perform Text-to-Image In-Context Learning?

Yuchen Zeng, Wonjun Kang, Yicong Chen, Hyung Il Koo, Kangwook Lee

Comments: Accepted at COLM 2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[133] arXiv:2402.01295 [pdf, html, other]: Title: ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast

Wanghan Xu, Kang Chen, Tao Han, Hao Chen, Wanli Ouyang, Lei Bai

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[134] arXiv:2402.01296 [pdf, html, other]: Title: Bi-CryptoNets: Leveraging Different-Level Privacy for Encrypted Inference

Man-Jie Yuan, Zheng Zou, Wei Gao

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2402.01297 [pdf, html, other]: Title: Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum

Tin Sum Cheng, Aurelien Lucchi, Anastasis Kratsios, David Belius

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[136] arXiv:2402.01302 [pdf, html, other]: Title: A Unified Framework for Center-based Clustering of Distributed Data

Aleksandar Armacki, Dragana Bajović, Dušan Jakovetić, Soummya Kar

Comments: 49 pages, 9 figures, 7 tables

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA)
[137] arXiv:2402.01306 [pdf, html, other]: Title: KTO: Model Alignment as Prospect Theoretic Optimization

Kawin Ethayarajh, Winnie Xu, Niklas Muennighoff, Dan Jurafsky, Douwe Kiela

Comments: ICML 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[138] arXiv:2402.01327 [pdf, html, other]: Title: Supervised Algorithmic Fairness in Distribution Shifts: A Survey

Minglai Shao, Dong Li, Chen Zhao, Xintao Wu, Yujie Lin, Qin Tian

Comments: IJCAI 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[139] arXiv:2402.01340 [pdf, html, other]: Title: SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding

Chanho Park, Namyoon Lee

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Signal Processing (eess.SP)
[140] arXiv:2402.01341 [pdf, html, other]: Title: Fundamental Properties of Causal Entropy and Information Gain

Francisco N. F. Q. Simoes, Mehdi Dastani, Thijs van Ommen

Comments: In Proceedings of the conference CLeaR (Causal Learning and Reasoning) 2024

Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[141] arXiv:2402.01342 [pdf, html, other]: Title: Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion

Zexi Li, Zhiqi Li, Jie Lin, Tao Shen, Tao Lin, Chao Wu

Comments: preprint

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[142] arXiv:2402.01343 [pdf, html, other]: Title: Shapelet-based Model-agnostic Counterfactual Local Explanations for Time Series Classification

Qi Huang, Wei Chen, Thomas Bäck, Niki van Stein

Comments: The paper has been accepted by the XAI4Sci workshop of AAAI 2024

Subjects: Machine Learning (cs.LG)
[143] arXiv:2402.01344 [pdf, html, other]: Title: Monotone, Bi-Lipschitz, and Polyak-Lojasiewicz Networks

Ruigang Wang, Krishnamurthy Dvijotham, Ian R. Manchester

Comments: International Conference on Machine Learning, Vienna, Austria, July 21-27, 2024

Subjects: Machine Learning (cs.LG)
[144] arXiv:2402.01348 [pdf, html, other]: Title: CORE: Mitigating Catastrophic Forgetting in Continual Learning through Cognitive Replay

Jianshu Zhang, Yankai Fu, Ziheng Peng, Dongyu Yao, Kun He

Comments: Accepted by CogSci24 as oral presentation

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[145] arXiv:2402.01350 [pdf, html, other]: Title: pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning

Liping Yi, Han Yu, Chao Ren, Heng Zhang, Gang Wang, Xiaoguang Liu, Xiaoxiao Li

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[146] arXiv:2402.01359 [pdf, html, other]: Title: TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time (Extended Version)

Zeliang Kan, Shae McFadden, Daniel Arp, Feargus Pendlebury, Roberto Jordaney, Johannes Kinder, Fabio Pierazzi, Lorenzo Cavallaro

Comments: 30 pages. arXiv admin note: text overlap with arXiv:1807.07838

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Performance (cs.PF)
[147] arXiv:2402.01361 [pdf, html, other]: Title: To the Max: Reinventing Reward in Reinforcement Learning

Grigorii Veviurko, Wendelin Böhmer, Mathijs de Weerdt

Journal-ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:49455-49470, 2024

Subjects: Machine Learning (cs.LG)
[148] arXiv:2402.01369 [pdf, html, other]: Title: On the Multi-modal Vulnerability of Diffusion Models

Dingcheng Yang, Yang Bai, Xiaojun Jia, Yang Liu, Xiaochun Cao, Wenjian Yu

Comments: Accepted at ICML2024 Workshop on Trustworthy Multi-modal Foundation Models and AI Agents (TiFA)

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2402.01371 [pdf, other]: Title: Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation

Prashansa Panda, Shalabh Bhatnagar

Subjects: Machine Learning (cs.LG)
[150] arXiv:2402.01379 [pdf, html, other]: Title: Regularized boosting with an increasing coefficient magnitude stop criterion as meta-learner in hyperparameter optimization stacking ensemble

Laura Fdez-Díaz, José Ramón Quevedo, Elena Montañés

Journal-ref: Neurocomputing 2023 Volume 551 126516

Subjects: Machine Learning (cs.LG)
[151] arXiv:2402.01399 [pdf, other]: Title: A Probabilistic Model Behind Self-Supervised Learning

Alice Bizeul, Bernhard Schölkopf, Carl Allen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[152] arXiv:2402.01401 [pdf, html, other]: Title: An Information Theoretic Approach to Machine Unlearning

Jack Foster, Kyle Fogarty, Stefan Schoepf, Zack Dugue, Cengiz Öztireli, Alexandra Brintrup

Comments: Updated, new low-dimensional experiments and updated perspective on unlearning from an information theoretic view

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[153] arXiv:2402.01408 [pdf, html, other]: Title: Counterfactual Concept Bottleneck Models

Gabriele Dominici, Pietro Barbiero, Francesco Giannini, Martin Gjoreski, Giuseppe Marra, Marc Langheinrich

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[154] arXiv:2402.01415 [pdf, other]: Title: SMLP: Symbolic Machine Learning Prover

Franz Brauße, Zurab Khasidashvili, Konstantin Korovin

Comments: 12 pages, 4 figures. (submitted)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO); Symbolic Computation (cs.SC); Optimization and Control (math.OC)
[155] arXiv:2402.01431 [pdf, html, other]: Title: Approximate Control for Continuous-Time POMDPs

Yannick Eich, Bastian Alt, Heinz Koeppl

Comments: To be published in AISTATS 2024

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Quantitative Methods (q-bio.QM)
[156] arXiv:2402.01439 [pdf, other]: Title: From Words to Molecules: A Survey of Large Language Models in Chemistry

Chang Liao, Yemin Yu, Yu Mei, Ying Wei

Comments: Submitted to IJCAI 2024 survey track

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM); Quantitative Methods (q-bio.QM)
[157] arXiv:2402.01440 [pdf, html, other]: Title: A Survey of Few-Shot Learning on Graphs: from Meta-Learning to Pre-Training and Prompt Learning

Xingtong Yu, Yuan Fang, Zemin Liu, Yuxia Wu, Zhihao Wen, Jianyuan Bo, Xinming Zhang, Steven C.H. Hoi

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[158] arXiv:2402.01444 [pdf, other]: Title: Mission Critical -- Satellite Data is a Distinct Modality in Machine Learning

Esther Rolf, Konstantin Klemmer, Caleb Robinson, Hannah Kerner

Comments: 15 pages, 5 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2402.01450 [pdf, other]: Title: Improving importance estimation in covariate shift for providing accurate prediction error

Laura Fdez-Díaz, Sara González Tomillo, Elena Montañés, José Ramón Quevedo

Journal-ref: Expert Systems With Applications 2022 Volume 193 116376

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[160] arXiv:2402.01454 [pdf, html, other]: Title: Integrating Large Language Models in Causal Discovery: A Statistical Causal Approach

Masayuki Takayama, Tadahisa Okuda, Thong Pham, Tatsuyoshi Ikenoue, Shingo Fukuma, Shohei Shimizu, Akiyoshi Sannai

Journal-ref: Published in Transactions on Machine Learning Research (05/2025) https://openreview.net/forum?id=Reh1S8rxfh

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[161] arXiv:2402.01476 [pdf, html, other]: Title: Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes

Yingyi Chen, Qinghua Tao, Francesco Tonin, Johan A.K. Suykens

Comments: We propose Kernel-Eigen Pair Sparse Variational Gaussian Processes (KEP-SVGP) for building uncertainty-aware self-attention where the asymmetry of attention kernel is tackled by KSVD and a reduced time complexity is acquired

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[162] arXiv:2402.01481 [pdf, html, other]: Title: Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains

Jiale Zhao, Wanru Zhuang, Jia Song, Yaqi Li, Shuqi Lu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM)
[163] arXiv:2402.01484 [pdf, html, other]: Title: Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?

Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer

Subjects: Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[164] arXiv:2402.01514 [pdf, html, other]: Title: Mapping the Multiverse of Latent Representations

Jeremy Wayland, Corinna Coupette, Bastian Rieck

Comments: Accepted at ICML 2024

Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT); Machine Learning (stat.ML)
[165] arXiv:2402.01515 [pdf, html, other]: Title: Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence

Yichuan Deng, Zhao Song, Chiwun Yang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[166] arXiv:2402.01528 [pdf, html, other]: Title: Decoding Speculative Decoding

Minghao Yan, Saurabh Agarwal, Shivaram Venkataraman

Comments: Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2025)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[167] arXiv:2402.01543 [pdf, html, other]: Title: Adaptive Optimization for Prediction with Missing Data

Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[168] arXiv:2402.01546 [pdf, other]: Title: Privacy-Preserving Distributed Learning for Residential Short-Term Load Forecasting

Yi Dong, Yingjie Wang, Mariana Gama, Mustafa A. Mustafa, Geert Deconinck, Xiaowei Huang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
[169] arXiv:2402.01567 [pdf, html, other]: Title: Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook, Yan Dai

Comments: Accepted at ICML 2024

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[170] arXiv:2402.01608 [pdf, other]: Title: Contingency Analysis of a Grid of Connected EVs for Primary Frequency Control of an Industrial Microgrid Using Efficient Control Scheme

J.N. Sabhahit, S.S. Solanke, V.K. Jadoun, H. Malik, F.P. García Márquez, J.M. Pinar-Pérez

Comments: Published in Energies (MDPI) 2022

Journal-ref: Energies 2022, 15, 3102

Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[171] arXiv:2402.01614 [pdf, other]: Title: L2G2G: a Scalable Local-to-Global Network Embedding with Graph Autoencoders

Ruikang Ouyang, Andrew Elliott, Stratis Limnios, Mihai Cucuringu, Gesine Reinert

Comments: 13 pages, 4 figures, Complex Networks 2023, Volume I, SCI 1141

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[172] arXiv:2402.01621 [pdf, html, other]: Title: Stochastic Two Points Method for Deep Model Zeroth-order Optimization

Yijiang Pang, Jiayu Zhou

Subjects: Machine Learning (cs.LG)
[173] arXiv:2402.01632 [pdf, html, other]: Title: Time-Varying Gaussian Process Bandits with Unknown Prior

Juliusz Ziomek, Masaki Adachi, Michael A. Osborne

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[174] arXiv:2402.01768 [pdf, other]: Title: Enriched Physics-informed Neural Networks for Dynamic Poisson-Nernst-Planck Systems

Xujia Huang, Fajie Wang, Benrong Zhang, Hanqing Liu

Comments: 24 pages, 16 figures, 6 tables

Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[175] arXiv:2402.01785 [pdf, html, other]: Title: DoubleMLDeep: Estimation of Causal Effects with Multimodal Data

Sven Klaassen, Jan Teichert-Kluge, Philipp Bach, Victor Chernozhukov, Martin Spindler, Suhas Vijaykumar

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Econometrics (econ.EM); Methodology (stat.ME); Machine Learning (stat.ML)
[176] arXiv:2402.01790 [pdf, html, other]: Title: An introduction to graphical tensor notation for mechanistic interpretability

Jordan K. Taylor

Comments: 30 pages, 75 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[177] arXiv:2402.01797 [pdf, html, other]: Title: Robust support vector machines via conic optimization

Valentina Cepeda, Andrés Gómez, Shaoning Han

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Computation (stat.CO)
[178] arXiv:2402.01798 [pdf, html, other]: Title: Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning

Guangfeng Yan, Tan Li, Yuanzhang Xiao, Hanxu Hou, Linqi Song

Comments: arXiv admin note: substantial text overlap with arXiv:2402.01160

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[179] arXiv:2402.01799 [pdf, html, other]: Title: Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward

Arnav Chavan, Raghav Magazine, Shubham Kushwaha, Mérouane Debbah, Deepak Gupta

Comments: Accepted at IJCAI '24 (Survey Track), Updated TGI results

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180] arXiv:2402.01801 [pdf, html, other]: Title: Large Language Models for Time Series: A Survey

Xiyuan Zhang, Ranak Roy Chowdhury, Rajesh K. Gupta, Jingbo Shang

Comments: GitHub repository: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[181] arXiv:2402.01802 [pdf, html, other]: Title: An Auction-based Marketplace for Model Trading in Federated Learning

Yue Cui, Liuyi Yao, Yaliang Li, Ziqian Chen, Bolin Ding, Xiaofang Zhou

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[182] arXiv:2402.01811 [pdf, html, other]: Title: A Distributionally Robust Optimisation Approach to Fair Credit Scoring

Pablo Casas, Christophe Mues, Huan Yu

Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[183] arXiv:2402.01821 [pdf, html, other]: Title: Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks

Akshay K. Jagadish, Julian Coda-Forno, Mirko Thalmann, Eric Schulz, Marcel Binz

Comments: 27 pages (9 pages of main text, 4 pages of references, and 14 pages of appendix), 13 figures, and 7 Tables

Journal-ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[184] arXiv:2402.01845 [pdf, html, other]: Title: Multi-Armed Bandits with Interference

Su Jia, Peter Frazier, Nathan Kallus

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[185] arXiv:2402.01849 [pdf, html, other]: Title: Capturing waste collection planning expert knowledge in a fitness function through preference learning

Laura Fernández Díaz, Miriam Fernández Díaz, José Ramón Quevedo, Elena Montañés

Journal-ref: Engineering Applications of Artificial Intelligence 2021 Volume 99 104113

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[186] arXiv:2402.01857 [pdf, html, other]: Title: Position Paper: Assessing Robustness, Privacy, and Fairness in Federated Learning Integrated with Foundation Models

Xi Li, Jiaqi Wang

Comments: Under review

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[187] arXiv:2402.01858 [pdf, html, other]: Title: Explaining latent representations of generative models with large multimodal models

Mengdan Zhu, Zhenke Liu, Bo Pan, Abhinav Angirekula, Liang Zhao

Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2402.01862 [pdf, html, other]: Title: Parametric Feature Transfer: One-shot Federated Learning with Foundation Models

Mahdi Beitollahi, Alex Bie, Sobhan Hemati, Leo Maxime Brunswic, Xu Li, Xi Chen, Guojun Zhang

Comments: 20 pages, 12 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[189] arXiv:2402.01863 [pdf, html, other]: Title: DFML: Decentralized Federated Mutual Learning

Yasser H. Khalil, Amir H. Estiri, Mahdi Beitollahi, Nader Asadi, Sobhan Hemati, Xu Li, Guojun Zhang, Xi Chen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[190] arXiv:2402.01865 [pdf, html, other]: Title: What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement

Xisen Jin, Xiang Ren

Comments: ICML 2024 (Spotlight)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[191] arXiv:2402.01867 [pdf, html, other]: Title: Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision

Jinyan Su, Peilin Yu, Jieyu Zhang, Stephen H. Bach

Comments: Accepted to IEEE International Conference on Big Data 2023

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[192] arXiv:2402.01868 [pdf, html, other]: Title: Challenges in Training PINNs: A Loss Landscape Perspective

Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell

Comments: ICML 2024 Oral; 33 pages (including appendices), 10 figures, 3 tables

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[193] arXiv:2402.01869 [pdf, html, other]: Title: InferCept: Efficient Intercept Support for Augmented Large Language Model Inference

Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[194] arXiv:2402.01879 [pdf, html, other]: Title: $σ$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples

Antonio Emanuele Cinà, Francesco Villani, Maura Pintor, Lea Schönherr, Battista Biggio, Marcello Pelillo

Comments: Paper accepted at International Conference on Learning Representations (ICLR 2025). Code available at this https URL

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2402.01881 [pdf, other]: Title: Large Language Model Agent for Hyper-Parameter Optimization

Siyi Liu, Chen Gao, Yong Li

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[196] arXiv:2402.01886 [pdf, html, other]: Title: Inverse Reinforcement Learning by Estimating Expertise of Demonstrators

Mark Beliaev, Ramtin Pedarsani

Comments: 11 pages, 4 figures, extended version of AAAI publication

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[197] arXiv:2402.01902 [pdf, html, other]: Title: EBV: Electronic Bee-Veterinarian for Principled Mining and Forecasting of Honeybee Time Series

Mst. Shamima Hossain, Christos Faloutsos, Boris Baer, Hyoseung Kim, Vassilis J. Tsotras

Comments: 9 pages, 7 figures, Accepted at 2024 SIAM International Conference on Data Mining (SDM'24)

Subjects: Machine Learning (cs.LG)
[198] arXiv:2402.01909 [pdf, html, other]: Title: On Catastrophic Inheritance of Large Foundation Models

Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang

Comments: Accepted by DMLR

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[199] arXiv:2402.01911 [pdf, html, other]: Title: From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers

Bharat Runwal, Tejaswini Pedapati, Pin-Yu Chen

Comments: Preprint

Subjects: Machine Learning (cs.LG)
[200] arXiv:2402.01920 [pdf, html, other]: Title: Preference Poisoning Attacks on Reward Model Learning

Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
