Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.LG

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for February 2024

Total of 3960 entries : 1-100 101-200 201-300 301-400 401-500 ... 3901-3960
Showing up to 100 entries per page: fewer | more | all
[101] arXiv:2402.01093 [pdf, html, other]
Title: Need a Small Specialized Language Model? Plan Early!
David Grangier, Angelos Katharopoulos, Pierre Ablin, Awni Hannun
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[102] arXiv:2402.01095 [pdf, html, other]
Title: Minimal Sufficient Views: A DNN model making predictions with more evidence has higher accuracy
Keisuke Kawano, Takuro Kutsuna, Keisuke Sano
Comments: 24 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[103] arXiv:2402.01096 [pdf, html, other]
Title: Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance
Wenqi Wei, Ling Liu
Comments: Manuscript accepted to ACM Computing Surveys
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[104] arXiv:2402.01098 [pdf, html, other]
Title: Bayesian Deep Learning for Remaining Useful Life Estimation via Stein Variational Gradient Descent
Luca Della Libera, Jacopo Andreoli, Davide Dalle Pezze, Mirco Ravanelli, Gian Antonio Susto
Comments: 26 pages, 3 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[105] arXiv:2402.01103 [pdf, html, other]
Title: Compositional Generative Modeling: A Single Model is Not All You Need
Yilun Du, Leslie Kaelbling
Comments: ICML 2024 (Position Track)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[106] arXiv:2402.01105 [pdf, html, other]
Title: A Survey for Foundation Models in Autonomous Driving
Haoxiang Gao, Zhongruo Wang, Yaqian Li, Kaiwen Long, Ming Yang, Yiqing Shen
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[107] arXiv:2402.01107 [pdf, html, other]
Title: Simulation of Graph Algorithms with Looped Transformers
Artur Back de Luca, Kimon Fountoulakis
Comments: 55 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS)
[108] arXiv:2402.01109 [pdf, html, other]
Title: Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attack
Tiansheng Huang, Sihao Hu, Ling Liu
Comments: Rejected by ICML2024. Accepted by NeurIPS2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[109] arXiv:2402.01111 [pdf, html, other]
Title: Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao, Yu-Xiang Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[110] arXiv:2402.01114 [pdf, html, other]
Title: Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization
Arezoo Rajabi, Reeya Pimple, Aiswarya Janardhanan, Surudhi Asokraj, Bhaskar Ramasubramanian, Radha Poovendran
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[111] arXiv:2402.01140 [pdf, other]
Title: Root Cause Analysis In Microservice Using Neural Granger Causal Discovery
Cheng-Ming Lin, Ching Chang, Wei-Yao Wang, Kuang-Da Wang, Wen-Chih Peng
Comments: AAAI 2024 Main Track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[112] arXiv:2402.01143 [pdf, html, other]
Title: Learning Network Representations with Disentangled Graph Auto-Encoder
Di Fan, Chuanhou Gao
Comments: 15 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[113] arXiv:2402.01146 [pdf, html, other]
Title: Limited Memory Online Gradient Descent for Kernelized Pairwise Learning with Dynamic Averaging
Hilal AlQuabeh, William de Vazelhes, Bin Gu
Comments: Accepted in AAAI 2024
Subjects: Machine Learning (cs.LG)
[114] arXiv:2402.01147 [pdf, html, other]
Title: Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems
Neharika Jali, Guannan Qu, Weina Wang, Gauri Joshi
Comments: AISTATS 2024; Corrected typos
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[115] arXiv:2402.01160 [pdf, other]
Title: Truncated Non-Uniform Quantization for Distributed SGD
Guangfeng Yan, Tan Li, Yuanzhang Xiao, Congduan Li, Linqi Song
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[116] arXiv:2402.01195 [pdf, html, other]
Title: Conditional Normalizing Flows for Active Learning of Coarse-Grained Molecular Representations
Henrik Schopmans, Pascal Friederich
Journal-ref: Proceedings of the 41st International Conference on Machine Learning (ICML 2024), PMLR 235:43804-43827, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Chemical Physics (physics.chem-ph)
[117] arXiv:2402.01201 [pdf, other]
Title: Few-Shot Class-Incremental Learning with Prior Knowledge
Wenhao Jiang, Duo Li, Menghan Hu, Guangtao Zhai, Xiaokang Yang, Xiao-Ping Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[118] arXiv:2402.01203 [pdf, html, other]
Title: Neural Language of Thought Models
Yi-Fu Wu, Minseung Lee, Sungjin Ahn
Comments: Accepted in ICLR 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2402.01204 [pdf, html, other]
Title: A Survey on Self-Supervised Learning for Non-Sequential Tabular Data
Wei-Yao Wang, Wei-Wei Du, Derek Xu, Wei Wang, Wen-Chih Peng
Comments: ACML-24 Journal Track. The paper list can be found at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[120] arXiv:2402.01206 [pdf, other]
Title: Comparative Evaluation of Weather Forecasting using Machine Learning Models
Md Saydur Rahman, Farhana Akter Tumpa, Md Shazid Islam, Abul Al Arabi, Md Sanzid Bin Hossain, Md Saad Ul Haque
Subjects: Machine Learning (cs.LG)
[121] arXiv:2402.01207 [pdf, html, other]
Title: Efficient Causal Graph Discovery Using Large Language Models
Thomas Jiralerspong, Xiaoyin Chen, Yash More, Vedant Shah, Yoshua Bengio
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[122] arXiv:2402.01208 [pdf, html, other]
Title: Location Agnostic Adaptive Rain Precipitation Prediction using Deep Learning
Md Shazid Islam, Md Saydur Rahman, Md Saad Ul Haque, Farhana Akter Tumpa, Md Sanzid Bin Hossain, Abul Al Arabi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[123] arXiv:2402.01226 [pdf, html, other]
Title: HW-SW Optimization of DNNs for Privacy-preserving People Counting on Low-resolution Infrared Arrays
Matteo Risso, Chen Xie, Francesco Daghero, Alessio Burrello, Seyedmorteza Mollaei, Marco Castellano, Enrico Macii, Massimo Poncino, Daniele Jahier Pagliari
Comments: This paper has been accepted for publication in the DATE 2024 conference IEEE
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[124] arXiv:2402.01231 [pdf, html, other]
Title: Unveiling Delay Effects in Traffic Forecasting: A Perspective from Spatial-Temporal Delay Differential Equations
Qingqing Long, Zheng Fang, Chen Fang, Chong Chen, Pengfei Wang, Yuanchun Zhou
Comments: 11 pages, 7 figures
Subjects: Machine Learning (cs.LG)
[125] arXiv:2402.01238 [pdf, other]
Title: Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training
Sota Kudo, Naoaki Ono, Shigehiko Kanaya, Ming Huang
Journal-ref: Neurocomputing (2025): 130198
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[126] arXiv:2402.01242 [pdf, html, other]
Title: Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness
Guibin Zhang, Yanwei Yue, Kun Wang, Junfeng Fang, Yongduo Sui, Kai Wang, Yuxuan Liang, Dawei Cheng, Shirui Pan, Tianlong Chen
Subjects: Machine Learning (cs.LG)
[127] arXiv:2402.01252 [pdf, other]
Title: Target inductive methods for zero-shot regression
Miriam Fdez-Díaz, José Ramón Quevedo, Elena Montañés
Journal-ref: Information Sciences ISSN: 0020-0255 2022 Volumen: 599 P\'aginas: 44-63
Subjects: Machine Learning (cs.LG)
[128] arXiv:2402.01261 [pdf, other]
Title: TEDDY: Trimming Edges with Degree-based Discrimination strategY
Hyunjin Seo, Jihun Yun, Eunho Yang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[129] arXiv:2402.01262 [pdf, html, other]
Title: Class incremental learning with probability dampening and cascaded gated classifier
Jary Pomponi, Alessio Devoto, Simone Scardapane
Comments: Previously called "Cascaded Scaling Classifier: class incremental learning with probability scaling ". The official code is available this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[130] arXiv:2402.01263 [pdf, other]
Title: A Differentiable Partially Observable Generalized Linear Model with Forward-Backward Message Passing
Chengrui Li, Weihan Li, Yule Wang, Anqi Wu
Subjects: Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[131] arXiv:2402.01264 [pdf, other]
Title: Direct side information learning for zero-shot regression
Miriam Fdez-Díaz, Elena Montañés, José Ramón Quevedo
Journal-ref: Neurocomputing 2023 Volumen 561 126873
Subjects: Machine Learning (cs.LG)
[132] arXiv:2402.01293 [pdf, other]
Title: Can MLLMs Perform Text-to-Image In-Context Learning?
Yuchen Zeng, Wonjun Kang, Yicong Chen, Hyung Il Koo, Kangwook Lee
Comments: Accepted at COLM 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[133] arXiv:2402.01295 [pdf, html, other]
Title: ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast
Wanghan Xu, Kang Chen, Tao Han, Hao Chen, Wanli Ouyang, Lei Bai
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[134] arXiv:2402.01296 [pdf, html, other]
Title: Bi-CryptoNets: Leveraging Different-Level Privacy for Encrypted Inference
Man-Jie Yuan, Zheng Zou, Wei Gao
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2402.01297 [pdf, html, other]
Title: Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum
Tin Sum Cheng, Aurelien Lucchi, Anastasis Kratsios, David Belius
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[136] arXiv:2402.01302 [pdf, html, other]
Title: A Unified Framework for Center-based Clustering of Distributed Data
Aleksandar Armacki, Dragana Bajović, Dušan Jakovetić, Soummya Kar
Comments: 49 pages, 9 figures, 7 tables
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA)
[137] arXiv:2402.01306 [pdf, html, other]
Title: KTO: Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh, Winnie Xu, Niklas Muennighoff, Dan Jurafsky, Douwe Kiela
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[138] arXiv:2402.01327 [pdf, html, other]
Title: Supervised Algorithmic Fairness in Distribution Shifts: A Survey
Minglai Shao, Dong Li, Chen Zhao, Xintao Wu, Yujie Lin, Qin Tian
Comments: IJCAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[139] arXiv:2402.01340 [pdf, html, other]
Title: SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding
Chanho Park, Namyoon Lee
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Signal Processing (eess.SP)
[140] arXiv:2402.01341 [pdf, html, other]
Title: Fundamental Properties of Causal Entropy and Information Gain
Francisco N. F. Q. Simoes, Mehdi Dastani, Thijs van Ommen
Comments: In Proceedings of the conference CLeaR (Causal Learning and Reasoning) 2024
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[141] arXiv:2402.01342 [pdf, html, other]
Title: Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion
Zexi Li, Zhiqi Li, Jie Lin, Tao Shen, Tao Lin, Chao Wu
Comments: preprint
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[142] arXiv:2402.01343 [pdf, html, other]
Title: Shapelet-based Model-agnostic Counterfactual Local Explanations for Time Series Classification
Qi Huang, Wei Chen, Thomas Bäck, Niki van Stein
Comments: The paper has been accepted by the XAI4Sci workshop of AAAI 2024
Subjects: Machine Learning (cs.LG)
[143] arXiv:2402.01344 [pdf, html, other]
Title: Monotone, Bi-Lipschitz, and Polyak-Lojasiewicz Networks
Ruigang Wang, Krishnamurthy Dvijotham, Ian R. Manchester
Comments: International Conference on Machine Learning, Vienna, Austria, July 21 -- 17, 2024
Subjects: Machine Learning (cs.LG)
[144] arXiv:2402.01348 [pdf, html, other]
Title: CORE: Mitigating Catastrophic Forgetting in Continual Learning through Cognitive Replay
Jianshu Zhang, Yankai Fu, Ziheng Peng, Dongyu Yao, Kun He
Comments: Accepted by CogSci24 as oral presentation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[145] arXiv:2402.01350 [pdf, html, other]
Title: pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning
Liping Yi, Han Yu, Chao Ren, Heng Zhang, Gang Wang, Xiaoguang Liu, Xiaoxiao Li
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[146] arXiv:2402.01359 [pdf, html, other]
Title: TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time (Extended Version)
Zeliang Kan, Shae McFadden, Daniel Arp, Feargus Pendlebury, Roberto Jordaney, Johannes Kinder, Fabio Pierazzi, Lorenzo Cavallaro
Comments: 30 pages. arXiv admin note: text overlap with arXiv:1807.07838
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Performance (cs.PF)
[147] arXiv:2402.01361 [pdf, html, other]
Title: To the Max: Reinventing Reward in Reinforcement Learning
Grigorii Veviurko, Wendelin Böhmer, Mathijs de Weerdt
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:49455-49470, 2024
Subjects: Machine Learning (cs.LG)
[148] arXiv:2402.01369 [pdf, html, other]
Title: On the Multi-modal Vulnerability of Diffusion Models
Dingcheng Yang, Yang Bai, Xiaojun Jia, Yang Liu, Xiaochun Cao, Wenjian Yu
Comments: Accepted at ICML2024 Workshop on Trustworthy Multi-modal Foundation Models and AI Agents (TiFA)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2402.01371 [pdf, other]
Title: Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda, Shalabh Bhatnagar
Subjects: Machine Learning (cs.LG)
[150] arXiv:2402.01379 [pdf, html, other]
Title: Regularized boosting with an increasing coefficient magnitude stop criterion as meta-learner in hyperparameter optimization stacking ensemble
Laura Fdez-Díaz, José Ramón Quevedo, Elena Montañés
Journal-ref: Neurocomputing 2023 Volume 551 126516
Subjects: Machine Learning (cs.LG)
[151] arXiv:2402.01399 [pdf, other]
Title: A Probabilistic Model Behind Self-Supervised Learning
Alice Bizeul, Bernhard Schölkopf, Carl Allen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[152] arXiv:2402.01401 [pdf, html, other]
Title: An Information Theoretic Approach to Machine Unlearning
Jack Foster, Kyle Fogarty, Stefan Schoepf, Zack Dugue, Cengiz Öztireli, Alexandra Brintrup
Comments: Updated, new low-dimensional experiments and updated perspective on unlearning from an information theoretic view
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[153] arXiv:2402.01408 [pdf, html, other]
Title: Counterfactual Concept Bottleneck Models
Gabriele Dominici, Pietro Barbiero, Francesco Giannini, Martin Gjoreski, Giuseppe Marra, Marc Langheinrich
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[154] arXiv:2402.01415 [pdf, other]
Title: SMLP: Symbolic Machine Learning Prover
Franz Brauße, Zurab Khasidashvili, Konstantin Korovin
Comments: 12 pages, 4 figures. (submitted)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO); Symbolic Computation (cs.SC); Optimization and Control (math.OC)
[155] arXiv:2402.01431 [pdf, html, other]
Title: Approximate Control for Continuous-Time POMDPs
Yannick Eich, Bastian Alt, Heinz Koeppl
Comments: To be published in AISTATS 2024
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Quantitative Methods (q-bio.QM)
[156] arXiv:2402.01439 [pdf, other]
Title: From Words to Molecules: A Survey of Large Language Models in Chemistry
Chang Liao, Yemin Yu, Yu Mei, Ying Wei
Comments: Submitted to IJCAI 2024 survey track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM); Quantitative Methods (q-bio.QM)
[157] arXiv:2402.01440 [pdf, html, other]
Title: A Survey of Few-Shot Learning on Graphs: from Meta-Learning to Pre-Training and Prompt Learning
Xingtong Yu, Yuan Fang, Zemin Liu, Yuxia Wu, Zhihao Wen, Jianyuan Bo, Xinming Zhang, Steven C.H. Hoi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[158] arXiv:2402.01444 [pdf, other]
Title: Mission Critical -- Satellite Data is a Distinct Modality in Machine Learning
Esther Rolf, Konstantin Klemmer, Caleb Robinson, Hannah Kerner
Comments: 15 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2402.01450 [pdf, other]
Title: Improving importance estimation in covariate shift for providing accurate prediction error
Laura Fdez-Díaz, Sara González Tomillo, Elena Montañés, José Ramón Quevedo
Journal-ref: Expert Systems With Applications 2022 Volume 193 116376
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[160] arXiv:2402.01454 [pdf, html, other]
Title: Integrating Large Language Models in Causal Discovery: A Statistical Causal Approach
Masayuki Takayama, Tadahisa Okuda, Thong Pham, Tatsuyoshi Ikenoue, Shingo Fukuma, Shohei Shimizu, Akiyoshi Sannai
Journal-ref: Published in Transactions in Machine Learning Research (05/2025) https://openreview.net/forum?id=Reh1S8rxfh
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[161] arXiv:2402.01476 [pdf, html, other]
Title: Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes
Yingyi Chen, Qinghua Tao, Francesco Tonin, Johan A.K. Suykens
Comments: We propose Kernel-Eigen Pair Sparse Variational Gaussian Processes (KEP-SVGP) for building uncertainty-aware self-attention where the asymmetry of attention kernel is tackled by KSVD and a reduced time complexity is acquired
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[162] arXiv:2402.01481 [pdf, html, other]
Title: Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains
Jiale Zhao, Wanru Zhuang, Jia Song, Yaqi Li, Shuqi Lu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM)
[163] arXiv:2402.01484 [pdf, html, other]
Title: Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?
Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer
Subjects: Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[164] arXiv:2402.01514 [pdf, html, other]
Title: Mapping the Multiverse of Latent Representations
Jeremy Wayland, Corinna Coupette, Bastian Rieck
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT); Machine Learning (stat.ML)
[165] arXiv:2402.01515 [pdf, html, other]
Title: Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence
Yichuan Deng, Zhao Song, Chiwun Yang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[166] arXiv:2402.01528 [pdf, html, other]
Title: Decoding Speculative Decoding
Minghao Yan, Saurabh Agarwal, Shivaram Venkataraman
Comments: Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2025)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[167] arXiv:2402.01543 [pdf, html, other]
Title: Adaptive Optimization for Prediction with Missing Data
Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[168] arXiv:2402.01546 [pdf, other]
Title: Privacy-Preserving Distributed Learning for Residential Short-Term Load Forecasting
Yi Dong, Yingjie Wang, Mariana Gama, Mustafa A. Mustafa, Geert Deconinck, Xiaowei Huang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
[169] arXiv:2402.01567 [pdf, html, other]
Title: Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise
Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook, Yan Dai
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[170] arXiv:2402.01608 [pdf, other]
Title: Contingency Analysis of a Grid of Connected EVs for Primary Frequency Control of an Industrial Microgrid Using Efficient Control Scheme
J.N. Sabhahit, S.S. Solanke, V.K. Jadoun, H. Malik, F.P. García Márquez, J.M. Pinar-Pérez
Comments: Published in energies (MDPI) 2022
Journal-ref: Energies 2022, 15, 3102
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[171] arXiv:2402.01614 [pdf, other]
Title: L2G2G: a Scalable Local-to-Global Network Embedding with Graph Autoencoders
Ruikang Ouyang, Andrew Elliott, Stratis Limnios, Mihai Cucuringu, Gesine Reinert
Comments: 13 pages, 4 figures, Complex Networks 2023, Volume I, SCI 1141
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[172] arXiv:2402.01621 [pdf, html, other]
Title: Stochastic Two Points Method for Deep Model Zeroth-order Optimization
Yijiang Pang, Jiayu Zhou
Subjects: Machine Learning (cs.LG)
[173] arXiv:2402.01632 [pdf, html, other]
Title: Time-Varying Gaussian Process Bandits with Unknown Prior
Juliusz Ziomek, Masaki Adachi, Michael A. Osborne
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[174] arXiv:2402.01768 [pdf, other]
Title: Enriched Physics-informed Neural Networks for Dynamic Poisson-Nernst-Planck Systems
Xujia Huang, Fajie Wang, Benrong Zhang, Hanqing Liu
Comments: 24 pages, 16 figures, 6 tables
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[175] arXiv:2402.01785 [pdf, html, other]
Title: DoubleMLDeep: Estimation of Causal Effects with Multimodal Data
Sven Klaassen, Jan Teichert-Kluge, Philipp Bach, Victor Chernozhukov, Martin Spindler, Suhas Vijaykumar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Econometrics (econ.EM); Methodology (stat.ME); Machine Learning (stat.ML)
[176] arXiv:2402.01790 [pdf, html, other]
Title: An introduction to graphical tensor notation for mechanistic interpretability
Jordan K. Taylor
Comments: 30 pages, 75 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[177] arXiv:2402.01797 [pdf, html, other]
Title: Robust support vector machines via conic optimization
Valentina Cepeda, Andrés Gómez, Shaoning Han
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Computation (stat.CO)
[178] arXiv:2402.01798 [pdf, html, other]
Title: Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning
Guangfeng Yan, Tan Li, Yuanzhang Xiao, Hanxu Hou, Linqi Song
Comments: arXiv admin note: substantial text overlap with arXiv:2402.01160
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[179] arXiv:2402.01799 [pdf, html, other]
Title: Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward
Arnav Chavan, Raghav Magazine, Shubham Kushwaha, Mérouane Debbah, Deepak Gupta
Comments: Accepted at IJCAI '24 (Survey Track), Updated TGI results
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180] arXiv:2402.01801 [pdf, html, other]
Title: Large Language Models for Time Series: A Survey
Xiyuan Zhang, Ranak Roy Chowdhury, Rajesh K. Gupta, Jingbo Shang
Comments: GitHub repository: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[181] arXiv:2402.01802 [pdf, html, other]
Title: An Auction-based Marketplace for Model Trading in Federated Learning
Yue Cui, Liuyi Yao, Yaliang Li, Ziqian Chen, Bolin Ding, Xiaofang Zhou
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[182] arXiv:2402.01811 [pdf, html, other]
Title: A Distributionally Robust Optimisation Approach to Fair Credit Scoring
Pablo Casas, Christophe Mues, Huan Yu
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[183] arXiv:2402.01821 [pdf, html, other]
Title: Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
Akshay K. Jagadish, Julian Coda-Forno, Mirko Thalmann, Eric Schulz, Marcel Binz
Comments: 27 pages (9 pages of main text, 4 pages of references, and 14 pages of appendix), 13 figures, and 7 Tables
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[184] arXiv:2402.01845 [pdf, html, other]
Title: Multi-Armed Bandits with Interference
Su Jia, Peter Frazier, Nathan Kallus
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[185] arXiv:2402.01849 [pdf, html, other]
Title: Capturing waste collection planning expert knowledge in a fitness function through preference learning
Laura Fernández Díaz, Miriam Fernández Díaz, José Ramón Quevedo, Elena Montañés
Journal-ref: Engineering Applications of Artificial Intelligence 2021 Volume 99 104113
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[186] arXiv:2402.01857 [pdf, html, other]
Title: Position Paper: Assessing Robustness, Privacy, and Fairness in Federated Learning Integrated with Foundation Models
Xi Li, Jiaqi Wang
Comments: Under review
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[187] arXiv:2402.01858 [pdf, html, other]
Title: Explaining latent representations of generative models with large multimodal models
Mengdan Zhu, Zhenke Liu, Bo Pan, Abhinav Angirekula, Liang Zhao
Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2402.01862 [pdf, html, other]
Title: Parametric Feature Transfer: One-shot Federated Learning with Foundation Models
Mahdi Beitollahi, Alex Bie, Sobhan Hemati, Leo Maxime Brunswic, Xu Li, Xi Chen, Guojun Zhang
Comments: 20 pages, 12 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[189] arXiv:2402.01863 [pdf, html, other]
Title: DFML: Decentralized Federated Mutual Learning
Yasser H. Khalil, Amir H. Estiri, Mahdi Beitollahi, Nader Asadi, Sobhan Hemati, Xu Li, Guojun Zhang, Xi Chen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[190] arXiv:2402.01865 [pdf, html, other]
Title: What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement
Xisen Jin, Xiang Ren
Comments: ICML 2024 (Spotlight)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[191] arXiv:2402.01867 [pdf, html, other]
Title: Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision
Jinyan Su, Peilin Yu, Jieyu Zhang, Stephen H. Bach
Comments: Accepted to IEEE International Conference on Big Data 2023
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[192] arXiv:2402.01868 [pdf, html, other]
Title: Challenges in Training PINNs: A Loss Landscape Perspective
Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell
Comments: ICML 2024 Oral; 33 pages (including appendices), 10 figures, 3 tables
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[193] arXiv:2402.01869 [pdf, html, other]
Title: InferCept: Efficient Intercept Support for Augmented Large Language Model Inference
Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[194] arXiv:2402.01879 [pdf, html, other]
Title: $σ$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples
Antonio Emanuele Cinà, Francesco Villani, Maura Pintor, Lea Schönherr, Battista Biggio, Marcello Pelillo
Comments: Paper accepted at International Conference on Learning Representations (ICLR 2025). Code available at this https URL
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2402.01881 [pdf, other]
Title: Large Language Model Agent for Hyper-Parameter Optimization
Siyi Liu, Chen Gao, Yong Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[196] arXiv:2402.01886 [pdf, html, other]
Title: Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
Mark Beliaev, Ramtin Pedarsani
Comments: 11 pages, 4 figures, extended version of AAAI publication
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[197] arXiv:2402.01902 [pdf, html, other]
Title: EBV: Electronic Bee-Veterinarian for Principled Mining and Forecasting of Honeybee Time Series
Mst. Shamima Hossain, Christos Faloutsos, Boris Baer, Hyoseung Kim, Vassilis J. Tsotras
Comments: 9 pages, 7 figure, Accepted at 2024 SIAM International Conference on Data Mining (SDM'24)
Subjects: Machine Learning (cs.LG)
[198] arXiv:2402.01909 [pdf, html, other]
Title: On Catastrophic Inheritance of Large Foundation Models
Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang
Comments: Accepted by DMLR
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[199] arXiv:2402.01911 [pdf, html, other]
Title: From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
Bharat Runwal, Tejaswini Pedapati, Pin-Yu Chen
Comments: Preprint
Subjects: Machine Learning (cs.LG)
[200] arXiv:2402.01920 [pdf, html, other]
Title: Preference Poisoning Attacks on Reward Model Learning
Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Total of 3960 entries : 1-100 101-200 201-300 301-400 401-500 ... 3901-3960
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack