Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.LG

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for December 2023

Total of 2733 entries : 201-450 251-500 501-750 751-1000 ... 2501-2733
Showing up to 250 entries per page: fewer | more | all
[201] arXiv:2312.02470 [pdf, other]
Title: Generator Born from Classifier
Runpeng Yu, Xinchao Wang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[202] arXiv:2312.02473 [pdf, other]
Title: NeutronStream: A Dynamic GNN Training Framework with Sliding Window for Graph Streams
Chaoyi Chen, Dechao Gao, Yanfeng Zhang, Qiange Wang, Zhenbo Fu, Xuecang Zhang, Junhua Zhu, Yu Gu, Ge Yu
Comments: 12 pages, 15 figures
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[203] arXiv:2312.02490 [pdf, html, other]
Title: Constrained Twin Variational Auto-Encoder for Intrusion Detection in IoT Systems
Phai Vu Dinh, Quang Uy Nguyen, Dinh Thai Hoang, Diep N. Nguyen, Son Pham Bao, Eryk Dutkiewicz
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[204] arXiv:2312.02491 [pdf, other]
Title: Pseudo Replay-based Class Continual Learning for Online New Category Anomaly Detection in Advanced Manufacturing
Yuxuan Li, Tianxin Xie, Chenang Liu, Zhangyue Shi
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[205] arXiv:2312.02515 [pdf, html, other]
Title: mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs
Zhengmao Ye, Dengchun Li, Zetao Hu, Tingfeng Lan, Jian Sha, Sicong Zhang, Lei Duan, Jie Zuo, Hui Lu, Yuanchun Zhou, Mingjie Tang
Comments: 14 pages, 16 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[206] arXiv:2312.02517 [pdf, other]
Title: Simplifying Neural Network Training Under Class Imbalance
Ravid Shwartz-Ziv, Micah Goldblum, Yucen Lily Li, C. Bayan Bruss, Andrew Gordon Wilson
Comments: NeurIPS 2023. Code available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[207] arXiv:2312.02522 [pdf, html, other]
Title: MASP: Scalable GNN-based Planning for Multi-Agent Navigation
Xinyi Yang, Xinting Yang, Chao Yu, Jiayu Chen, Wenbo Ding, Huazhong Yang, Yu Wang
Comments: Submitted to IEEE RA-L
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[208] arXiv:2312.02530 [pdf, other]
Title: MEMTO: Memory-guided Transformer for Multivariate Time Series Anomaly Detection
Junho Song, Keonwoo Kim, Jeonglyul Oh, Sungzoon Cho
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[209] arXiv:2312.02554 [pdf, html, other]
Title: ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference
Tianchi Cai, Xierui Song, Jiyan Jiang, Fei Teng, Jinjie Gu, Guannan Zhang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[210] arXiv:2312.02566 [pdf, other]
Title: Structured World Representations in Maze-Solving Transformers
Michael Igorevich Ivanitskiy, Alex F. Spies, Tilman Räuker, Guillaume Corlouer, Chris Mathwin, Lucia Quirke, Can Rager, Rusheb Shah, Dan Valentine, Cecilia Diniz Behn, Katsumi Inoue, Samy Wu Fung
Comments: 15 pages, 18 figures, 15 tables. Corresponding author: Michael Ivanitskiy ([email protected]). Code available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[211] arXiv:2312.02573 [pdf, html, other]
Title: UTBoost: Gradient Boosted Decision Trees for Uplift Modeling
Junjie Gao, Xiangyu Zheng, DongDong Wang, Zhixiang Huang, Bangqi Zheng, Kai Yang
Comments: Accepted by PRICAI 2024
Journal-ref: PRICAI 2024: Trends in Artificial Intelligence. PRICAI 2024. Lecture Notes in Computer Science(), vol 15281
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[212] arXiv:2312.02592 [pdf, html, other]
Title: FRAPPE: A Group Fairness Framework for Post-Processing Everything
Alexandru Tifrea, Preethi Lahoti, Ben Packer, Yoni Halpern, Ahmad Beirami, Flavien Prost
Comments: Conference paper at ICML 2024
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[213] arXiv:2312.02596 [pdf, html, other]
Title: LSTSVR-PI: Least square twin support vector regression with privileged information
Anuradha Kumari, M. Tanveer
Subjects: Machine Learning (cs.LG)
[214] arXiv:2312.02611 [pdf, other]
Title: Privacy-Aware Data Acquisition under Data Similarity in Regression Markets
Shashi Raj Pandey, Pierre Pinson, Petar Popovski
Comments: Submitted to IEEE Transactions on Neural Networks and Learning Systems
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Science and Game Theory (cs.GT)
[215] arXiv:2312.02614 [pdf, html, other]
Title: Prompt Optimization via Adversarial In-Context Learning
Xuan Long Do, Yiran Zhao, Hannah Brown, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Shieh, Junxian He
Comments: ACL 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[216] arXiv:2312.02615 [pdf, html, other]
Title: Projection Regret: Reducing Background Bias for Novelty Detection via Diffusion Models
Sungik Choi, Hankook Lee, Honglak Lee, Moontae Lee
Comments: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[217] arXiv:2312.02619 [pdf, other]
Title: Rethinking and Simplifying Bootstrapped Graph Latents
Wangbin Sun, Jintang Li, Liang Chen, Bingzhe Wu, Yatao Bian, Zibin Zheng
Comments: Accepted by WSDM 2024
Subjects: Machine Learning (cs.LG)
[218] arXiv:2312.02622 [pdf, html, other]
Title: On the Initialization of Graph Neural Networks
Jiahang Li, Yakun Song, Xiang Song, David Paul Wipf
Comments: Accepted by ICML 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[219] arXiv:2312.02646 [pdf, html, other]
Title: SAMSGL: Series-Aligned Multi-Scale Graph Learning for Spatio-Temporal Forecasting
Xiaobei Zou, Luolin Xiong, Yang Tang, Jürgen Kurths
Comments: Accepted by Chaos
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[220] arXiv:2312.02658 [pdf, other]
Title: Do AI models produce better weather forecasts than physics-based models? A quantitative evaluation case study of Storm Ciarán
Andrew J. Charlton-Perez, Helen F. Dacre, Simon Driscoll, Suzanne L. Gray, Ben Harvey, Natalie J. Harvey, Kieran M. R. Hunt, Robert W. Lee, Ranjini Swaminathan, Remy Vandaele, Ambrogio Volonté
Subjects: Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[221] arXiv:2312.02661 [pdf, html, other]
Title: A Self-Commissioning Edge Computing Method for Data-Driven Anomaly Detection in Power Electronic Systems
Pere Izquierdo Gomez, Miguel E. Lopez Gajardo, Nenad Mijatovic, Tomislav Dragicevic
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[222] arXiv:2312.02674 [pdf, html, other]
Title: Amortized Bayesian Decision Making for simulation-based models
Mila Gorecki, Jakob H. Macke, Michael Deistler
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[223] arXiv:2312.02682 [pdf, other]
Title: H-GAP: Humanoid Control with a Generalist Planner
Zhengyao Jiang, Yingchen Xu, Nolan Wagener, Yicheng Luo, Michael Janner, Edward Grefenstette, Tim Rocktäschel, Yuandong Tian
Comments: 18 pages including appendix, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[224] arXiv:2312.02708 [pdf, other]
Title: Provable Adversarial Robustness for Group Equivariant Tasks: Graphs, Point Clouds, Molecules, and More
Jan Schuchardt, Yan Scholten, Stephan Günnemann
Comments: Accepted at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[225] arXiv:2312.02720 [pdf, other]
Title: Towards the Inferrence of Structural Similarity of Combinatorial Landscapes
Mingyu Huang, Ke Li
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[226] arXiv:2312.02730 [pdf, html, other]
Title: Towards Measuring Representational Similarity of Large Language Models
Max Klabunde, Mehdi Ben Amor, Michael Granitzer, Florian Lemmerich
Comments: Extended abstract in UniReps Workshop @ NeurIPS 2023
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[227] arXiv:2312.02739 [pdf, other]
Title: LExCI: A Framework for Reinforcement Learning with Embedded Systems
Kevin Badalian, Lucas Koch, Tobias Brinkmann, Mario Picerno, Marius Wegener, Sung-Yong Lee, Jakob Andert
Comments: The code, models, and data used for this work are available in a separate branch of LExCI's GitHub repository (this https URL). This paper has been submitted to Applied Intelligence (this https URL). 2024-06-27: Updated the footnote on the title page so that it provides information about the paper's Version of Record
Journal-ref: Applied Intelligence (2024)
Subjects: Machine Learning (cs.LG)
[228] arXiv:2312.02770 [pdf, other]
Title: Learning "Look-Ahead" Nonlocal Traffic Dynamics in a Ring Road
Chenguang Zhao, Huan Yu
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[229] arXiv:2312.02780 [pdf, html, other]
Title: Scaling Laws for Adversarial Attacks on Language Model Activations
Stanislav Fort
Comments: 15 pages, 9 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[230] arXiv:2312.02798 [pdf, html, other]
Title: Weakly Supervised Detection of Hallucinations in LLM Activations
Miriam Rateike, Celia Cintas, John Wamburu, Tanya Akumu, Skyler Speakman
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[231] arXiv:2312.02804 [pdf, html, other]
Title: Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems
Céline Comte, Matthieu Jonckheere, Jaron Sanders, Albert Senen-Cerda
Comments: 60 pages, 5 figures. Extended numerical results in section 6 and included sample complexity in section 5
Subjects: Machine Learning (cs.LG); Performance (cs.PF); Optimization and Control (math.OC); Probability (math.PR)
[232] arXiv:2312.02826 [pdf, html, other]
Title: Calibrated Adaptive Teacher for Domain Adaptive Intelligent Fault Diagnosis
Florent Forest, Olga Fink
Comments: Accepted for publication in Sensors. 24 pages
Journal-ref: Sensors, 24(23) 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP); Machine Learning (stat.ML)
[233] arXiv:2312.02829 [pdf, other]
Title: MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition
Nicolas Menet (1 and 2), Michael Hersche (1 and 2), Geethan Karunaratne (1), Luca Benini (2), Abu Sebastian (1), Abbas Rahimi (1) ((1) IBM Research - Zurich, (2) ETH Zurich)
Comments: accepted in NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[234] arXiv:2312.02852 [pdf, html, other]
Title: Expert-guided Bayesian Optimisation for Human-in-the-loop Experimental Design of Known Systems
Tom Savage, Ehecatl Antonio del Rio Chanona
Comments: NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World. Main text: 6 pages
Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC); Optimization and Control (math.OC)
[235] arXiv:2312.02858 [pdf, html, other]
Title: Towards Causal Representations of Climate Model Data
Julien Boussard, Chandni Nagda, Julia Kaltenborn, Charlotte Emilie Elektra Lange, Philippe Brouillard, Yaniv Gurwicz, Peer Nowack, David Rolnick
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Atmospheric and Oceanic Physics (physics.ao-ph); Methodology (stat.ME)
[236] arXiv:2312.02859 [pdf, html, other]
Title: Lessons from Usable ML Deployments and Application to Wind Turbine Monitoring
Alexandra Zytek, Wei-En Wang, Sofia Koukoura, Kalyan Veeramachaneni
Comments: Presented in XAI in Action: Past, Present, and Future Applications @ NeurIPS 2023. 8 pages, 3 figures
Subjects: Machine Learning (cs.LG)
[237] arXiv:2312.02867 [pdf, html, other]
Title: Semi-Supervised Health Index Monitoring with Feature Generation and Fusion
Gaëtan Frusque, Ismail Nejjar, Majid Nabavi, Olga Fink
Comments: 13 pages, 8 figures
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[238] arXiv:2312.02871 [pdf, other]
Title: Attention-enhanced neural differential equations for physics-informed deep learning of ion transport
Danyal Rehman, John H. Lienhard
Comments: 8 pages, 2 figures. Accepted in the NeurIPS Machine Learning and the Physical Sciences Workshop
Subjects: Machine Learning (cs.LG); Mathematical Physics (math-ph); Computational Physics (physics.comp-ph)
[239] arXiv:2312.02872 [pdf, other]
Title: Experimental Insights Towards Explainable and Interpretable Pedestrian Crossing Prediction
Angie Nataly Melo, Carlota Salinas, Miguel Angel Sotelo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Systems and Control (eess.SY)
[240] arXiv:2312.02873 [pdf, other]
Title: Toward autocorrection of chemical process flowsheets using large language models
Lukas Schulze Balhorn, Marc Caballero, Artur M. Schweidtmann
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[241] arXiv:2312.02901 [pdf, html, other]
Title: Concept Drift Adaptation in Text Stream Mining Settings: A Systematic Review
Cristiano Mesquita Garcia, Ramon Simoes Abilio, Alessandro Lameiras Koerich, Alceu de Souza Britto Jr., Jean Paul Barddal
Comments: 69 pages
Journal-ref: ACM Transactions on Intelligent Systems and Technology. 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[242] arXiv:2312.02984 [pdf, other]
Title: Diff-GO: Diffusion Goal-Oriented Communications to Achieve Ultra-High Spectrum Efficiency
Achintha Wijesinghe, Songyang Zhang, Suchinthaka Wanninayaka, Weiwei Wang, Zhi Ding
Comments: Submitted to IEEE International Conference on Communications (ICC) 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Signal Processing (eess.SP)
[243] arXiv:2312.03002 [pdf, html, other]
Title: The mechanistic basis of data dependence and abrupt learning in an in-context classification task
Gautam Reddy
Subjects: Machine Learning (cs.LG)
[244] arXiv:2312.03005 [pdf, html, other]
Title: Few-Shot Anomaly Detection with Adversarial Loss for Robust Feature Representations
Jae Young Lee, Wonjun Lee, Jaehyun Choi, Yongkwi Lee, Young Seog Yoon
Comments: BMVC 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[245] arXiv:2312.03008 [pdf, html, other]
Title: Deep Reinforcement Learning for Community Battery Scheduling under Uncertainties of Load, PV Generation, and Energy Prices
Jiarong Fan, Hao Wang
Comments: The 7th IEEE Conference on Energy Internet and Energy System Integration (EI2 2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[246] arXiv:2312.03014 [pdf, html, other]
Title: Foundation Models for Weather and Climate Data Understanding: A Comprehensive Survey
Shengchao Chen, Guodong Long, Jing Jiang, Dikai Liu, Chengqi Zhang
Comments: Ongoing work. Survey Paper. 35 pages, 2 figures, 4 tables. The first work to comprehensively and systematically summarize DL-based weather and climate data understanding, paving the way for the development of weather and climate foundation models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
[247] arXiv:2312.03017 [pdf, html, other]
Title: AI-driven emergence of frequency information non-uniform distribution via THz metasurface spectrum prediction
Xiaohua Xing, Yuqi Ren, Die Zou, Qiankun Zhang, Bingxuan Mao, Jianquan Yao, Deyi Xiong, Shuang Zhang, Liang Wu
Comments: 11 pages, 4 figures
Subjects: Machine Learning (cs.LG); Optics (physics.optics)
[248] arXiv:2312.03037 [pdf, other]
Title: Analysis and mining of low-carbon and energy-saving tourism data characteristics based on machine learning algorithm
Lukasz Wierzbinski
Subjects: Machine Learning (cs.LG)
[249] arXiv:2312.03038 [pdf, html, other]
Title: Sample-based Dynamic Hierarchical Transformer with Layer and Head Flexibility via Contextual Bandit
Fanfei Meng, Lele Zhang, Yu Chen, Yuxin Wang
Comments: We miss some authorship information. And miss some important information in references
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[250] arXiv:2312.03041 [pdf, other]
Title: Transformer-Based Deep Learning Model for Bored Pile Load-Deformation Prediction in Bangkok Subsoil
Sompote Youwai, Chissanupong Thongnoo
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[251] arXiv:2312.03044 [pdf, html, other]
Title: REST: Enhancing Group Robustness in DNNs through Reweighted Sparse Training
Jiaxu Zhao, Lu Yin, Shiwei Liu, Meng Fang, Mykola Pechenizkiy
Subjects: Machine Learning (cs.LG)
[252] arXiv:2312.03051 [pdf, html, other]
Title: Generating Interpretable Networks using Hypernetworks
Isaac Liao, Ziming Liu, Max Tegmark
Comments: 15 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[253] arXiv:2312.03096 [pdf, other]
Title: What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes
Victor Lecomte, Kushal Thaman, Rylan Schaeffer, Naomi Bashkansky, Trevor Chow, Sanmi Koyejo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[254] arXiv:2312.03120 [pdf, html, other]
Title: The Landscape of Modern Machine Learning: A Review of Machine, Distributed and Federated Learning
Omer Subasi, Oceane Bel, Joseph Manzano, Kevin Barker
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[255] arXiv:2312.03140 [pdf, html, other]
Title: FlexModel: A Framework for Interpretability of Distributed Large Language Models
Matthew Choi, Muhammad Adil Asif, John Willes, David Emerson
Comments: 14 pages, 8 figures. To appear at the Socially Responsible Language Modelling Research (SoLaR) Workshop, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[256] arXiv:2312.03147 [pdf, html, other]
Title: Neural parameter calibration and uncertainty quantification for epidemic forecasting
Thomas Gaskin, Tim Conrad, Grigorios A. Pavliotis, Christof Schütte
Journal-ref: PLOS ONE 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Physics and Society (physics.soc-ph)
[257] arXiv:2312.03151 [pdf, html, other]
Title: Multitask Learning Can Improve Worst-Group Outcomes
Atharva Kulkarni, Lucio Dery, Amrith Setlur, Aditi Raghunathan, Ameet Talwalkar, Graham Neubig
Comments: 20 pages, 7 tables, 6 Figures
Subjects: Machine Learning (cs.LG)
[258] arXiv:2312.03166 [pdf, html, other]
Title: Deep Learning for Fast Inference of Mechanistic Models' Parameters
Maxim Borisyak, Stefan Born, Peter Neubauer, Mariano Nicolas Cruz-Bournazou
Comments: 7 pages, 3 figures
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[259] arXiv:2312.03176 [pdf, html, other]
Title: Active Learning for Abrupt Shifts Change-point Detection via Derivative-Aware Gaussian Processes
Hao Zhao, Rong Pan
Subjects: Machine Learning (cs.LG)
[260] arXiv:2312.03177 [pdf, other]
Title: Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning
Pankayaraj Pathmanathan, Natalia Díaz-Rodríguez, Javier Del Ser
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[261] arXiv:2312.03186 [pdf, html, other]
Title: Data-Driven Traffic Reconstruction and Kernel Methods for Identifying Stop-and-Go Congestion
Edgar Ramirez Sanchez, Shreyaa Raghavan, Cathy Wu
Comments: Presented at NeurIPS 2023 workshops: Tackling Climate Change with Machine Learning & Computational Sustainability
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[262] arXiv:2312.03196 [pdf, html, other]
Title: Domain Invariant Representation Learning and Sleep Dynamics Modeling for Automatic Sleep Staging
Seungyeon Lee, Thai-Hoang Pham, Zhao Cheng, Ping Zhang
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[263] arXiv:2312.03212 [pdf, html, other]
Title: Constrained Bayesian Optimization Under Partial Observations: Balanced Improvements and Provable Convergence
Shengbo Wang, Ke Li
Comments: The full version of our accepted paper in AAAI 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[264] arXiv:2312.03213 [pdf, html, other]
Title: Bootstrap Your Own Variance
Polina Turishcheva, Jason Ramapuram, Sinead Williamson, Dan Busbridge, Eeshan Dhekane, Russ Webb
Journal-ref: NeurIPS 2023 Workshop: Self-Supervised Learning - Theory and Practice
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[265] arXiv:2312.03216 [pdf, html, other]
Title: SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
Eric H. Jiang, Andrew Lizarraga
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[266] arXiv:2312.03218 [pdf, other]
Title: Accelerated Gradient Algorithms with Adaptive Subspace Search for Instance-Faster Optimization
Yuanshi Liu, Hanzhen Zhao, Yang Xu, Pengyun Yue, Cong Fang
Comments: Optimization for Machine Learning
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Machine Learning (stat.ML)
[267] arXiv:2312.03231 [pdf, html, other]
Title: Deep Multimodal Fusion for Surgical Feedback Classification
Rafal Kocielnik, Elyssa Y. Wong, Timothy N. Chu, Lydia Lin, De-An Huang, Jiayun Wang, Anima Anandkumar, Andrew J. Hung
Journal-ref: Published in Proceedings of Machine Learning for Health 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS)
[268] arXiv:2312.03236 [pdf, html, other]
Title: Multicoated and Folded Graph Neural Networks with Strong Lottery Tickets
Jiale Yan, Hiroaki Ito, Ángel López García-Arias, Yasuyuki Okoshi, Hikari Otsuka, Kazushi Kawamura, Thiem Van Chu, Masato Motomura
Comments: 9 pages, accepted in the Second Learning on Graphs Conference (LoG 2023)
Journal-ref: Proceedings of the Second Learning on Graphs Conference (LoG 2023), PMLR 231
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[269] arXiv:2312.03248 [pdf, other]
Title: Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning
Haowen Wang, Tao Sun, Cong Fan, Jinjie Gu
Comments: 22 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[270] arXiv:2312.03253 [pdf, html, other]
Title: Seller-side Outcome Fairness in Online Marketplaces
Zikun Ye, Reza Yousefi Maragheh, Lalitesh Morishetti, Shanu Vashishtha, Jason Cho, Kaushiki Nag, Sushant Kumar, Kannan Achan
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[271] arXiv:2312.03256 [pdf, html, other]
Title: CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models
Hailin Zhang, Zirui Liu, Boxuan Chen, Yikai Zhao, Tong Zhao, Tong Yang, Bin Cui
Subjects: Machine Learning (cs.LG)
[272] arXiv:2312.03259 [pdf, html, other]
Title: f-FERM: A Scalable Framework for Robust Fair Empirical Risk Minimization
Sina Baharlouei, Shivam Patel, Meisam Razaviyayn
Comments: 24 Pages,5 figures
Journal-ref: ICLR 2024
Subjects: Machine Learning (cs.LG)
[273] arXiv:2312.03277 [pdf, html, other]
Title: Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization
Jimmy Li, Igor Kozlov, Di Wu, Xue Liu, Gregory Dudek
Subjects: Machine Learning (cs.LG)
[274] arXiv:2312.03291 [pdf, html, other]
Title: Evaluation of human-model prediction difference on the Internet Scale of Data
Weitang Liu, Ying Wai Li, Yuelei Li, Zihan Wang, Yi-Zhuang You, Jingbo Shang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[275] arXiv:2312.03292 [pdf, html, other]
Title: Enhancing Molecular Property Prediction via Mixture of Collaborative Experts
Xu Yao, Shuang Liang, Songqiao Han, Hailiang Huang
Comments: 11 pages, 8 figures
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Quantitative Methods (q-bio.QM)
[276] arXiv:2312.03309 [pdf, html, other]
Title: Benchmarking Continual Learning from Cognitive Perspectives
Xiaoqian Liu, Junge Zhang, Mingyi Zhang, Peipei Yang
Comments: 12 pages, 11 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[277] arXiv:2312.03318 [pdf, other]
Title: Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift
Saurabh Garg, Amrith Setlur, Zachary Chase Lipton, Sivaraman Balakrishnan, Virginia Smith, Aditi Raghunathan
Comments: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[278] arXiv:2312.03344 [pdf, html, other]
Title: Interpretable Mechanistic Representations for Meal-level Glycemic Control in the Wild
Ke Alexander Wang, Emily B. Fox
Comments: Proceedings of Machine Learning for Health (ML4H) 2023. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS); Applications (stat.AP); Machine Learning (stat.ML)
[279] arXiv:2312.03386 [pdf, other]
Title: An Infinite-Width Analysis on the Jacobian-Regularised Training of a Neural Network
Taeyoung Kim, Hongseok Yang
Comments: Accepted at ICML 2024. 74 pages, 18 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[280] arXiv:2312.03397 [pdf, html, other]
Title: Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning
Sangwoong Yoon, Dohyun Kwon, Himchan Hwang, Yung-Kyun Noh, Frank C. Park
Comments: NeurIPS 2023 Workshop on Diffusion Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[281] arXiv:2312.03413 [pdf, html, other]
Title: Approximating Solutions to the Knapsack Problem using the Lagrangian Dual Framework
Mitchell Keegan, Mahdi Abolghasemi
Journal-ref: Lecture Notes in Computer Science, vol 14471 (2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[282] arXiv:2312.03414 [pdf, other]
Title: Compressed Context Memory For Online Language Model Interaction
Jang-Hyun Kim, Junyoung Yeom, Sangdoo Yun, Hyun Oh Song
Comments: ICLR 2024. Add streaming setting results and training set analyses
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[283] arXiv:2312.03415 [pdf, html, other]
Title: Run LoRA Run: Faster and Lighter LoRA Implementations
Daria Cherniuk, Aleksandr Mikhalev, Ivan Oseledets
Subjects: Machine Learning (cs.LG)
[284] arXiv:2312.03464 [pdf, other]
Title: Subnetwork-to-go: Elastic Neural Network with Dynamic Training and Customizable Inference
Kai Li, Yi Luo
Comments: 5 pages, 3 figures
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[285] arXiv:2312.03466 [pdf, other]
Title: Search Strategies for Self-driving Laboratories with Pending Experiments
Hao Wen, Jakob Zeitler, Connor Rupnow
Comments: Accepted at NeurIPS 2023, AI4Mat
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[286] arXiv:2312.03475 [pdf, html, other]
Title: Molecule Joint Auto-Encoding: Trajectory Pretraining with 2D and 3D Diffusion
Weitao Du, Jiujiu Chen, Xuecang Zhang, Zhiming Ma, Shengchao Liu
Comments: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM)
[287] arXiv:2312.03491 [pdf, html, other]
Title: Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
Zehua Chen, Guande He, Kaiwen Zheng, Xu Tan, Jun Zhu
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[288] arXiv:2312.03492 [pdf, html, other]
Title: Learning From Scenarios for Stochastic Repairable Scheduling
Kim van den Houten, David M.J. Tax, Esteban Freydell, Mathijs de Weerdt
Comments: 8 pages, updated according to camera-ready version CPAIOR'24
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[289] arXiv:2312.03510 [pdf, other]
Title: Towards Sobolev Pruning
Neil Kichler, Sher Afghan, Uwe Naumann
Comments: 11 pages
Journal-ref: Proceedings of the Platform for Advanced Scientific Computing Conference PASC24 (2024)
Subjects: Machine Learning (cs.LG); Computational Finance (q-fin.CP)
[290] arXiv:2312.03612 [pdf, html, other]
Title: Physical Symbolic Optimization
Wassim Tenachi, Rodrigo Ibata, Foivos I. Diakogiannis
Comments: 6 pages, 2 figures, 1 table. Accepted to NeurIPS 2023, Machine Learning for Physical Sciences workshop
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); Symbolic Computation (cs.SC); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
[291] arXiv:2312.03642 [pdf, html, other]
Title: Transformer-Powered Surrogates Close the ICF Simulation-Experiment Gap with Extremely Limited Data
Matthew L. Olson, Shusen Liu, Jayaraman J. Thiagarajan, Bogdan Kustowski, Weng-Keen Wong, Rushil Anirudh
Comments: MLST
Subjects: Machine Learning (cs.LG)
[292] arXiv:2312.03644 [pdf, html, other]
Title: MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment
Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang
Comments: 16 pages, 4 figures
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[293] arXiv:2312.03656 [pdf, html, other]
Title: Interpretability Illusions in the Generalization of Simplified Models
Dan Friedman, Andrew Lampinen, Lucas Dixon, Danqi Chen, Asma Ghandeharioun
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[294] arXiv:2312.03675 [pdf, other]
Title: GeoShapley: A Game Theory Approach to Measuring Spatial Effects in Machine Learning Models
Ziqi Li
Comments: 30 pages, 10 figures, 6 tables
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[295] arXiv:2312.03682 [pdf, html, other]
Title: What Planning Problems Can A Relational Neural Network Solve?
Jiayuan Mao, Tomás Lozano-Pérez, Joshua B. Tenenbaum, Leslie Pack Kaelbling
Comments: NeurIPS 2023 (Spotlight). Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[296] arXiv:2312.03691 [pdf, other]
Title: On the Role of Edge Dependency in Graph Generative Models
Sudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos, Charalampos Tsourakakis
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[297] arXiv:2312.03762 [pdf, html, other]
Title: Colour versus Shape Goal Misgeneralization in Reinforcement Learning: A Case Study
Karolis Ramanauskas, Özgür Şimşek
Comments: ATTRIB: Workshop on Attributing Model Behavior at Scale at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[298] arXiv:2312.03764 [pdf, other]
Title: Similarity-based Knowledge Transfer for Cross-Domain Reinforcement Learning
Sergio A. Serrano, Jose Martinez-Carranza, L. Enrique Sucar
Comments: 30 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[299] arXiv:2312.03780 [pdf, other]
Title: Predicting the Transportation Activities of Construction Waste Hauling Trucks: An Input-Output Hidden Markov Approach
Hongtai Yang, Boyi Lei, Ke Han, Luna Liu
Comments: 21 pages, 8 figures
Subjects: Machine Learning (cs.LG)
[300] arXiv:2312.03788 [pdf, html, other]
Title: SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM
Jiayi Pan, Chengcan Wang, Kaifu Zheng, Yangguang Li, Zhenyu Wang, Bin Feng
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[301] arXiv:2312.03796 [pdf, html, other]
Title: Multi-Scale and Multi-Modal Contrastive Learning Network for Biomedical Time Series
Hongbo Guo, Xinzi Xu, Hao Wu, Guoxing Wang
Comments: 4 pages, 3 figures, submitted to ICASSP 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[302] arXiv:2312.03801 [pdf, html, other]
Title: Generalization to New Sequential Decision Making Tasks with In-Context Learning
Sharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[303] arXiv:2312.03814 [pdf, html, other]
Title: Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu
Journal-ref: Journal of Machine Learning Research, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[304] arXiv:2312.03865 [pdf, html, other]
Title: Learning Genomic Sequence Representations using Graph Neural Networks over De Bruijn Graphs
Kacper Kapuśniak, Manuel Burger, Gunnar Rätsch, Amir Joudaki
Comments: Poster at "NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)"
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN)
[305] arXiv:2312.03867 [pdf, html, other]
Title: Multi-Group Fairness Evaluation via Conditional Value-at-Risk Testing
Lucas Monteiro Paes, Ananda Theertha Suresh, Alex Beutel, Flavio P. Calmon, Ahmad Beirami
Comments: Accepted for publication in the IEEE Journal on Selected Areas in Information Theory (JSAIT)
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Information Theory (cs.IT); Machine Learning (stat.ML)
[306] arXiv:2312.03878 [pdf, html, other]
Title: Domain constraints improve risk prediction when outcome data is missing
Sidhika Balachandar, Nikhil Garg, Emma Pierson
Comments: Published at ICLR 2024
Subjects: Machine Learning (cs.LG)
[307] arXiv:2312.03881 [pdf, other]
Title: FoMo Rewards: Can we cast foundation models as reward functions?
Ekdeep Singh Lubana, Johann Brehmer, Pim de Haan, Taco Cohen
Comments: Accepted to NeurIPS FMDM workshop
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[308] arXiv:2312.03885 [pdf, html, other]
Title: Gathering and Exploiting Higher-Order Information when Training Large Structured Models
Pierre Wolinski
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[309] arXiv:2312.03886 [pdf, html, other]
Title: On The Fairness Impacts of Hardware Selection in Machine Learning
Sree Harsha Nelaturu, Nishaanth Kanna Ravichandran, Cuong Tran, Sara Hooker, Ferdinando Fioretto
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[310] arXiv:2312.03889 [pdf, html, other]
Title: A Masked Pruning Approach for Dimensionality Reduction in Communication-Efficient Federated Learning Systems
Tamir L.S. Gez, Kobi Cohen
Comments: 12 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[311] arXiv:2312.03903 [pdf, html, other]
Title: Adaptive Dependency Learning Graph Neural Networks
Abishek Sriramulu, Nicolas Fourrier, Christoph Bergmeir
Journal-ref: Information Sciences, 625, 700-714 (2023)
Subjects: Machine Learning (cs.LG)
[312] arXiv:2312.03905 [pdf, other]
Title: A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints
Kareem Ahmed, Kai-Wei Chang, Guy Van den Broeck
Comments: Updated detoxification experiments; moved example toxic generations to Github and added link
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[313] arXiv:2312.03911 [pdf, html, other]
Title: Improving Gradient-guided Nested Sampling for Posterior Inference
Pablo Lemos, Nikolay Malkin, Will Handley, Yoshua Bengio, Yashar Hezaveh, Laurence Perreault-Levasseur
Comments: 10 pages, 5 figures. Code available at this https URL
Subjects: Machine Learning (cs.LG); Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
[314] arXiv:2312.03928 [pdf, html, other]
Title: Adaptive Weighted Co-Learning for Cross-Domain Few-Shot Learning
Abdullah Alchihabi, Marzi Heidari, Yuhong Guo
Subjects: Machine Learning (cs.LG)
[315] arXiv:2312.03951 [pdf, other]
Title: Understanding the Role of Optimization in Double Descent
Chris Yuhao Liu, Jeffrey Flanigan
Comments: NeurIPS Workshop 2023 Optimization for Machine Learning
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[316] arXiv:2312.03979 [pdf, other]
Title: Node-aware Bi-smoothing: Certified Robustness against Graph Injection Attacks
Yuni Lai, Yulin Zhu, Bailin Pan, Kai Zhou
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[317] arXiv:2312.03989 [pdf, other]
Title: Rapid detection of rare events from in situ X-ray diffraction data using machine learning
Weijian Zheng, Jun-Sang Park, Peter Kenesei, Ahsan Ali, Zhengchun Liu, Ian T. Foster, Nicholas Schwarz, Rajkumar Kettimuthu, Antonino Miceli, Hemant Sharma
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci); Image and Video Processing (eess.IV); Data Analysis, Statistics and Probability (physics.data-an)
[318] arXiv:2312.03991 [pdf, html, other]
Title: MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu, Xiao-Hu Zhou, Guotao Li, Hao Li, Mei-Jiang Gui, Tian-Yu Xiang, De-Xing Huang, Zeng-Guang Hou
Comments: Accepted by IJCAI 2024 (the 33rd International Joint Conference on Artificial Intelligence)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[319] arXiv:2312.03998 [pdf, other]
Title: Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification
Navid Mohammadi Foumani, Chang Wei Tan, Geoffrey I. Webb, Hamid Rezatofighi, Mahsa Salehi
Subjects: Machine Learning (cs.LG)
[320] arXiv:2312.04000 [pdf, other]
Title: LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures
Vimal Thilak, Chen Huang, Omid Saremi, Laurent Dinh, Hanlin Goh, Preetum Nakkiran, Joshua M. Susskind, Etai Littwin
Comments: Technical report
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[321] arXiv:2312.04024 [pdf, html, other]
Title: k* Distribution: Evaluating the Latent Space of Deep Neural Networks using Local Neighborhood Analysis
Shashank Kotyan, Tatsuya Ueda, Danilo Vasconcellos Vargas
Comments: Published in IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[322] arXiv:2312.04027 [pdf, other]
Title: The sample complexity of multi-distribution learning
Binghui Peng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[323] arXiv:2312.04038 [pdf, html, other]
Title: Reconstruction of dynamical systems from data without time labels
Zhijun Zeng, Pipi Hu, Chenglong Bao, Yi Zhu, Zuoqiang Shi
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS); Numerical Analysis (math.NA)
[324] arXiv:2312.04055 [pdf, other]
Title: Jointly spatial-temporal representation learning for individual trajectories
Fei Huang, Jianrong Lv, Yang Yue
Comments: 27 pages, 3 tables, 7 figures
Journal-ref: Computers, Environment and Urban Systems, 112(2024)
Subjects: Machine Learning (cs.LG)
[325] arXiv:2312.04065 [pdf, html, other]
Title: A Robust and Efficient Boundary Point Detection Method by Measuring Local Direction Dispersion
Dehua Peng, Zhipeng Gui, Jie Gui, Huayi Wu
Comments: 14 pages, 12 figures, 3 tables
Subjects: Machine Learning (cs.LG)
[326] arXiv:2312.04067 [pdf, other]
Title: MeanCut: A Greedy-Optimized Graph Clustering via Path-based Similarity and Degree Descent Criterion
Dehua Peng, Zhipeng Gui, Huayi Wu
Comments: 17 pages, 8 figures, 6 tables
Subjects: Machine Learning (cs.LG)
[327] arXiv:2312.04070 [pdf, html, other]
Title: A Transformer Model for Symbolic Regression towards Scientific Discovery
Florian Lalande, Yoshitomo Matsubara, Naoya Chiba, Tatsunori Taniai, Ryo Igarashi, Yoshitaka Ushiku
Comments: Accepted for oral presentation at NeurIPS2023 AI4Science Workshop. OpenReview: this https URL
Journal-ref: NeurIPS 2023 AI for Science Workshop
Subjects: Machine Learning (cs.LG)
[328] arXiv:2312.04083 [pdf, other]
Title: On the adaptation of in-context learners for system identification
Dario Piga, Filippo Pura, Marco Forgione
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[329] arXiv:2312.04095 [pdf, other]
Title: Learn to Unlearn for Deep Neural Networks: Minimizing Unlearning Interference with Gradient Projection
Tuan Hoang, Santu Rana, Sunil Gupta, Svetha Venkatesh
Comments: Accepted to WACV 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2312.04111 [pdf, html, other]
Title: Breaking the Entanglement of Homophily and Heterophily in Semi-supervised Node Classification
Henan Sun, Xunkai Li, Zhengyu Wu, Daohan Su, Rong-Hua Li, Guoren Wang
Comments: Accepted by ICDE 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[331] arXiv:2312.04142 [pdf, html, other]
Title: TimeDRL: Disentangled Representation Learning for Multivariate Time-Series
Ching Chang, Chiao-Tung Chan, Wei-Yao Wang, Wen-Chih Peng, Tien-Fu Chen
Comments: This paper has been accepted by the International Conference on Data Engineering (ICDE) 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[332] arXiv:2312.04166 [pdf, html, other]
Title: Improving Communication Efficiency of Federated Distillation via Accumulating Local Updates
Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Tian Wen, Wen Wang
Comments: 2 pages, 3 figures
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[333] arXiv:2312.04167 [pdf, other]
Title: Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
Xiaoyu Lin, Laurent Girin, Xavier Alameda-Pineda
Comments: arXiv admin note: substantial text overlap with arXiv:2202.09315
Subjects: Machine Learning (cs.LG)
[334] arXiv:2312.04171 [pdf, html, other]
Title: A novel feature selection framework for incomplete data
Cong Guo
Subjects: Machine Learning (cs.LG)
[335] arXiv:2312.04209 [pdf, html, other]
Title: Constrained Hierarchical Clustering via Graph Coarsening and Optimal Cuts
Eliabelle Mauduit, Andrea Simonetto
Comments: 5 pages, appeared at the Asilomar Conference on Signals, Systems, and Computer, 11/2023
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[336] arXiv:2312.04216 [pdf, other]
Title: CODEX: A Cluster-Based Method for Explainable Reinforcement Learning
Timothy K. Mathes, Jessica Inman, Andrés Colón, Simon Khan
Comments: Presented at the International Joint Conference on Artificial Intelligence (IJCAI) 2023 Workshop on Explainable Artificial Intelligence (XAI)
Subjects: Machine Learning (cs.LG)
[337] arXiv:2312.04234 [pdf, html, other]
Title: Graph Convolutions Enrich the Self-Attention in Transformers!
Jeongwhan Choi, Hyowon Wi, Jayoung Kim, Yehjin Shin, Kookjin Lee, Nathaniel Trask, Noseong Park
Comments: Accepted to NeurIPS 2024. Jeongwhan Choi and Hyowon Wi are co-first authors with equal contributions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[338] arXiv:2312.04273 [pdf, html, other]
Title: Invariant Random Forest: Tree-Based Model Solution for OOD Generalization
Yufan Liao, Qi Wu, Xing Yan
Comments: AAAI Conference on Artificial Intelligence, 2024 (Oral Presentation)
Subjects: Machine Learning (cs.LG)
[339] arXiv:2312.04275 [pdf, other]
Title: Estimating Countries with Similar Maternal Mortality Rate using Cluster Analysis and Pairing Countries with Identical MMR
S. Nandini, Sanjjushri Varshini R
Comments: 14 pages, 6 figures
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[340] arXiv:2312.04307 [pdf, other]
Title: A Structural-Clustering Based Active Learning for Graph Neural Networks
Ricky Maulana Fajri, Yulong Pei, Lu Yin, Mykola Pechenizkiy
Subjects: Machine Learning (cs.LG)
[341] arXiv:2312.04311 [pdf, other]
Title: Finding Interpretable Class-Specific Patterns through Efficient Neural Search
Nils Philipp Walter, Jonas Fischer, Jilles Vreeken
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[342] arXiv:2312.04327 [pdf, other]
Title: Learning to sample in Cartesian MRI
Thomas Sanchez
Comments: PhD Thesis; 198 pages
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[343] arXiv:2312.04330 [pdf, other]
Title: Surrogate Modelling for Sea Ice Concentration using Lightweight Neural Ensemble
Julia Borisova, Nikolay O. Nikitin
Comments: 7 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Atmospheric and Oceanic Physics (physics.ao-ph)
[344] arXiv:2312.04339 [pdf, html, other]
Title: Merging by Matching Models in Task Parameter Subspaces
Derek Tam, Mohit Bansal, Colin Raffel
Comments: TMLR
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[345] arXiv:2312.04343 [pdf, html, other]
Title: Causality and Explainability for Trustworthy Integrated Pest Management
Ilias Tsoumas, Vasileios Sitokonstantinou, Georgios Giannarakis, Evagelia Lampiri, Christos Athanassiou, Gustau Camps-Valls, Charalampos Kontoes, Ioannis Athanasiadis
Comments: Accepted at NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning: Blending New and Existing Knowledge Systems
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[346] arXiv:2312.04346 [pdf, html, other]
Title: Detection and Imputation based Two-Stage Denoising Diffusion Power System Measurement Recovery under Cyber-Physical Uncertainties
Jianhua Pei, Jingyu Wang, Dongyuan Shi, Ping Wang
Journal-ref: IEEE Transactions on Smart Grid, vol. 15, no. 6, pp. 5965-5980, Nov. 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[347] arXiv:2312.04386 [pdf, html, other]
Title: Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization
Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters
Comments: arXiv admin note: substantial text overlap with arXiv:2302.12526
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[348] arXiv:2312.04404 [pdf, html, other]
Title: On the Impact of Multi-dimensional Local Differential Privacy on Fairness
Karima Makhlouf, Heber H. Arcolezi, Sami Zhioua, Ghassen Ben Brahim, Catuscia Palamidessi
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[349] arXiv:2312.04416 [pdf, html, other]
Title: Monitoring Sustainable Global Development Along Shared Socioeconomic Pathways
Michelle W.L. Wan, Jeffrey N. Clark, Edward A. Small, Elena Fillola Mayoral, Raúl Santos-Rodríguez
Comments: 5 pages, 1 figure. Presented at NeurIPS 2023 Workshop: Tackling Climate Change with Machine Learning
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[350] arXiv:2312.04464 [pdf, other]
Title: Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation
Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[351] arXiv:2312.04469 [pdf, html, other]
Title: On the Learnability of Watermarks for Language Models
Chenchen Gu, Xiang Lisa Li, Percy Liang, Tatsunori Hashimoto
Comments: Accepted at ICLR 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[352] arXiv:2312.04501 [pdf, html, other]
Title: Graph Metanetworks for Processing Diverse Neural Architectures
Derek Lim, Haggai Maron, Marc T. Law, Jonathan Lorraine, James Lucas
Comments: 29 pages. v2 updated experimental results and details
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[353] arXiv:2312.04504 [pdf, html, other]
Title: Coordination-free Decentralised Federated Learning on Complex Networks: Overcoming Heterogeneity
Lorenzo Valerio, Chiara Boldrini, Andrea Passarella, János Kertész, Márton Karsai, Gerardo Iñiguez
Comments: Supported by the H2020 HumaneAI Net (#952026), H2020 INFRAIA-01-2018-2019 SoBigData++ (#871042), and by the CHIST-ERA-19-XAI010 SAI projects, FWF (grant No. I 5205). Also funded by PNRR MUR Partenariato Esteso PE00000013 FAIR, PNRR MUR Partenariato Esteso PE00000001 - "RESTART"
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Social and Information Networks (cs.SI)
[354] arXiv:2312.04528 [pdf, html, other]
Title: Using Large Language Models for Hyperparameter Optimization
Michael R. Zhang, Nishkrit Desai, Juhan Bae, Jonathan Lorraine, Jimmy Ba
Comments: 28 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[355] arXiv:2312.04535 [pdf, html, other]
Title: Trajeglish: Traffic Modeling as Next-Token Prediction
Jonah Philion, Xue Bin Peng, Sanja Fidler
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[356] arXiv:2312.04540 [pdf, html, other]
Title: Sim-to-Real Causal Transfer: A Metric Learning Approach to Causally-Aware Interaction Representations
Ahmad Rahimi, Po-Chien Luan, Yuejiang Liu, Frano Rajič, Alexandre Alahi
Comments: CVPR 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA); Robotics (cs.RO)
[357] arXiv:2312.04546 [pdf, html, other]
Title: Adversarial Learning for Feature Shift Detection and Correction
Miriam Barrabes, Daniel Mas Montserrat, Margarita Geleta, Xavier Giro-i-Nieto, Alexander G. Ioannidis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Machine Learning (stat.ML)
[358] arXiv:2312.04574 [pdf, other]
Title: Differentiable Visual Computing for Inverse Problems and Machine Learning
Andrew Spielberg, Fangcheng Zhong, Konstantinos Rematas, Krishna Murthy Jatavallabhula, Cengiz Oztireli, Tzu-Mao Li, Derek Nowrouzezahrai
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Graphics (cs.GR); Neural and Evolutionary Computing (cs.NE)
[359] arXiv:2312.04595 [pdf, other]
Title: Evaluating The Accuracy of Classification Algorithms for Detecting Heart Disease Risk
Alhaam Alariyibi, Mohamed El-Jarai, Abdelsalam Maatuk
Journal-ref: Machine Learning and Applications: An International Journal (MLAIJ) Vol.10, No.4, December 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[360] arXiv:2312.04604 [pdf, html, other]
Title: Transferable Candidate Proposal with Bounded Uncertainty
Kyeongryeol Go, Kye-Hyeon Kim
Comments: Accepted in NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World
Subjects: Machine Learning (cs.LG)
[361] arXiv:2312.04606 [pdf, html, other]
Title: Urban Region Representation Learning with Attentive Fusion
Fengze Sun, Jianzhong Qi, Yanchuan Chang, Xiaoliang Fan, Shanika Karunasekera, Egemen Tanin
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[362] arXiv:2312.04609 [pdf, html, other]
Title: Short-term prediction of construction waste transport activities using AI-Truck
Meng Xu, Ke Han
Comments: 11 pages, 15 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[363] arXiv:2312.04610 [pdf, other]
Title: Data-Driven Semi-Supervised Machine Learning with Safety Indicators for Abnormal Driving Behavior Detection
Yongqi Dong, Lanxin Zhang, Haneen Farah, Arkady Zgonnikov, Bart van Arem
Comments: 16 pages, 10 figures, accepted by the 103rd Transportation Research Board (TRB) Annual Meeting, accepted and published by Transportation Research Record: Journal of the Transportation Research Board
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP); Other Statistics (stat.OT)
[364] arXiv:2312.04615 [pdf, html, other]
Title: Relational Deep Learning: Graph Representation Learning on Relational Databases
Matthias Fey, Weihua Hu, Kexin Huang, Jan Eric Lenssen, Rishabh Ranjan, Joshua Robinson, Rex Ying, Jiaxuan You, Jure Leskovec
Comments: this https URL
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[365] arXiv:2312.04653 [pdf, html, other]
Title: Learning Thresholds with Latent Values and Censored Feedback
Jiahao Zhang, Tao Lin, Weiqiang Zheng, Zhe Feng, Yifeng Teng, Xiaotie Deng
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
[366] arXiv:2312.04658 [pdf, html, other]
Title: PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction
Apoorva Sharma, Sushant Veer, Asher Hancock, Heng Yang, Marco Pavone, Anirudha Majumdar
Comments: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[367] arXiv:2312.04675 [pdf, html, other]
Title: Reverse Engineering Deep ReLU Networks An Optimization-based Algorithm
Mehrab Hamidi
Comments: 9 pages, 2 supplementary pages
Subjects: Machine Learning (cs.LG)
[368] arXiv:2312.04688 [pdf, html, other]
Title: Federated Learning for 6G: Paradigms, Taxonomy, Recent Advances and Insights
Maryam Ben Driss, Essaid Sabir, Halima Elbiaze, Walid Saad
Comments: 32 pages, 7 figures; 9 Tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)
[369] arXiv:2312.04693 [pdf, html, other]
Title: GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts
Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec
Comments: Graph Neural Networks, Mixture-of-experts, Distribution Shifts, Generalization
Subjects: Machine Learning (cs.LG)
[370] arXiv:2312.04709 [pdf, html, other]
Title: How to guess a gradient
Utkarsh Singhal, Brian Cheung, Kartik Chandra, Jonathan Ragan-Kelley, Joshua B. Tenenbaum, Tomaso A. Poggio, Stella X. Yu
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[371] arXiv:2312.04712 [pdf, html, other]
Title: Error Discovery by Clustering Influence Embeddings
Fulton Wang, Julius Adebayo, Sarah Tan, Diego Garcia-Olano, Narine Kokhlikyan
Comments: NeuRIPs 2023 conference paper
Subjects: Machine Learning (cs.LG)
[372] arXiv:2312.04719 [pdf, html, other]
Title: Distributed Optimization via Kernelized Multi-armed Bandits
Ayush Rai, Shaoshuai Mou
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Optimization and Control (math.OC)
[373] arXiv:2312.04737 [pdf, html, other]
Title: Efficient End-to-end Language Model Fine-tuning on Graphs
Rui Xue, Xipeng Shen, Ruozhou Yu, Xiaorui Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[374] arXiv:2312.04740 [pdf, html, other]
Title: Train 'n Trade: Foundations of Parameter Markets
Tzu-Heng Huang, Harit Vishwakarma, Frederic Sala
Comments: accepted at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[375] arXiv:2312.04752 [pdf, html, other]
Title: A Test-Time Learning Approach to Reparameterize the Geophysical Inverse Problem with a Convolutional Neural Network
Anran Xu, Lindsey J. Heagy
Subjects: Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[376] arXiv:2312.04762 [pdf, other]
Title: The Graph Lottery Ticket Hypothesis: Finding Sparse, Informative Graph Structure
Anton Tsitsulin, Bryan Perozzi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[377] arXiv:2312.04815 [pdf, html, other]
Title: Not All Negatives Are Worth Attending to: Meta-Bootstrapping Negative Sampling Framework for Link Prediction
Yakun Wang, Binbin Hu, Shuo Yang, Meiqi Zhu, Zhiqiang Zhang, Qiyang Zhang, Jun Zhou, Guo Ye, Huimei He
Subjects: Machine Learning (cs.LG)
[378] arXiv:2312.04862 [pdf, html, other]
Title: Damage GAN: A Generative Model for Imbalanced Data
Ali Anaissi, Yuanzhe Jia, Ali Braytee, Mohamad Naji, Widad Alyassine
Comments: Accepted by AusDM 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[379] arXiv:2312.04865 [pdf, html, other]
Title: StructComp: Substituting Propagation with Structural Compression in Training Graph Contrastive Learning
Shengzhong Zhang, Wenjie Yang, Xinyuan Cao, Hongwei Zhang, Zengfeng Huang
Comments: Accepted by ICLR 2024
Subjects: Machine Learning (cs.LG)
[380] arXiv:2312.04879 [pdf, other]
Title: HC-Ref: Hierarchical Constrained Refinement for Robust Adversarial Training of GNNs
Xiaobing Pei, Haoran Yang, Gang Shen
Subjects: Machine Learning (cs.LG)
[381] arXiv:2312.04883 [pdf, html, other]
Title: Understanding Community Bias Amplification in Graph Representation Learning
Shengzhong Zhang, Wenjie Yang, Yimin Zhang, Hongwei Zhang, Divin Yan, Zengfeng Huang
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[382] arXiv:2312.04905 [pdf, other]
Title: Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[383] arXiv:2312.04911 [pdf, html, other]
Title: Collinear datasets augmentation using Procrustes validation sets
Sergey Kucheryavskiy, Sergei Zhilin
Comments: 21 pages, 7 figures
Subjects: Machine Learning (cs.LG)
[384] arXiv:2312.04916 [pdf, html, other]
Title: EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
Yanxi Chen, Xuchen Pan, Yaliang Li, Bolin Ding, Jingren Zhou
Comments: ICML 2024 camera-ready version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[385] arXiv:2312.04918 [pdf, html, other]
Title: Pruning Convolutional Filters via Reinforcement Learning with Entropy Minimization
Bogdan Musat, Razvan Andonie
Subjects: Machine Learning (cs.LG)
[386] arXiv:2312.04985 [pdf, html, other]
Title: SparQ Attention: Bandwidth-Efficient LLM Inference
Luka Ribar, Ivan Chelombiev, Luke Hudlass-Galley, Charlie Blake, Carlo Luschi, Douglas Orr
Subjects: Machine Learning (cs.LG)
[387] arXiv:2312.04992 [pdf, html, other]
Title: PFLlib: A Beginner-Friendly and Comprehensive Personalized Federated Learning Library and Benchmark
Jianqing Zhang, Yang Liu, Yang Hua, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Jian Cao
Comments: Accepted by JMLR
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[388] arXiv:2312.05021 [pdf, html, other]
Title: A Negative Result on Gradient Matching for Selective Backprop
Lukas Balles, Cedric Archambeau, Giovanni Zappella
Comments: Paper accepted at the ICBINB Workshop at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[389] arXiv:2312.05044 [pdf, html, other]
Title: Backward Learning for Goal-Conditioned Policies
Marc Höftmann, Jan Robine, Stefan Harmeling
Comments: World Models, Goal-conditioned, Reward-free, Workshop on Goal-Conditioned Reinforcement Learning - NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[390] arXiv:2312.05134 [pdf, html, other]
Title: Optimal Multi-Distribution Learning
Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[391] arXiv:2312.05140 [pdf, html, other]
Title: Membership Inference Attacks on Diffusion Models via Quantile Regression
Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[392] arXiv:2312.05185 [pdf, html, other]
Title: AI Competitions and Benchmarks: Competition platforms
Andrey Ustyuzhanin, Harald Carlens
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[393] arXiv:2312.05195 [pdf, html, other]
Title: Conformal Prediction in Multi-User Settings: An Evaluation
Enrique Garcia-Ceja, Luciano Garcia-Banuelos, Nicolas Jourdan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[394] arXiv:2312.05225 [pdf, html, other]
Title: Neural Spectral Methods: Self-supervised learning in the spectral domain
Yiheng Du, Nithin Chalapathi, Aditi Krishnapriyan
Comments: Accepted to International Conference on Learning Representations (ICLR) 2024
Subjects: Machine Learning (cs.LG)
[395] arXiv:2312.05231 [pdf, other]
Title: Modeling Risk in Reinforcement Learning: A Literature Mapping
Leonardo Villalobos-Arias, Derek Martin, Abhijeet Krishnan, Madeleine Gagné, Colin M. Potts, Arnav Jhala
Comments: 36 pages, 8 figures, Submitted to Artificial Intelligence Reviews
Subjects: Machine Learning (cs.LG)
[396] arXiv:2312.05250 [pdf, html, other]
Title: TaskMet: Task-Driven Metric Learning for Model Learning
Dishank Bansal, Ricky T. Q. Chen, Mustafa Mukadam, Brandon Amos
Comments: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[397] arXiv:2312.05253 [pdf, other]
Title: DiSK: A Diffusion Model for Structured Knowledge
Ouail Kitouni, Niklas Nolte, James Hensman, Bhaskar Mitra
Comments: 24 pages, 12 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[398] arXiv:2312.05274 [pdf, html, other]
Title: Two Simple Principles for Diffusion-Based Test-Time Adaptation
Kaiyu Song, Hanjiang Lai, Yan Pan, Kun Yue, Jian Yin
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2312.05282 [pdf, html, other]
Title: Towards On-device Learning on the Edge: Ways to Select Neurons to Update under a Budget Constraint
Aël Quélennec, Enzo Tartaglione, Pavlo Mozharovskyi, Van-Tam Nguyen
Comments: 8 pages, 4 figures, 2 tables, WACV2024 - SCIoT workshop
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[400] arXiv:2312.05296 [pdf, other]
Title: AI Competitions and Benchmarks: The life cycle of challenges and benchmarks
Gustavo Stolovitzky, Julio Saez-Rodriguez, Julie Bletz, Jacob Albrecht, Gaia Andreoletti, James C. Costello, Paul Boutros
Subjects: Machine Learning (cs.LG)
[401] arXiv:2312.05299 [pdf, other]
Title: Learning to be Simple
Yang-Hui He, Vishnu Jejjala, Challenger Mishra, Em Sharnoff
Comments: 25 pages, 6 figures and 5 tables
Subjects: Machine Learning (cs.LG); High Energy Physics - Theory (hep-th); Mathematical Physics (math-ph); Group Theory (math.GR)
[402] arXiv:2312.05327 [pdf, html, other]
Title: Better, Not Just More: Data-Centric Machine Learning for Earth Observation
Ribana Roscher, Marc Rußwurm, Caroline Gevaert, Michael Kampffmeyer, Jefersson A. dos Santos, Maria Vakalopoulou, Ronny Hänsch, Stine Hansen, Keiller Nogueira, Jonathan Prexl, Devis Tuia
Journal-ref: IEEE Geoscience and Remote Sensing Magazine, vol. 12, no. 4, pp. 335-355, Dec. 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[403] arXiv:2312.05333 [pdf, html, other]
Title: A Data-Driven Framework for Improving Public EV Charging Infrastructure: Modeling and Forecasting
Nassr Al-Dahabreh, Mohammad Ali Sayed, Khaled Sarieddine, Mohamed Elhattab, Maurice Khabbaz, Ribal Atallah, Chadi Assi
Comments: Accepted for Publication in IEEE Transactions on Intelligent Transportation Systems
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[404] arXiv:2312.05337 [pdf, html, other]
Title: Artificial Neural Nets and the Representation of Human Concepts
Timo Freiesleben
Comments: For: Philosophy of Science for Machine Learning: Core Issues and New Perspectives, edited by Juan Duran and Giorgia Pozzi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[405] arXiv:2312.05355 [pdf, html, other]
Title: Neither hype nor gloom do DNNs justice
Felix A. Wichmann, Simon Kornblith, Robert Geirhos
Comments: Preprint version of a commentary published by Behavioral and Brain Sciences (this https URL)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[406] arXiv:2312.05359 [pdf, html, other]
Title: Learning 3D Particle-based Simulators from RGB-D Videos
William F. Whitney, Tatiana Lopez-Guevara, Tobias Pfaff, Yulia Rubanova, Thomas Kipf, Kimberly Stachenfeld, Kelsey R. Allen
Subjects: Machine Learning (cs.LG)
[407] arXiv:2312.05386 [pdf, html, other]
Title: Model Extraction Attacks Revisited
Jiacheng Liang, Ren Pang, Changjiang Li, Ting Wang
Comments: Under Review
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[408] arXiv:2312.05387 [pdf, html, other]
Title: Cross Domain Generative Augmentation: Domain Generalization with Latent Diffusion Models
Sobhan Hemati, Mahdi Beitollahi, Amir Hossein Estiri, Bassel Al Omari, Xi Chen, Guojun Zhang
Subjects: Machine Learning (cs.LG)
[409] arXiv:2312.05397 [pdf, html, other]
Title: On the Performance of Temporal Difference Learning With Neural Networks
Haoxing Tian, Ioannis Ch. Paschalidis, Alex Olshevsky
Subjects: Machine Learning (cs.LG)
[410] arXiv:2312.05404 [pdf, html, other]
Title: Disentangled Latent Representation Learning for Tackling the Confounding M-Bias Problem in Causal Inference
Debo Cheng (1), Yang Xie (2), Ziqi Xu (1), Jiuyong Li (1), Lin Liu (1), Jixue Liu (1), Yinghao Zhang (2), Zaiwen Feng (2) ((1) UniSA STEM, University of South Australia, Adelaide, Australia and (2) College of Informatics, Huazhong Agricultural University, Wuhan, China)
Comments: 10 pages, 3 figures and 5 tables. Accepted by ICDM2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[411] arXiv:2312.05405 [pdf, html, other]
Title: Guaranteed Trust Region Optimization via Two-Phase KL Penalization
K.R. Zentner, Ujjwal Puri, Zhehui Huang, Gaurav S. Sukhatme
Subjects: Machine Learning (cs.LG)
[412] arXiv:2312.05409 [pdf, html, other]
Title: Large-scale Training of Foundation Models for Wearable Biosignals
Salar Abbaspourazad, Oussama Elachqar, Andrew C. Miller, Saba Emrani, Udhyakumar Nallasamy, Ian Shapiro
Comments: Camera ready version for ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[413] arXiv:2312.05410 [pdf, html, other]
Title: Rethinking materials simulations: Blending direct numerical simulations with neural operators
Vivek Oommen, Khemraj Shukla, Saaketh Desai, Remi Dingreville, George Em Karniadakis
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[414] arXiv:2312.05412 [pdf, html, other]
Title: CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling
Ruihan Yang, Hannes Gamper, Sebastian Braun
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[415] arXiv:2312.05429 [pdf, other]
Title: Mitigating Nonlinear Algorithmic Bias in Binary Classification
Wendy Hui, Wai Kwong Lau
Comments: 5 pages, 3 figures, 12 tables. arXiv admin note: text overlap with arXiv:2310.12421
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Applications (stat.AP)
[416] arXiv:2312.05432 [pdf, html, other]
Title: Fusing Multiple Algorithms for Heterogeneous Online Learning
Darshan Gadginmath, Shivanshu Tripathi, Fabio Pasqualetti
Comments: 13 pages, 3 figures
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[417] arXiv:2312.05440 [pdf, html, other]
Title: Consistency Models for Scalable and Fast Simulation-Based Inference
Marvin Schmitt, Valentin Pratz, Ullrich Köthe, Paul-Christian Bürkner, Stefan T Radev
Journal-ref: Neural Information Processing Systems (NeurIPS 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[418] arXiv:2312.05456 [pdf, html, other]
Title: On the calibration of compartmental epidemiological models
Nikunj Gupta, Anh Mai, Azza Abouzied, Dennis Shasha
Subjects: Machine Learning (cs.LG); Physics and Society (physics.soc-ph); Populations and Evolution (q-bio.PE)
[419] arXiv:2312.05461 [pdf, html, other]
Title: STREAMLINE: An Automated Machine Learning Pipeline for Biomedicine Applied to Examine the Utility of Photography-Based Phenotypes for OSA Prediction Across International Sleep Centers
Ryan J. Urbanowicz, Harsh Bandhey, Brendan T. Keenan, Greg Maislin, Sy Hwang, Danielle L. Mowery, Shannon M. Lynch, Diego R. Mazzotti, Fang Han, Qing Yun Li, Thomas Penzel, Sergio Tufik, Lia Bittencourt, Thorarinn Gislason, Philip de Chazal, Bhajan Singh, Nigel McArdle, Ning-Hung Chen, Allan Pack, Richard J. Schwab, Peter A. Cistulli, Ulysses J. Magalang
Comments: 23 pages, 7 figures, 1 table, 1 supplemental information document (77 pages), and 7 ancillary files
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[420] arXiv:2312.05465 [pdf, html, other]
Title: On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin, Giho Kim, Howon Lee, Joonho Han, Insoon Yang
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[421] arXiv:2312.05479 [pdf, html, other]
Title: Exploring Sparsity in Graph Transformers
Chuang Liu, Yibing Zhan, Xueqi Ma, Liang Ding, Dapeng Tao, Jia Wu, Wenbin Hu, Bo Du
Comments: 9 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[422] arXiv:2312.05502 [pdf, html, other]
Title: Poisoning $\times$ Evasion: Symbiotic Adversarial Robustness for Graph Neural Networks
Ege Erdogan, Simon Geisler, Stephan Günnemann
Comments: NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)
Subjects: Machine Learning (cs.LG)
[423] arXiv:2312.05508 [pdf, html, other]
Title: Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation
Shiji Zhao, Ranjie Duan, Xizhe Wang, Xingxing Wei
Comments: Accepted by NeurIPS2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[424] arXiv:2312.05516 [pdf, html, other]
Title: Stateful Large Language Model Serving with Pensieve
Lingfan Yu, Jinkun Lin, Jinyang Li
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[425] arXiv:2312.05519 [pdf, html, other]
Title: Isomorphic-Consistent Variational Graph Auto-Encoders for Multi-Level Graph Representation Learning
Hanxuan Yang, Qingchao Kong, Wenji Mao
Subjects: Machine Learning (cs.LG)
[426] arXiv:2312.05526 [pdf, html, other]
Title: Reinforcement Neighborhood Selection for Unsupervised Graph Anomaly Detection
Yuanchen Bei, Sheng Zhou, Qiaoyu Tan, Hao Xu, Hao Chen, Zhao Li, Jiajun Bu
Comments: 1O pages, 7 figures, accepted by ICDM2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[427] arXiv:2312.05540 [pdf, html, other]
Title: Federated Causality Learning with Explainable Adaptive Optimization
Dezhi Yang, Xintong He, Jun Wang, Guoxian Yu, Carlotta Domeniconi, Jinglin Zhang
Comments: Accepted by the Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI2024)
Subjects: Machine Learning (cs.LG)
[428] arXiv:2312.05549 [pdf, html, other]
Title: Multi-granularity Causal Structure Learning
Jiaxuan Liang, Jun Wang, Guoxian Yu, Shuyin Xia, Guoyin Wang
Comments: Accepted by the Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI2024)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[429] arXiv:2312.05551 [pdf, html, other]
Title: Multi-dimensional Fair Federated Learning
Cong Su, Guoxian Yu, Jun Wang, Hui Li, Qingzhong Li, Han Yu
Comments: Accepted by the Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI2024)
Subjects: Machine Learning (cs.LG)
[430] arXiv:2312.05560 [pdf, html, other]
Title: Enhancing the Accuracy of Predictors of Activity Sequences of Business Processes
Muhammad Awais Ali, Marlon Dumas, Fredrik Milani
Subjects: Machine Learning (cs.LG)
[431] arXiv:2312.05568 [pdf, html, other]
Title: Sparse Variational Student-t Processes
Jian Xu, Delu Zeng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[432] arXiv:2312.05583 [pdf, other]
Title: Better Neural PDE Solvers Through Data-Free Mesh Movers
Peiyan Hu, Yue Wang, Zhi-Ming Ma
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA)
[433] arXiv:2312.05586 [pdf, html, other]
Title: Deeper Understanding of Black-box Predictions via Generalized Influence Functions
Hyeonsu Lyu, Jonggyu Jang, Sehyun Ryu, Hyun Jong Yang
Comments: 16 pages, 6 figures, and 2 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[434] arXiv:2312.05596 [pdf, other]
Title: Factorized Explainer for Graph Neural Networks
Rundong Huang, Farhad Shirani, Dongsheng Luo
Comments: AAAI 24
Subjects: Machine Learning (cs.LG)
[435] arXiv:2312.05598 [pdf, html, other]
Title: Boosting the Cross-Architecture Generalization of Dataset Distillation through an Empirical Study
Lirui Zhao, Yuxin Zhang, Fei Chao, Rongrong Ji
Subjects: Machine Learning (cs.LG)
[436] arXiv:2312.05605 [pdf, html, other]
Title: TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing
Aleksandar Terzic, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2312.05611 [pdf, html, other]
Title: Triplet Edge Attention for Algorithmic Reasoning
Yeonjoon Jung, Sungsoo Ahn
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[438] arXiv:2312.05642 [pdf, html, other]
Title: Speed Up Federated Learning in Heterogeneous Environment: A Dynamic Tiering Approach
Seyed Mahmoud Sajjadi Mohammadabadi, Syed Zawad, Feng Yan, Lei Yang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Performance (cs.PF)
[439] arXiv:2312.05657 [pdf, html, other]
Title: PerfRL: A Small Language Model Framework for Efficient Code Optimization
Shukai Duan, Nikos Kanakaris, Xiongye Xiao, Heng Ping, Chenyu Zhou, Nesreen K. Ahmed, Guixiang Ma, Mihai Capota, Theodore L. Willke, Shahin Nazarian, Paul Bogdan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Programming Languages (cs.PL); Software Engineering (cs.SE)
[440] arXiv:2312.05659 [pdf, html, other]
Title: Optimal Unbiased Randomizers for Regression with Label Differential Privacy
Ashwinkumar Badanidiyuru, Badih Ghazi, Pritish Kamath, Ravi Kumar, Ethan Leeman, Pasin Manurangsi, Avinash V Varadarajan, Chiyuan Zhang
Comments: Proceedings version to appear at NeurIPS 2023
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[441] arXiv:2312.05677 [pdf, html, other]
Title: Batched Low-Rank Adaptation of Foundation Models
Yeming Wen, Swarat Chaudhuri
Comments: 16 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[442] arXiv:2312.05693 [pdf, html, other]
Title: Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge
Xuan Shen, Peiyan Dong, Lei Lu, Zhenglun Kong, Zhengang Li, Ming Lin, Chao Wu, Yanzhi Wang
Comments: Accepted by AAAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[443] arXiv:2312.05698 [pdf, html, other]
Title: Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning
Chen Liang, Donghua Yang, Zhiyu Liang, Hongzhi Wang, Zheng Liang, Xiyang Zhang, Jianfeng Huang
Subjects: Machine Learning (cs.LG)
[444] arXiv:2312.05705 [pdf, html, other]
Title: Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC
Wu Lin, Felix Dangel, Runa Eschenhagen, Kirill Neklyudov, Agustinus Kristiadi, Richard E. Turner, Alireza Makhzani
Comments: A long version of the ICML 2024 paper, updated the text about a related work
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[445] arXiv:2312.05715 [pdf, html, other]
Title: Micro-Macro Consistency in Multiscale Modeling: Score-Based Model Assisted Sampling of Fast/Slow Dynamical Systems
Ellis R. Crabtree, Juan M. Bello-Rivas, Ioannis G. Kevrekidis
Comments: 20 pages, 9 figures
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS)
[446] arXiv:2312.05717 [pdf, html, other]
Title: Forecasting Lithium-Ion Battery Longevity with Limited Data Availability: Benchmarking Different Machine Learning Algorithms
Hudson Hilal, Pramit Saha
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[447] arXiv:2312.05720 [pdf, html, other]
Title: Beyond Gradient and Priors in Privacy Attacks: Leveraging Pooler Layer Inputs of Language Models in Federated Learning
Jianwei Li, Sheng Liu, Qi Lei
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[448] arXiv:2312.05736 [pdf, html, other]
Title: ASWT-SGNN: Adaptive Spectral Wavelet Transform-based Self-Supervised Graph Neural Network
Ruyue Liu, Rong Yin, Yong Liu, Weiping Wang
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[449] arXiv:2312.05742 [pdf, html, other]
Title: The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta, Qingfei You, Minqi Jiang, Roberta Raileanu
Comments: Published as a conference paper at ICLR 2024; First two authors contributed equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[450] arXiv:2312.05743 [pdf, html, other]
Title: Building Variable-sized Models via Learngene Pool
Boyu Shi, Shiyu Xia, Xu Yang, Haokun Chen, Zhiqiang Kou, Xin Geng
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Total of 2733 entries : 201-450 251-500 501-750 751-1000 ... 2501-2733
Showing up to 250 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack